2 parents e1245da + b0633a4 commit d31692c
python/perf-kernels/README.md
@@ -42,6 +42,7 @@ This script contains the Flash Attention kernel with the following support
 - Multi and Grouped Query attention
 - ALiBi bias
 - Matrix bias
+- Persistent kernels. Useful when sequence lengths are up to a moderate length, especially when doing causal attention.
 - Int8 quantization

 These are currently supported for the forward kernel only.
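
The added line refers to the persistent-kernel launch pattern: instead of launching one program per output tile, a fixed number of programs (roughly one per compute unit) is launched and each program loops over work tiles. The sketch below illustrates that general pattern on a simple elementwise kernel; it is not the repo's flash-attention implementation, and names such as `persistent_add_kernel` and `NUM_PROGRAMS` are illustrative assumptions.

```python
# Minimal sketch of the persistent-kernel launch pattern in Triton.
# Each program strides over tiles instead of handling exactly one tile.
import torch
import triton
import triton.language as tl


@triton.jit
def persistent_add_kernel(x_ptr, y_ptr, out_ptr, n_elements,
                          BLOCK_SIZE: tl.constexpr, NUM_PROGRAMS: tl.constexpr):
    pid = tl.program_id(0)
    num_tiles = tl.cdiv(n_elements, BLOCK_SIZE)
    # Program pid handles tiles pid, pid + NUM_PROGRAMS, pid + 2*NUM_PROGRAMS, ...
    for tile in range(pid, num_tiles, NUM_PROGRAMS):
        offsets = tile * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
        mask = offsets < n_elements
        x = tl.load(x_ptr + offsets, mask=mask)
        y = tl.load(y_ptr + offsets, mask=mask)
        tl.store(out_ptr + offsets, x + y, mask=mask)


def persistent_add(x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    out = torch.empty_like(x)
    n = x.numel()
    # Launch roughly one program per compute unit; the exact count is a tuning choice.
    num_programs = torch.cuda.get_device_properties(x.device).multi_processor_count
    persistent_add_kernel[(num_programs,)](x, y, out, n,
                                           BLOCK_SIZE=1024, NUM_PROGRAMS=num_programs)
    return out
```

With causal attention, later tiles carry less work than earlier ones; a persistent grid lets programs pick up additional tiles as they finish, which is one reason the pattern helps most in that setting.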