diff --git a/docs/source/en/optimization/para_attn.md b/docs/source/en/optimization/para_attn.md index b1b111045590..94b0d5ce3af4 100644 --- a/docs/source/en/optimization/para_attn.md +++ b/docs/source/en/optimization/para_attn.md @@ -29,7 +29,7 @@ However, it is hard to decide when to reuse the cache to ensure quality generate This achieves a 2x speedup on FLUX.1-dev and HunyuanVideo inference with very good quality.
- Cache in Diffusion Transformer + Cache in Diffusion Transformer
How AdaCache works, First Block Cache is a variant of it