Skip to content

Commit 03abeda

Browse files
chengzeyistevhliu
andauthored
Update docs/source/en/optimization/para_attn.md
Co-authored-by: Steven Liu <[email protected]>
1 parent 2d6a2e1 commit 03abeda

File tree

1 file changed

+1
-2
lines changed

1 file changed

+1
-2
lines changed

docs/source/en/optimization/para_attn.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -135,8 +135,7 @@ export_to_video(output, "hunyuan_video.mp4", fps=15)
135135

136136
<small> HunyuanVideo with FBCache </small>
137137

138-
We observe that the first block cache is very effective in speeding up the inference, and maintaining nearly no quality loss in the generated video.
139-
Now, on one single NVIDIA L20 GPU, we can generate 129 frames with 720p resolution in 30 inference steps in 2271.06 seconds. This is a 1.62x speedup compared to the baseline.
138+
First Block Cache reduced the inference speed to 2271.06 seconds compared to the baseline, or 1.62x faster, while maintaining nearly zero quality loss.
140139

141140
</hfoption>
142141
</hfoptions>

0 commit comments

Comments
 (0)