Skip to content

Commit 2d6a2e1

Browse files
committed
fix
1 parent 98626bf commit 2d6a2e1

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

docs/source/en/optimization/para_attn.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -121,20 +121,20 @@ print("Saving video to hunyuan_video.mp4")
121121
export_to_video(output, "hunyuan_video.mp4", fps=15)
122122
```
123123

124-
#### HunyuanVideo without FBCache
125-
126124
<video controls>
127125
<source src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/para-attn/hunyuan-video-original.mp4" type="video/mp4">
128126
Your browser does not support the video tag.
129127
</video>
130128

131-
#### HunyuanVideo with FBCache
129+
<small> HunyuanVideo without FBCache </small>
132130

133131
<video controls>
134132
<source src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/para-attn/hunyuan-video-fbc.mp4" type="video/mp4">
135133
Your browser does not support the video tag.
136134
</video>
137135

136+
<small> HunyuanVideo with FBCache </small>
137+
138138
We observe that the first block cache is very effective in speeding up the inference, and maintaining nearly no quality loss in the generated video.
139139
Now, on one single NVIDIA L20 GPU, we can generate 129 frames with 720p resolution in 30 inference steps in 2271.06 seconds. This is a 1.62x speedup compared to the baseline.
140140

0 commit comments

Comments
 (0)