File tree Expand file tree Collapse file tree 1 file changed +2
-2
lines changed
Expand file tree Collapse file tree 1 file changed +2
-2
lines changed Original file line number Diff line number Diff line change @@ -112,9 +112,9 @@ For cuDNN-accelerated ConvTranspose1d (faster VAE decode):
112112cargo build --release --features cudnn
113113```
114114
115- Requires a [ candle fork] ( https://github.com/Marenz/candle/tree/fast-conv-transpose1d-no-cudnn ) with two upstream PRs:
116- - [ cuDNN ConvTranspose1d] ( https://github.com/huggingface/candle/pull/3383 ) — 100x faster VAE decode vs the default CPU fallback kernel
115+ Requires a [ candle fork] ( https://github.com/Marenz/candle/tree/fast-conv-transpose1d-no-cudnn ) with the following upstream PRs:
117116- [ public ` Model::clear_kv_cache ` for Qwen3] ( https://github.com/huggingface/candle/pull/3381 ) — needed to reset KV state between inference calls
117+ - [ cuDNN ConvTranspose1d] ( https://github.com/huggingface/candle/pull/3383 ) (optional) — 100x faster VAE decode vs the default CPU fallback kernel
118118
119119Depending on your system, you may need additional environment variables for the CUDA build — see [ AGENTS.md] ( AGENTS.md ) for platform-specific notes.
120120
You can’t perform that action at this time.
0 commit comments