Skip to content

Commit 8a71b9c

Browse files
committed
README: cuDNN patch is optional, not required
1 parent 729ed09 commit 8a71b9c

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -112,9 +112,9 @@ For cuDNN-accelerated ConvTranspose1d (faster VAE decode):
112112
cargo build --release --features cudnn
113113
```
114114

115-
Requires a [candle fork](https://github.com/Marenz/candle/tree/fast-conv-transpose1d-no-cudnn) with two upstream PRs:
116-
- [cuDNN ConvTranspose1d](https://github.com/huggingface/candle/pull/3383) — 100x faster VAE decode vs the default CPU fallback kernel
115+
Requires a [candle fork](https://github.com/Marenz/candle/tree/fast-conv-transpose1d-no-cudnn) with the following upstream PRs:
117116
- [public `Model::clear_kv_cache` for Qwen3](https://github.com/huggingface/candle/pull/3381) — needed to reset KV state between inference calls
117+
- [cuDNN ConvTranspose1d](https://github.com/huggingface/candle/pull/3383) (optional) — 100x faster VAE decode vs the default CPU fallback kernel
118118

119119
Depending on your system, you may need additional environment variables for the CUDA build — see [AGENTS.md](AGENTS.md) for platform-specific notes.
120120

0 commit comments

Comments
 (0)