Update on the development branch #2503
kaiyux
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
The TensorRT-LLM team is pleased to announce that we have pushed an update to the development branch (and the Triton backend) this Nov 26, 2024.
This update includes:
examples/sdxl/README.md
. Thanks for the contribution from @Zars19 in Support SDXL and its distributed inference #1514.max_num_tokens
dynamic tuning feature, it can be enabled by setting--enable_max_num_tokens_tuning
togptManagerBenchmark
.Thanks,
The TensorRT-LLM Engineering Team
Beta Was this translation helpful? Give feedback.
All reactions