Update on the development branch #2334
kaiyux
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
The TensorRT-LLM team is pleased to announce that we have pushed an update to the development branch (and the Triton backend) this Oct 15, 2024.
This update includes:
examples/commandr/README.md
.examples/falcon/README.md
, thanks to the contribution from @puneeshkhanna in Add support for falcon2 #1926.LLM
API for Mamba2.examples/exaone/README.md
.examples/medusa/README.md
.gptManagerBenchmark
tool.builder_force_num_profiles
intrtllm-build
command to env var.use_fused_mlp
when constructingBuildConfig
from dict, thanks for the fix from @ethnzhng in Include use_fused_mlp when constructing BuildConfig from dict #2081.numNewTokensCumSum
. ([Bug] Lookahead decoding is nondeterministic and wrong after the first call to runner.generate #2263)Thanks,
The TensorRT-LLM Engineering Team
Beta Was this translation helpful? Give feedback.
All reactions