Update on the development branch #1125
kaiyux
announced in
Announcements
Replies: 3 comments 9 replies
-
What does it mean |
Beta Was this translation helpful? Give feedback.
3 replies
-
Could you share some use cases of the weightless engine, please? |
Beta Was this translation helpful? Give feedback.
5 replies
-
Could you share if there has been any update on the 800 issue regarding triton backend for enc_dec models? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
The TensorRT-LLM team is pleased to announce that we are pushing an update to the development branch (and the Triton backend) this February 21, 2024.
This update includes:
encoder_input_len_range
should not be 0, thanks to the contribution from @Eddie-Wang1120 in Fix enc_dec bug and Make several improvements to whisper #992gptDecoderBatch
to support batched samplinggptManagerBenchmark
Thanks,
The TensorRT-LLM Engineering Team
Beta Was this translation helpful? Give feedback.
All reactions