You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[TRTLLM-9689][feat] Introduce max_topk_logprobs parameter and enhance logprobs handling
- Added max_topk_logprobs parameter to AutoDeployConfig and LlmRequest to control the number of top-k logprobs storable for each token.
- Updated TorchSampler to accommodate max_topk_logprobs in logprobs processing and storage.
- Enhanced logprobs handling in the sampling process to support both sampled and top-k logprobs.
- Enabled batched processing of logprobs to enhance logprobs performance
- Modified tests to validate the new max_topk_logprobs functionality and ensure correct logprobs output.
Signed-off-by: Stefan <[email protected]>
0 commit comments