Skip to content

Conversation

@mayabar
Copy link
Collaborator

@mayabar mayabar commented Aug 31, 2025

Fixes #168

@mayabar mayabar requested review from irar2 and shmuelk August 31, 2025 10:42
start := int(bucketSize*float64(bucketIndex)) + 1
// end is the maximum number in the required bucket
end := int(bucketSize * float64(bucketIndex+1))
// bucketSize := float64(maxTokens-1) / float64(len(cumulativeBucketsProbabilities)-1)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Remove commented out code

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

removed

Signed-off-by: Maya Barnea <[email protected]>
@mayabar mayabar requested a review from shmuelk August 31, 2025 12:00
@shmuelk
Copy link
Collaborator

shmuelk commented Aug 31, 2025

/lgtm

/approve

@github-actions github-actions bot added the lgtm label Aug 31, 2025
@github-actions github-actions bot merged commit b7d5873 into llm-d:main Aug 31, 2025
4 checks passed
@mayabar mayabar deleted the max-tokens-buckets branch October 29, 2025 13:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Enhance calculation of tokens number in response based on request's max_tokens parameter - histogram should be adapted to the max_tokens

2 participants