Skip to content

Conversation

@mayabar
Copy link
Collaborator

@mayabar mayabar commented Oct 27, 2025

Problem: when random mode is used in the simulator and max-tokens=5 in request, from time to time response contains only one token. In this case tpot metrics are never reported, and test fails.

Solution: use echo mode, this ensures that number of tokens in response is not one.

Fixes #227

@mayabar mayabar requested a review from irar2 October 27, 2025 12:30
@irar2
Copy link
Collaborator

irar2 commented Oct 28, 2025

/lgtm
/approve

@github-actions github-actions bot added the lgtm label Oct 28, 2025
@github-actions github-actions bot merged commit 7e913bc into llm-d:main Oct 28, 2025
4 checks passed
@mayabar mayabar deleted the fix-ttft-tpot-test branch October 29, 2025 13:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Fix occasional ttft and tpot metrics test failures

2 participants