@tukwila (Contributor) commented Sep 12, 2025

Summary

Fix bug: #115

The error output, for reference:

Traceback (most recent call last):
  File "/Users/guangli.bao/go/guidellm/src/guidellm/benchmark/profile.py", line 318, in next_strategy
    raise ValueError(
ValueError: Rate array contains zero values, which may cause division by zero or invalid calculations.
Debug info: {'min_rate': 0.0, 'max_rate': 0.0, 'sweep_size': 10, 'generated_rates': [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0,
0.0], 'zero_count': np.int64(8), 'zero_positions': [0, 1, 2, 3, 4, 5, 6, 7]}
Please adjust parameter range to avoid zero values.

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/Users/guangli.bao/miniforge3/envs/myenv/bin/guidellm", line 8, in <module>
    sys.exit(cli())
             ^^^^^
  File "/Users/guangli.bao/miniforge3/envs/myenv/lib/python3.11/site-packages/click/core.py", line 1161, in __call__
    return self.main(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/guangli.bao/miniforge3/envs/myenv/lib/python3.11/site-packages/click/core.py", line 1082, in main
    rv = self.invoke(ctx)
         ^^^^^^^^^^^^^^^^
  File "/Users/guangli.bao/miniforge3/envs/myenv/lib/python3.11/site-packages/click/core.py", line 1697, in invoke
    return _process_result(sub_ctx.command.invoke(sub_ctx))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/guangli.bao/miniforge3/envs/myenv/lib/python3.11/site-packages/click/core.py", line 1697, in invoke
    return _process_result(sub_ctx.command.invoke(sub_ctx))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/guangli.bao/miniforge3/envs/myenv/lib/python3.11/site-packages/click/core.py", line 1443, in invoke
    return ctx.invoke(self.callback, **ctx.params)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/guangli.bao/miniforge3/envs/myenv/lib/python3.11/site-packages/click/core.py", line 788, in invoke
    return __callback(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/guangli.bao/go/guidellm/src/guidellm/__main__.py", line 314, in run
    asyncio.run(
  File "/Users/guangli.bao/miniforge3/envs/myenv/lib/python3.11/asyncio/runners.py", line 190, in run
    return runner.run(main)
           ^^^^^^^^^^^^^^^^
  File "/Users/guangli.bao/miniforge3/envs/myenv/lib/python3.11/asyncio/runners.py", line 118, in run
    return self._loop.run_until_complete(task)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/guangli.bao/miniforge3/envs/myenv/lib/python3.11/asyncio/base_events.py", line 653, in run_until_complete
    return future.result()
           ^^^^^^^^^^^^^^^
  File "/Users/guangli.bao/go/guidellm/src/guidellm/benchmark/entrypoints.py", line 29, in benchmark_with_scenario
    return await benchmark_generative_text(**vars(scenario), **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/guangli.bao/go/guidellm/src/guidellm/benchmark/entrypoints.py", line 118, in benchmark_generative_text
    async for result in benchmarker.run(
  File "/Users/guangli.bao/go/guidellm/src/guidellm/benchmark/benchmarker.py", line 185, in run
    while scheduling_strategy := profile.next_strategy():
                                 ^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/guangli.bao/go/guidellm/src/guidellm/benchmark/profile.py", line 325, in next_strategy
    raise ValueError("Unable to generate valid rates parameters. Please check your input range.") from e
ValueError: Unable to generate valid rates parameters. Please check your input range.
╭─ Benchmarks ───────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ [14:26:… 1… synchronous (comple… Req:    0.0 req/s,    0.00s Lat,     0.0 Conc,       0 Comp,        1 Inc,        0 … │
│                                  Tok:    0.0 gen/s,    0.0 tot/s,   0.0ms TTFT,    0.0ms ITL,     0 Prompt,        0 … │
│ [14:26:… 1… throughput  (comple… Req:    0.0 req/s,    0.00s Lat,     0.0 Conc,       0 Comp,        1 Inc,        0 … │
│                                  Tok:    0.0 gen/s,    0.0 tot/s,   0.0ms TTFT,    0.0ms ITL,     0 Prompt,        0 … │
│ [--:--:…    constant@#… (pendin…                                                                                       │
│                                                                                                                        │
│ [--:--:…    constant@#… (pendin…                                                                                       │
│                                                                                                                        │
│ [--:--:…    constant@#… (pendin…                                                                                       │
│                                                                                                                        │
│ [--:--:…    constant@#… (pendin…                                                                                       │
│                                                                                                                        │
│ [--:--:…    constant@#… (pendin…                                                                                       │
│                                                                                                                        │
│ [--:--:…    constant@#… (pendin…                                                                                       │
│                                                                                                                        │
│ [--:--:…    constant@#… (pendin…                                                                                       │
│                                                                                                                        │
│ [--:--:…    constant@#… (pendin…                                                                                       │
│                                                                                                                        │
╰────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
Generating... ━━━━━━━━━━━━━━━╸━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ (2/10) [ 0:00:15 < 0:01:00 ]
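
The failure mode in the output above can be sketched as follows. This is only an illustration under stated assumptions, not the code in this PR: `generate_sweep_rates` and `next_sweep_rates` are hypothetical names, and the `np.linspace(...)[1:]` spacing is assumed because it matches the debug info (a `sweep_size` of 10 yielding 8 generated rates). The error messages are copied from the traceback. When both the synchronous and throughput runs report 0.0 req/s, every generated rate is zero, so the check raises and the caller re-raises with a shorter message.

```python
import numpy as np


def generate_sweep_rates(min_rate: float, max_rate: float, sweep_size: int) -> list[float]:
    """Hypothetical helper: evenly spaced rates between the measured bounds."""
    # Assumption: two sweep slots belong to the synchronous and throughput
    # runs, so sweep_size=10 leaves 8 constant-rate runs.
    rates = np.linspace(min_rate, max_rate, sweep_size - 1)[1:]
    zero_positions = np.flatnonzero(rates == 0).tolist()
    if zero_positions:
        debug_info = {
            "min_rate": min_rate,
            "max_rate": max_rate,
            "sweep_size": sweep_size,
            "generated_rates": rates.tolist(),
            "zero_count": len(zero_positions),
            "zero_positions": zero_positions,
        }
        raise ValueError(
            "Rate array contains zero values, which may cause division by zero "
            f"or invalid calculations.\nDebug info: {debug_info}\n"
            "Please adjust parameter range to avoid zero values."
        )
    return rates.tolist()


def next_sweep_rates(min_rate: float, max_rate: float, sweep_size: int) -> list[float]:
    """Hypothetical caller: chain the detailed error behind a short message."""
    try:
        return generate_sweep_rates(min_rate, max_rate, sweep_size)
    except ValueError as e:
        raise ValueError(
            "Unable to generate valid rates parameters. Please check your input range."
        ) from e
```

In this sketch, calling `next_sweep_rates(0.0, 0.0, 10)` raises the chained `ValueError` pair seen in the traceback, while non-zero bounds such as `next_sweep_rates(1.0, 9.0, 10)` return eight increasing rates.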

Details

  • [ ]

Test Plan

guidellm benchmark \
  --target "https://chat.d.run//" \
  --model "public/qwen2.5-72b-awq" \
  --processor "/Users/guangli.bao/temp/Qwen3-0.6B" \
  --rate-type sweep \
  --max-seconds 1 \
  --max-requests 1 \
  --data "prompt_tokens=128,output_tokens=56"

Related Issues


  • "I certify that all code in this PR is my own, except as noted below."

Use of AI

  • Includes AI-assisted code completion
  • Includes code generated by an AI application
  • Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes ## WRITTEN BY AI ##)

@markurtz (Collaborator) commented Oct 1, 2025

Thanks for your contribution, @tukwila. Could you take a look at the refactor branch and adapt this on top of it? It would be great to include there! #351
