Option to re-display a benchmark file #185
Conversation
The .gitignore is set up to ignore these files by default.
📦 Build Artifacts Available
markurtz
left a comment
Thanks, @jaredoconnell; overall, the code looks good. I want to expand the functionality to encompass not only displaying the results but also re-exporting them to a new file if the user desires.
Given that, I'd recommend something along the lines of the following for the CLI:
`guidellm benchmark report PATH --output OUTPUT`, where the output is optional. If an output file is supplied, we will re-save the report to that file path, using the extension as the file type. It could also potentially be named `export`, `convert`, or anything along those lines.
For the benchmark CLI pathways, it would then look like the following:
`guidellm benchmark run ...`
`guidellm benchmark report ...`
And if that ACTION after `benchmark` is not supplied, we will default to filling it in as `run`. This way, we namespace all of the commands under `benchmark` and add flexibility for the future.
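The extension-based re-save step this proposal implies can be sketched as follows (a minimal sketch, not the actual guidellm implementation; the function name and serializer choices are assumptions):

```python
import json
from pathlib import Path


def resave_report(report: dict, output: str) -> str:
    """Re-save a loaded report, picking the format from the file
    extension, as proposed for
    `guidellm benchmark report PATH --output OUTPUT`."""
    suffix = Path(output).suffix.lower()
    if suffix == ".json":
        Path(output).write_text(json.dumps(report, indent=2))
    elif suffix in (".yaml", ".yml"):
        import yaml  # optional dependency in this sketch

        Path(output).write_text(yaml.safe_dump(report))
    else:
        raise ValueError(f"Unsupported report extension: {suffix!r}")
    return suffix
```

Unknown extensions fail loudly rather than silently defaulting to one format, which matches the unsupported-extension error this PR updates.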
This is ready for re-review. Since the last reviews, there is now an option to re-export the benchmarks, and the command format was changed. The default-command feature isn't supported by Click, so I am using a class to handle the new behavior. I included some external code for the default command rather than writing it myself because I found a lot of edge cases that broke the functionality. Should I document this command as Step 6 in the quick start?
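Click has no built-in default-subcommand support, so a handler class has to rewrite the argument list before dispatch. The core resolution step, stripped of Click entirely, looks roughly like this (names are illustrative; the vendored `DefaultGroupHandler` handles more edge cases than shown):

```python
def resolve_default_command(args, known_commands, default="run"):
    """Prepend the default subcommand when the first token is not a
    known command, so `guidellm benchmark --rate 10` behaves like
    `guidellm benchmark run --rate 10`."""
    # Group-level flags like --help must still reach the group itself;
    # cases like this are why a hand-rolled version breaks easily.
    if args and args[0] in ("--help", "--version"):
        return list(args)
    if not args or args[0] not in known_commands:
        return [default, *args]
    return list(args)
```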
The CI errors appear to be from using an older commit's code. That's very odd.
Pull Request Overview
Adds a command and related infrastructure to load and display an existing benchmark report via the CLI.
- Introduces `reimport_benchmarks_report` and wires it into a new `guidellm benchmark from-file` subcommand
- Adds `print_full_report` to `GenerativeBenchmarksConsole` and updates the main `benchmark` group to use `DefaultGroupHandler`
- Includes unit tests/assets for re-display behavior and updates docs to reference the new `run` subcommand
Reviewed Changes
Copilot reviewed 14 out of 16 changed files in this pull request and generated no comments.
| File | Description |
|---|---|
| `src/guidellm/benchmark/entrypoints.py` | Added `reimport_benchmarks_report` and simplified console calls |
| `src/guidellm/benchmark/output.py` | Added `print_full_report`, updated unsupported-extension error |
| `src/guidellm/main.py` | Converted `benchmark` to a subcommand group with `run` & `from-file` |
| `src/guidellm/utils/default_group.py` | Added `DefaultGroupHandler` to support default subcommands |
| `tests/unit/entrypoints/.../test_benchmark_from_file_entrypoint.py` | New tests for display and re-export |
| `docs/outputs.md` | Updated examples to use `benchmark run` |
| `docs/datasets.md` | Updated examples to use `benchmark run` |
| `README.md` | Updated examples to use `benchmark run` |
| `.pre-commit-config.yaml` | Excluded test assets from formatting rules |
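Taken together, the `from-file` flow in the table above amounts to: load the saved report, print it to the console, and re-export it if an output path was given. A hypothetical sketch of that entrypoint (the real function signature, report schema, and console class in guidellm differ):

```python
import json
from pathlib import Path
from typing import Optional


def reimport_benchmarks_report(path: str, output_path: Optional[str] = None) -> dict:
    """Load a previously saved benchmarks report, display a summary,
    and optionally re-save it to a new location."""
    report = json.loads(Path(path).read_text())
    # Stand-in for GenerativeBenchmarksConsole.print_full_report().
    print(f"Loaded {len(report.get('benchmarks', []))} benchmark(s) from {path}")
    if output_path is not None:
        Path(output_path).write_text(json.dumps(report, indent=2))
    return report
```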
Comments suppressed due to low confidence (3)
src/guidellm/main.py:36
- To automatically execute `run` when users invoke `guidellm benchmark` without any args, add `default_if_no_args=True` to the `DefaultGroupHandler` parameters.
src/guidellm/benchmark/entrypoints.py:154
- [nitpick] This docstring ends abruptly at "Can also specify". Please complete it to explain what additional options are available when re-importing a report.
existing benchmarks report. Can also specify
docs/outputs.md:3
- [nitpick] The new `benchmark from-file` command isn't mentioned here. Consider adding a section with usage examples for `guidellm benchmark from-file` to show how to re-display existing reports.
GuideLLM provides flexible options for outputting benchmark results, catering to both console-based summaries and file-based detailed reports. This document outlines the supported output types, their configurations, and how to utilize them effectively.
closes #175

This adds a command to re-display a prior benchmarks file in the CLI.

Before marking this as ready for review, we need to decide what command format we want to use. During the call with Mark, we discussed this being an option within the benchmark command.

Also, let me know if the stripped-down results file is a good one to use. I manually removed data from a large results file.

Co-authored-by: Samuel Monson <[email protected]>
Co-authored-by: Mark Kurtz <[email protected]>
Signed-off-by: dalthecow <[email protected]>