
Conversation

markurtz
Collaborator

Summary

Refactors the GuideLLM command-line interface, streamlining the benchmark command structure, adding new mock server functionality and performance optimization features, and folding in outstanding fixes from other PRs to stabilize the refactor to a working state.

Details

  • CLI Interface Overhaul:
    • Removed legacy `--scenario` option in favor of direct parameter specification
    • Reorganized CLI options with clear grouping (Backend, Data, Output, Aggregators, Constraints)
    • Added parameter aliases for backward compatibility (e.g., `--rate-type` → `--profile`)
    • Simplified option defaults by removing scenario-based defaults
    • Added comprehensive docstrings and help text for all commands and options
  • New Mock Server Command:
    • Added guidellm mock-server command with full OpenAI/vLLM API compatibility
    • Configurable latency characteristics (request latency, TTFT, ITL, output tokens)
    • Support for both streaming and non-streaming endpoints
    • Comprehensive server configuration options (host, port, workers, model name)
  • Performance Optimization Features:
    • Added new perf optional dependency group with orjson, msgpack, msgspec, and uvloop
    • Integrated uvloop for enhanced async performance when available
    • Optimized event loop policy selection based on availability
  • Internal Architecture Improvements:
    • Updated import paths (guidellm.backend → guidellm.backends, guidellm.scheduler.strategy → guidellm.scheduler)
    • Replaced scenario-based benchmarking with direct benchmark_generative_text function calls
    • Enhanced error handling and parameter validation
    • Simplified logging format for better readability
  • Enhanced Output and Configuration:
    • Added support for multiple output formats with the `--output-formats` option
    • Improved output path handling for files vs directories
    • Added new constraint options (`--max-errors`, `--max-error-rate`, `--max-global-error-rate`)
    • Enhanced warmup/cooldown specification with flexible numeric/percentage options
  • Code Quality Improvements:
    • Comprehensive type annotations throughout the codebase
    • Detailed docstrings following Google/NumPy style conventions
    • Consistent parameter naming and organization
    • Removed deprecated version option from main CLI group
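
The uvloop integration described under Performance Optimization Features can be sketched as a small fallback shim. This is an illustrative guess at the approach, not GuideLLM's actual code; the function name is invented:

```python
import asyncio


def configure_event_loop_policy() -> str:
    """Prefer uvloop's event loop policy when the optional perf extras
    are installed; fall back to asyncio's default otherwise.

    Hypothetical sketch; GuideLLM's actual selection logic may differ.
    """
    try:
        import uvloop  # provided by the optional perf dependency group
    except ImportError:
        return "asyncio"  # uvloop unavailable: keep the stdlib default
    asyncio.set_event_loop_policy(uvloop.EventLoopPolicy())
    return "uvloop"
```

With the perf extras installed, subsequent `asyncio.run(...)` calls would then pick up uvloop's faster event loop transparently, with no behavior change when the extras are absent.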

Test Plan

  • Tests for entrypoints to be added later

Related Issues

  • Part of the larger scheduler refactor initiative

  • "I certify that all code in this PR is my own, except as noted below."

Use of AI

  • Includes AI-assisted code completion
  • Includes code generated by an AI application
  • Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes ## WRITTEN BY AI ##)

@Copilot Copilot AI (Contributor) left a comment

Pull Request Overview

This PR refactors the GuideLLM command-line interface to streamline the benchmark command structure while adding mock server functionality and performance optimization features. The changes modernize the CLI interface by removing legacy scenario-based configuration in favor of direct parameter specification and introduce new capabilities for testing and development.

Key changes:

  • Replaced scenario-based CLI configuration with direct parameter specification using reorganized option groups
  • Added new mock server command with configurable OpenAI/vLLM API compatibility
  • Integrated performance optimizations through optional uvloop support and new perf dependency group
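
The OpenAI-compatible mock server can be illustrated with a toy stand-in built on the stdlib. This is only a sketch of the idea, far less capable than `guidellm mock-server` (no streaming, no configurable latency); all names here are invented for illustration:

```python
import json
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer


class MockChatHandler(BaseHTTPRequestHandler):
    """Toy stand-in for an OpenAI-compatible /v1/chat/completions endpoint."""

    def do_POST(self):
        if self.path != "/v1/chat/completions":
            self.send_error(404)
            return
        # Return a minimal, fixed chat.completion payload
        body = json.dumps({
            "object": "chat.completion",
            "choices": [{"message": {"role": "assistant", "content": "ok"}}],
        }).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        pass  # silence per-request logging


def start_mock_server(port: int = 0) -> HTTPServer:
    """Serve the mock endpoint on a background thread (port 0 = ephemeral)."""
    server = HTTPServer(("127.0.0.1", port), MockChatHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server
```

A benchmark client pointed at this endpoint gets deterministic responses, which is the core of what a mock server provides for testing and development.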

Reviewed Changes

Copilot reviewed 19 out of 21 changed files in this pull request and generated 5 comments.

Summary per file:

  • src/guidellm/main.py: Complete CLI overhaul with new parameter structure, mock server command, and uvloop integration
  • src/guidellm/settings.py: Updated multiprocessing and scheduler settings with new configuration options
  • tests/unit/test_cli.py: Removed legacy CLI version flag tests
  • tests/unit/conftest.py: Removed old mock fixtures and test utilities
  • tests/unit/mock_*: Updated mock implementations for new architecture
  • tests/integration/scheduler/: Added integration tests for scheduler components
  • src/guidellm/utils/typing.py: New utility for extracting literal values from type aliases
  • pyproject.toml: Added perf optional dependency group
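
The typing utility listed above is described as extracting literal values from type aliases; a minimal version using only the stdlib could look like the following sketch (the alias name, its values, and the function name are made-up examples, not GuideLLM's actual API):

```python
from typing import Literal, get_args

# Assumed example alias; GuideLLM's actual aliases may differ.
OutputFormatAlias = Literal["console", "json", "yaml"]


def extract_literal_values(alias) -> tuple:
    """Return the literal values carried by a Literal type alias.

    get_args(Literal["a", "b"]) yields ("a", "b"), so a CLI can derive
    its valid choices directly from the type annotation.
    """
    return get_args(alias)
```

Deriving CLI choices from the type alias keeps the annotation and the accepted option values in sync from a single definition.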


sjmonson added a commit that referenced this pull request Sep 23, 2025
…features/refactor/base-draft

[GuideLLM Refactor] entrypoints and working state (base to create PRs off of til merged into refactor base) #358
@sjmonson sjmonson (Collaborator) left a comment

See comment, otherwise looks good.

jaredoconnell and others added 2 commits September 24, 2025 17:31
## Summary

This PR ports the new functionality from `benchmark run` to `benchmark
from-file`, and does so in a way that reuses as much code as practical
to have one source of truth.

## Details

- Fixes `from-file` by making it use the new output format.
- Moves code related to the new output formats to separate functions
that are called from both benchmark entrypoints.
- Moves additional chunks of code out of the large benchmark run
entrypoint function for modularity.

## Test Plan

Run a benchmark with an output of json or yaml, and use `from-file` to
re-import it and export it. You can select any output type supported by
`benchmark run`.

`guidellm benchmark from-file ./result.json --output-formats console`
`guidellm benchmark from-file ./result.yaml --output-formats yaml`
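
The single-source-of-truth refactor described above, where both `benchmark run` and `benchmark from-file` call the same output functions, might be shaped roughly like this (all names here are hypothetical, and only JSON/console formats are sketched to stay self-contained):

```python
import json
from pathlib import Path


def export_report(report: dict, formats: list[str], out_dir: Path) -> list[Path]:
    """Hypothetical shared exporter callable from both benchmark entrypoints.

    Each entrypoint builds (or re-imports) a report, then delegates all
    output-format handling here so the logic lives in one place.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    written: list[Path] = []
    for fmt in formats:
        if fmt == "json":
            path = out_dir / "result.json"
            path.write_text(json.dumps(report, indent=2))
            written.append(path)
        elif fmt == "console":
            # Pretty-print to stdout instead of writing a file
            print(json.dumps(report, indent=2))
    return written
```

Under this shape, `from-file` only needs to parse an existing result file into a report object before calling the same exporter that `benchmark run` uses.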

## Related Issues

---

- [x] "I certify that all code in this PR is my own, except as noted
below."

## Use of AI

- [x] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a
docstring that includes `## WRITTEN BY AI ##`)

---------

Signed-off-by: Jared O'Connell <[email protected]>
## Summary

Reintroduces a few changes from main

---------

Signed-off-by: Samuel Monson <[email protected]>
sjmonson added a commit that referenced this pull request Sep 25, 2025
… off of til merged into refactor base) #358

features/refactor/working
Base automatically changed from features/refactor/mock-server to features/refactor/base September 29, 2025 14:19
@jaredoconnell jaredoconnell (Collaborator) left a comment

I gave this a skim and didn't notice any blockers. Some of these changes will be changed in the scenarios pull request, but that's okay.

@markurtz markurtz merged commit 47088f9 into features/refactor/base Sep 29, 2025
7 of 17 checks passed
@markurtz markurtz deleted the features/refactor/working branch September 29, 2025 16:37
@markurtz markurtz added this to the v0.4.0 milestone Oct 1, 2025