Feat: add command-line arguments for backend parameters #86

SyedaAnshrahGillani · 2025-08-07T20:44:48Z

This pull request enhances the flexibility of the gpt_oss/generate.py script
by replacing hardcoded backend parameters with configurable command-line
arguments.

Previously, the triton and vllm backends had fixed values for context and
tensor_parallel_size, respectively. This made it difficult to adapt the
script to different models or hardware configurations without modifying the
source code.

This PR introduces two new arguments:

--context-length: Allows customization of the context length for the triton
backend (defaults to 4096).
--tensor-parallel-size: Allows customization of the tensor parallel size for
the vllm backend (defaults to 2).
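
As a rough illustration of the two arguments above, here is a minimal argparse sketch. It is not the actual diff from this PR: the `--backend` flag, the `build_parser` and `backend_kwargs` helpers, and the help strings are assumptions made for the example. Only the two flag names, their defaults, and the parameter names `context` and `tensor_parallel_size` come from the description.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with the two flags described in this PR (sketch only)."""
    parser = argparse.ArgumentParser(description="Sketch of the new backend flags")
    # Hypothetical backend selector, added so the example is self-contained.
    parser.add_argument("--backend", choices=["triton", "vllm"], default="triton")
    # Flag names and defaults below are taken from the PR description.
    parser.add_argument(
        "--context-length",
        type=int,
        default=4096,
        help="Context length for the triton backend",
    )
    parser.add_argument(
        "--tensor-parallel-size",
        type=int,
        default=2,
        help="Tensor parallel size for the vllm backend",
    )
    return parser


def backend_kwargs(args: argparse.Namespace) -> dict:
    """Hypothetical helper: pick the keyword argument relevant to the chosen backend."""
    if args.backend == "triton":
        return {"context": args.context_length}
    return {"tensor_parallel_size": args.tensor_parallel_size}


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(backend_kwargs(args))
```

argparse converts the dashes in each flag name to underscores, so the values are read as `args.context_length` and `args.tensor_parallel_size`.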

Additionally, the variable decoded_token has been renamed to token_text for
improved clarity.

With these changes, backend settings can be adjusted from the command line,
which makes it easier to experiment without editing the script.

Feat: add command-line arguments for backend parameters

10c1d9b

dkundel-openai approved these changes Aug 12, 2025

dkundel-openai merged commit 4195fb3 into openai:main Aug 12, 2025

Danztee pushed a commit to Danztee/gpt-oss that referenced this pull request Aug 12, 2025

Feat: add command-line arguments for backend parameters (openai#86)

1c80ba6
