Skip to content

Support for GPT-OSS pretraining script in Megatron-Bridge #1944

@fengxy-03

Description

@fengxy-03

Is your feature request related to a problem? Please describe.
I am interested in pretraining a model from scratch using the GPT-OSS architecture via Megatron-Bridge.
However, I couldn't find a corresponding configuration or script for GPT-OSS in the current examples/recipes/ directory. Could you please provide a pretrain_gpt_oss.py script (or a similar recipe) to support training this architecture from scratch?

Describe the solution you'd like
Provide a pretrain_gpt_oss.py script (or a similar recipe) to support training GPT-OSS architecture.

Describe alternatives you've considered

Additional context

Metadata

Metadata

Assignees

Labels

community-requestfeatureNew capabilities, enhancements, or enablement work

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions