-
Notifications
You must be signed in to change notification settings - Fork 207
Open
Labels
community-requestfeatureNew capabilities, enhancements, or enablement workNew capabilities, enhancements, or enablement work
Description
Is your feature request related to a problem? Please describe.
I am interested in pretraining a model from scratch using the GPT-OSS architecture via Megatron-Bridge.
However, I couldn't find a corresponding configuration or script for GPT-OSS in the current examples/recipes/ directory. Could you please provide a pretrain_gpt_oss.py script (or a similar recipe) to support training this architecture from scratch?
Describe the solution you'd like
Provide a pretrain_gpt_oss.py script (or a similar recipe) to support training GPT-OSS architecture.
Describe alternatives you've considered
Additional context
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
community-requestfeatureNew capabilities, enhancements, or enablement workNew capabilities, enhancements, or enablement work