
@yiakwy-xpu-ml-framework-team yiakwy-xpu-ml-framework-team commented Aug 10, 2025

Add bf16 SFT to mxfp4 conversion

Currently the model (gpt-oss-120b) can run either in the bf16 data type (group-wise fp8 is also possible) on H800/H100, or in the mxfp4 data type on Blackwell.

After SFT-ing the GPT-OSS model and injecting the new identity, we need to convert the model back to MXFP4 to reduce the model size when loading weights from HBM with 4-bit I/O (on H800, the model is converted back to bf16 at runtime).
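For reference, here is a minimal NumPy sketch of the MXFP4 scheme being targeted: 32-element blocks sharing a power-of-two (E8M0) scale, with each element rounded to the nearest E2M1 value. This is an illustration of the format, not the actual conversion script; the scale choice (`floor(log2(amax)) - 2`, so the block maximum lands near the E2M1 maximum of 6.0) and round-to-nearest-value policy are assumptions, and real code would also pack two 4-bit codes per byte and accept bf16 inputs.

```python
import numpy as np

# The 16 representable E2M1 values, indexed by their 4-bit code.
FP4_VALUES = np.array([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0,
                       -0.0, -0.5, -1.0, -1.5, -2.0, -3.0, -4.0, -6.0])

def bf16_to_mxfp4(w, block=32):
    """Quantize a 1-D float array (length a multiple of `block`) to MXFP4:
    per-block power-of-two scale exponents plus 4-bit E2M1 codes."""
    w = w.reshape(-1, block)
    amax = np.abs(w).max(axis=1, keepdims=True)
    # E8M0-style scale: power of two chosen so the block max maps near 6.0,
    # the largest E2M1 magnitude (2**2 <= 6.0 < 2**3).
    exp = np.floor(np.log2(np.maximum(amax, 1e-38))) - 2
    scaled = w / (2.0 ** exp)
    # Round each scaled element to the nearest representable E2M1 value.
    idx = np.abs(scaled[..., None] - FP4_VALUES).argmin(axis=-1)
    return idx.astype(np.uint8), exp.astype(np.int8)

def mxfp4_to_bf16(idx, exp, shape):
    """Dequantize: look up the E2M1 value and reapply the block scale."""
    return (FP4_VALUES[idx] * (2.0 ** exp)).reshape(shape)
```

Values that are exactly representable after scaling survive a round trip unchanged, which is what makes the par-to-par weight comparison below meaningful.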

Verification of correctness

We checked the model end-to-end and compared the fp4 weight values pair by pair:

bf16_to_mxfp4
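As a sketch of what such a pair-wise check could look like (the dict-of-arrays interface and tolerance are illustrative, not the actual verification code), comparing every tensor by name, shape, and value catches exactly the kind of mismatch reported further down in this thread:

```python
import numpy as np

def compare_state_dicts(ref, out, atol=0.0):
    """Pair-wise comparison of two weight dicts {name: np.ndarray}.
    Returns a list of (name, reason) mismatches; empty means identical."""
    mismatches = []
    for name in sorted(set(ref) | set(out)):
        if name not in ref or name not in out:
            mismatches.append((name, "missing"))
        elif ref[name].shape != out[name].shape:
            mismatches.append((name, f"shape {ref[name].shape} vs {out[name].shape}"))
        elif not np.allclose(ref[name], out[name], atol=atol):
            mismatches.append((name, "values differ"))
    return mismatches
```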

@dkundel-openai
Collaborator

Hey @yiakwy-xpu-ml-framework-team, thanks for your contribution. We'd recommend putting this script alongside the tool you are using for SFT. I'll close this PR, but thank you for your contribution.

@yiakwy-xpu-ml-framework-team
Author

Hi @dkundel-openai, thanks for the feedback. I didn't use any other tools. Maybe I should add it to Hugging Face?

I just think it is natural for OpenAI itself to support converting bf16 back to mxfp4. Feel free to reopen this PR if anyone wants it back.

@liuqianchao

liuqianchao commented Aug 14, 2025

@yiakwy-xpu-ml-framework-team After using the script to get the mxfp4 weights, I got an error message when using the verification code. Do you have any idea?

RuntimeError: Error(s) in loading state_dict for Linear: size mismatch for weight: copying a param with shape torch.Size([201088, 2880]) from checkpoint, the shape in current model is torch.Size([50267, 1024]).

@yiakwy-xpu-ml-framework-team
Author

yiakwy-xpu-ml-framework-team commented Aug 18, 2025

@liuqianchao we don't see this issue. The script should work with gpt-oss bf16 120b.

Have you verified the original model and produced the inverse weights map file for the mxfp4 weight types? Could you verify the tensor names? Pay attention to the names: Hugging Face has converted the original gpt-oss checkpoint to new names, where the "blocks" and "scales" of the MoE FFN (up/down projections) are renamed.
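Since the tensor names are the usual failure point, a quick sanity check over the checkpoint's tensor list can catch missing or half-renamed MXFP4 pairs. The `_blocks`/`_scales` suffix convention below is an assumption based on the comment above; verify it against the actual names in your checkpoint:

```python
def split_mxfp4_pairs(names):
    """Group tensor names ending in '_blocks'/'_scales' into pairs keyed by
    the common prefix, and report prefixes missing one of the two parts.
    The suffix convention is assumed, not confirmed against the HF repo."""
    pairs = {}
    for n in names:
        for suffix in ("_blocks", "_scales"):
            if n.endswith(suffix):
                pairs.setdefault(n[: -len(suffix)], set()).add(suffix[1:])
    # A complete MXFP4 tensor has both its codes ("blocks") and its scales.
    incomplete = [k for k, v in pairs.items() if v != {"blocks", "scales"}]
    return pairs, incomplete
```

Running this over the keys of the converted checkpoint (for example, the names reported by `safetensors`) should return an empty `incomplete` list if every quantized projection has both halves.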
