[Feat] add bf16 sft to mxfp4 conversion #108
Closed
+363
−0
Add bf16 SFT to mxfp4 conversion
Currently the model can run either in bf16 (group-wise fp8 is also possible) on H800/H100, or in MXFP4 on Blackwell.
After SFT-ing the GPT-OSS model and injecting the new identity, we need to convert the model back to MXFP4 to reduce its size and load weights from HBM with 4-bit IO (on H800, the model is converted back to bf16 at runtime).
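As a rough illustration of what the bf16 → MXFP4 round trip does, here is a minimal NumPy sketch (not the code in this PR). It assumes the standard MX layout: blocks of 32 elements sharing one power-of-two (E8M0) scale, with each element snapped to the nearest FP4 (E2M1) value; the scale choice follows the usual `floor(log2(amax)) - 2` rule, so the largest elements in a block may clip to 6.0.

```python
import numpy as np

# Representable FP4 (E2M1) magnitudes; the sign is a separate bit.
FP4_GRID = np.array([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0], dtype=np.float32)

def mxfp4_quantize_dequantize(w, block_size=32):
    """Round-trip weights through MXFP4: per block of 32 elements, pick a
    shared power-of-two scale (E8M0) and snap each element to the nearest
    FP4 (E2M1) value. Returns the dequantized tensor for error inspection."""
    w = np.asarray(w, dtype=np.float32)
    flat = w.reshape(-1, block_size)
    out = np.empty_like(flat)
    for i, block in enumerate(flat):
        amax = np.abs(block).max()
        if amax == 0.0:
            out[i] = 0.0
            continue
        # Shared exponent: floor(log2(amax)) minus E2M1's max exponent (2),
        # so amax/scale lands in [4, 8); values above 6 clip to the grid max.
        exp = int(np.floor(np.log2(amax))) - 2
        scale = np.float32(2.0 ** exp)
        scaled = block / scale
        # Snap each magnitude to the nearest FP4 grid point, keeping the sign.
        idx = np.abs(np.abs(scaled)[:, None] - FP4_GRID).argmin(axis=1)
        out[i] = np.sign(scaled) * FP4_GRID[idx] * scale
    return out.reshape(w.shape)
```

Weights that already sit on the scaled FP4 grid survive the round trip exactly, which is what makes the pairwise weight comparison below meaningful.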
Verification of correctness
We checked the model end-to-end and compared the fp4 weight values pair by pair:
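A pairwise fp4 comparison along these lines can be sketched as follows (a hypothetical helper, not code from this PR). It assumes the fp4 weights are stored packed, two 4-bit codes per `uint8`, and reports the fraction of mismatched codes:

```python
import numpy as np

def compare_fp4_codes(a, b):
    """Compare two packed-FP4 weight tensors (uint8, two 4-bit codes per
    byte) code by code and return the mismatch rate in [0, 1]."""
    assert a.shape == b.shape and a.dtype == b.dtype == np.uint8
    # Unpack low and high nibbles so every 4-bit code is compared directly.
    lo_a, hi_a = a & 0x0F, a >> 4
    lo_b, hi_b = b & 0x0F, b >> 4
    mismatch = np.count_nonzero(lo_a != lo_b) + np.count_nonzero(hi_a != hi_b)
    return mismatch / (2 * a.size)
```

Comparing the raw 4-bit codes (plus the per-block scales) is stricter than comparing dequantized values, since it catches packing-order bugs that dequantization could mask.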