Skip to content

Conversation

@xieck13
Copy link
Contributor

@xieck13 xieck13 commented Jan 24, 2026

Suppose we have completed Int4 RL training. We will first save a torch_dist checkpoint in BF16 format, and then convert this torch_dist checkpoint into the Hugging Face format. To obtain a true INT4 checkpoint, we further need to convert the BF16 Hugging Face checkpoint into INT4 without using any calibration data.

The purpose of this script is to perform a weight conversion that is functionally equivalent to the INT4(W4A16) QAT.

@xieck13 xieck13 marked this pull request as draft January 24, 2026 12:37
@xieck13 xieck13 marked this pull request as ready for review January 24, 2026 14:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant