add convert hf to int4 without calibration dataset #1489
+326
−4
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Suppose we have completed Int4 RL training. We will first save a torch_dist checkpoint in BF16 format, and then convert this torch_dist checkpoint into the Hugging Face format. To obtain a true INT4 checkpoint, we further need to convert the BF16 Hugging Face checkpoint into INT4 without using any calibration data.
The purpose of this script is to perform a weight conversion that is functionally equivalent to the INT4(W4A16) QAT.