Skip to content

Commit 9bdee07

Browse files
committed
update readme and documentation
Signed-off-by: Suguna Velury <[email protected]>
1 parent 0784fe8 commit 9bdee07

File tree

2 files changed

+2
-1
lines changed

2 files changed

+2
-1
lines changed

examples/llm_qat/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -359,7 +359,7 @@ After performing QLoRA training the final checkpoint can be exported for deploym
359359
```sh
360360
python export.py \
361361
--pyt_ckpt_path llama3-fp4-qlora \
362-
--export_dir llama3-fp4-qlora-hf \
362+
--export_path llama3-fp4-qlora-hf \
363363

364364
```
365365

modelopt/torch/export/quant_utils.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -821,6 +821,7 @@ def postprocess_state_dict(
821821
state_dict: The full model state_dict.
822822
maxbound: The maximum bound value for the output quantizer.
823823
quantization: The KV cache quantization format.
824+
is_modelopt_qlora: Whether the model is a modelopt-trained QLoRA model.
824825
825826
Returns:
826827
The filtered state_dict without unnecessary keys like '_amax' and non KV cache output quantizers.

0 commit comments

Comments
 (0)