Skip to content

Commit f9e8bf6

Browse files
committed
update readme and documentation
Signed-off-by: Suguna Velury <[email protected]>
1 parent 32e6330 commit f9e8bf6

File tree

2 files changed

+2
-1
lines changed

2 files changed

+2
-1
lines changed

examples/llm_qat/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -350,7 +350,7 @@ After performing QLoRA training the final checkpoint can be exported for deploym
350350
```sh
351351
python export.py \
352352
--pyt_ckpt_path llama3-fp4-qlora \
353-
--export_dir llama3-fp4-qlora-hf \
353+
--export_path llama3-fp4-qlora-hf \
354354

355355
```
356356

modelopt/torch/export/quant_utils.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -827,6 +827,7 @@ def postprocess_state_dict(
827827
state_dict: The full model state_dict.
828828
maxbound: The maximum bound value for the output quantizer.
829829
quantization: The KV cache quantization format.
830+
is_modelopt_qlora: Whether the model is a modelopt-trained QLoRA model.
830831
831832
Returns:
832833
The filtered state_dict without unnecessary keys like '_amax' and non KV cache output quantizers.

0 commit comments

Comments
 (0)