Skip to content

Commit 43718bc

Browse files
committed
fix export
Signed-off-by: Jennifer Chen <[email protected]>
1 parent e6f3ed7 commit 43718bc

File tree

2 files changed

+1
-3
lines changed

2 files changed

+1
-3
lines changed

examples/nemo_run/qat/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -40,7 +40,7 @@ graph TD;
4040

4141
## Results
4242

43-
QAT of Qwen3-8B NVFP4 recovers most of the accuracy on the MMLU benchmark after NVFP4 PTQ. We finetune the Qwen3-8B NVFP4 checkpoint for 200 steps with a learning rate of 1e-5 and global batch size of 512.
43+
QAT of Qwen3-8B NVFP4 recovers most of the accuracy on the MMLU benchmark after NVFP4 PTQ. We finetune the Qwen3-8B NVFP4 checkpoint for 200 steps with a learning rate of 1e-5 and global batch size of 512 on one node of 8 x H100 GPUs.
4444

4545
| | MMLU 5% |
4646
|---------------------------|---------|

modelopt/torch/export/plugins/nemo_run.py

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -24,14 +24,12 @@
2424
def export_most_recent_ckpt(directory: str, output_path: str):
2525
"""Export most recent checkpoint from a NeMo Run experiment directory."""
2626
most_recent_ckpt = _get_most_recent_ckpt(directory)
27-
modelopt_kwargs = {"export_extra_modules": True}
2827
logging.info(f"Exporting most recent NeMo Run checkpoint: {most_recent_ckpt}")
2928
export_ckpt(
3029
most_recent_ckpt,
3130
"hf",
3231
output_path=output_path,
3332
overwrite=True,
34-
modelopt_export_kwargs=modelopt_kwargs,
3533
)
3634

3735

0 commit comments

Comments
 (0)