1 parent 1f77518 commit 0ecf711
CHANGELOG.rst
@@ -34,6 +34,7 @@ Model Optimizer Changelog (Linux)
 - Support Phi-4-multimodal and Qwen2.5-VL quantized HF checkpoint export in ``examples/vlm_ptq``.
 - Support storing and restoring Minitron pruning activations and scores for re-pruning without running the forward loop again.
 - Add Minitron pruning example for Megatron-LM framework. See ``examples/megatron-lm`` for more details.
+- Add FP8/NVFP4 KV cache quantization support for Megatron Core models.
 
 0.35 (2025-09-04)
 ^^^^^^^^^^^^^^^^^