Skip to content

Commit b3a210c

Browse files
committed
update changelog to include SGlang/vLLM related updates
Signed-off-by: Zhiyu Cheng <[email protected]>
1 parent 8cf516e commit b3a210c

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

CHANGELOG.rst

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -26,6 +26,8 @@ Model Optimizer Changelog (Linux)
2626
- Add support for multi-node PTQ and export with FSDP2 in ``examples/llm_ptq/multinode_ptq.py``. See `examples/llm_ptq/README.md <https://github.com/NVIDIA/TensorRT-Model-Optimizer/tree/main/examples/llm_ptq#multi-node-post-training-quantization-with-fsdp2>`_ for more details.
2727
- Add support for Nemotron Nano VL v1 & v2 models in FP8/NVFP4 PTQ workflow.
2828
- Add flags ``nodes_to_include`` and ``op_types_to_include`` in AutoCast to force-include nodes in low precision, even if they would otherwise be excluded by other rules.
29+
- Enabled native Modelopt quantization support for FP8 and NVFP4 formats in SGLang. See `SGLang quantization documentation <https://github.com/sgl-project/sglang/blob/main/docs/advanced_features/quantization.md#using-nvidia-modelopt>`_ for more details.
30+
- Added modelopt quantized checkpoints in vLLM/SGLang CI/CD pipelines (PRs are under review).
2931

3032
**Documentation**
3133

0 commit comments

Comments
 (0)