Skip to content

Commit 6dc8218

Browse files
committed
resolve conflict
Signed-off-by: Zhiyu Cheng <[email protected]>
1 parent 2290533 commit 6dc8218

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

CHANGELOG.rst

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -31,6 +31,8 @@ Model Optimizer Changelog (Linux)
3131
- Add support for Nemotron Nano VL v1 & v2 models in FP8/NVFP4 PTQ workflow.
3232
- Add flags ``nodes_to_include`` and ``op_types_to_include`` in AutoCast to force-include nodes in low precision, even if they would otherwise be excluded by other rules.
3333
- Add support for ``torch.compile`` and benchmarking in ``examples/diffusers/quantization/diffusion_trt.py``.
34+
- Enabled native Modelopt quantization support for FP8 and NVFP4 formats in SGLang. See `SGLang quantization documentation <https://github.com/sgl-project/sglang/blob/main/docs/advanced_features/quantization.md#using-nvidia-modelopt>`_ for more details.
35+
- Added modelopt quantized checkpoints in vLLM/SGLang CI/CD pipelines (PRs are under review).
3436

3537
**Documentation**
3638

0 commit comments

Comments
 (0)