[quantization] Introduce wrapper for Qwen3VLModel by dvsav · Pull Request #555 · Samsung/TICO

dvsav · 2026-03-16T15:06:51Z

This change introduces the QuantQwen3VLModel wrapper to support post-training quantization (PTQ) of the Qwen3VLModel module.

Why?

Qwen3VLModel is an essential part of the Qwen model.
Trying to quantize Qwen3VLModel via PTQ currently raises an exception: PTQQuantizer: no quantization wrapper for Qwen3VLModel.
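
For readers unfamiliar with this kind of failure, here is a minimal, self-contained sketch of a wrapper registry that raises when a module type has no registered wrapper. It does not use TICO's actual registry API: the names `_WRAPPERS`, `register`, `lookup`, and `wrap`, the stand-in `Qwen3VLModel` class, and the choice of `NotImplementedError` are all assumptions made for illustration.

```python
from typing import Callable, Dict

# Hypothetical registry: maps a float module class to its quantization wrapper class.
_WRAPPERS: Dict[type, type] = {}


def register(fp_cls: type) -> Callable[[type], type]:
    """Class decorator that records `wrapper_cls` as the wrapper for `fp_cls`."""
    def decorator(wrapper_cls: type) -> type:
        _WRAPPERS[fp_cls] = wrapper_cls
        return wrapper_cls
    return decorator


def lookup(fp_cls: type) -> type:
    """Return the registered wrapper class or fail with a descriptive error."""
    try:
        return _WRAPPERS[fp_cls]
    except KeyError:
        # The exception type is an assumption; the real quantizer may raise another one.
        raise NotImplementedError(
            f"no quantization wrapper for {fp_cls.__name__}"
        ) from None


def wrap(module: object) -> object:
    """Wrap a float module instance with its registered quantization wrapper."""
    return lookup(type(module))(module)


class Qwen3VLModel:
    """Stand-in for the real transformers class, so the sketch needs no dependencies."""


# Before a wrapper is registered, wrapping fails.
try:
    wrap(Qwen3VLModel())
    error_before = None
except NotImplementedError as exc:
    error_before = str(exc)


@register(Qwen3VLModel)
class QuantQwen3VLModel:
    """Toy wrapper that only stores the wrapped float module."""

    def __init__(self, fp_module: Qwen3VLModel) -> None:
        self.module = fp_module


# After registration, the same call succeeds.
wrapped = wrap(Qwen3VLModel())
```

In this sketch the fix is the same in spirit as in this PR: once a wrapper class is registered for the module type, the lookup succeeds and the module can be wrapped.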

What?

This change introduces:

Class QuantQwen3VLModel (tico/quantization/wrapq/wrappers/qwen_vl/quant_model.py).
Unit tests: class TestQuantQwen3VLModel (test/quantization/wrapq/wrappers/qwen_vl/test_quant_model.py), skipped if the transformers package is not installed.
New entry in _CORE_MODULES (tico/quantization/wrapq/wrappers/registry.py).
Example of Qwen3VLModel quantization and conversion to Circle (tico/quantization/wrapq/examples/qwen/quantize_qwen_model.py).
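
The "skipped if transformers is not installed" behavior of the unit tests can be approximated with the standard library alone. The sketch below is one common way to do it and is not taken from the actual test file; `HAS_TRANSFORMERS` and `TestOptionalDependency` are hypothetical names.

```python
import importlib.util
import unittest

# True only when the optional dependency can be located in this environment.
HAS_TRANSFORMERS = importlib.util.find_spec("transformers") is not None


@unittest.skipUnless(HAS_TRANSFORMERS, "transformers package is not installed")
class TestOptionalDependency(unittest.TestCase):
    """Runs only when transformers is available; reported as skipped otherwise."""

    def test_import(self) -> None:
        # Import inside the test so that collecting this file never fails.
        import transformers

        self.assertTrue(hasattr(transformers, "__version__"))
```

Checking availability with `find_spec` avoids importing the package at collection time, so environments without transformers report the test class as skipped instead of failing.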

Unit Tests

Unit test results with coverage information:

$ coverage run -m pytest test/quantization/wrapq/wrappers/qwen_vl/test_quant_model.py -v

Coverage info (irrelevant files skipped):

$ coverage report -m
Name                                                                   Stmts   Miss  Cover   Missing
----------------------------------------------------------------------------------------------------
...
...
----------------------------------------------------------------------------------------------------
TOTAL                                                                  10170   6520    36%

Script for testing quantization and conversion to Circle

$ python3 tico/quantization/wrapq/examples/qwen/quantize_qwen_model.py

This change introduces QuantQwen3VLModel wrapper to support post-training quantization of Qwen3VLModel operation.

TICO-DCO-1.0-Signed-off-by: d.savchenkov <d.savchenkov@partner.samsung.com>

dvsav mentioned this pull request Mar 16, 2026

Qwen3-VL: Implement quantization wrappers #483

Open

dvsav force-pushed the quant_model branch 7 times, most recently from 0b92491 to cec58ba, March 17, 2026 11:31

dvsav force-pushed the quant_model branch 3 times, most recently from 9f6ecb3 to a86be80, March 27, 2026 12:51

[quantization] Introduce wrapper for Qwen3VLModel

8b88c9b

dvsav force-pushed the quant_model branch from a86be80 to 8b88c9b, March 27, 2026 14:02

dvsav wants to merge 1 commit into Samsung:main from dvsav:quant_model
