[quantization] Introduce wrapper for Qwen3VLTextDecoderLayer by dvsav · Pull Request #566 · Samsung/TICO

dvsav · 2026-03-19T13:06:06Z

This change introduces QuantQwen3VLTextDecoderLayer wrapper to support post-training quantization of Qwen3VLTextDecoderLayer module.

Why?

Qwen3VLTextDecoderLayer is an essential part of Qwen model.
Trying to quantize Qwen3VLTextDecoderLayer via PTQ generates exception PTQQuantizer: no quantization wrapper for Qwen3VLTextDecoderLayer.

What

This change introduces:

Class QuantQwen3VLTextDecoderLayer (tico/quantization/wrapq/wrappers/qwen_vl/quant_text_decoder_layer.py).
Unit tests: class TestQuantQwen3VLTextDecoderLayer (test/quantization/wrapq/wrappers/qwen_vl/test_quant_text_decoder_layer.py) - skipped if transformers package is not installed.
New entry in _CORE_MODULES (tico/quantization/wrapq/wrappers/registry.py).
Example of Qwen3VLTextDecoderLayer quantization and conversion to Circle (tico/quantization/wrapq/examples/qwen/quantize_text_decoder_layer.py).

Unit Tests

Unit tests results with coverage information:

$ coverage run -m pytest test/quantization/wrapq/wrappers/qwen_vl/test_quant_text_decoder_layer.py -v
================================================================== test session starts ===================================================================
platform linux -- Python 3.10.12, pytest-8.4.0, pluggy-1.6.0 -- /home/d.savchenkov/myenv/bin/python3
cachedir: .pytest_cache
rootdir: /home/d.savchenkov/TICO
configfile: pyproject.toml
plugins: anyio-4.12.0, mock-3.15.1, xdist-3.7.0, cov-6.2.1
collected 8 items

test/quantization/wrapq/wrappers/qwen_vl/test_quant_text_decoder_layer.py::TestQuantQwen3VLTextDecoderLayer::test_different_batch_sizes            PASSED [ 12%]
test/quantization/wrapq/wrappers/qwen_vl/test_quant_text_decoder_layer.py::TestQuantQwen3VLTextDecoderLayer::test_forward_diff                     PASSED [ 25%]
test/quantization/wrapq/wrappers/qwen_vl/test_quant_text_decoder_layer.py::TestQuantQwen3VLTextDecoderLayer::test_mode_transitions                 PASSED [ 37%]
test/quantization/wrapq/wrappers/qwen_vl/test_quant_text_decoder_layer.py::TestQuantQwen3VLTextDecoderLayer::test_observer_count                   PASSED [ 50%]
test/quantization/wrapq/wrappers/qwen_vl/test_quant_text_decoder_layer.py::TestQuantQwen3VLTextDecoderLayer::test_output_shape                     PASSED [ 62%]
test/quantization/wrapq/wrappers/qwen_vl/test_quant_text_decoder_layer.py::TestQuantQwen3VLTextDecoderLayer::test_per_module_override              PASSED [ 75%]
test/quantization/wrapq/wrappers/qwen_vl/test_quant_text_decoder_layer.py::TestQuantQwen3VLTextDecoderLayer::test_registration_in_registry         PASSED [ 87%]
test/quantization/wrapq/wrappers/qwen_vl/test_quant_text_decoder_layer.py::TestQuantQwen3VLTextDecoderLayer::test_residual_connection_preservation PASSED [100%]

======================================================== 8 passed, 2 warnings in 80.06s (0:01:20) ========================================================

Coverage info (irrelevant files skipped):

$ coverage report -m

Name                                                                    Stmts   Miss  Cover   Missing
-----------------------------------------------------------------------------------------------------
...
tico/quantization/wrapq/wrappers/qwen_vl/quant_text_decoder_layer.py       42      0   100%
...
-----------------------------------------------------------------------------------------------------
TOTAL                                                                   11272   7196    36%

Script for testing quantization and conversion to Circle

$ python tico/quantization/wrapq/examples/qwen/quantize_text_decoder_layer.py
┌───────────── Quantization Error Summary ─────────────
│ Mean |diff|: 0.018236
│ PEIR       : 1.069503 %
└──────────────────────────────────────────────────────
    ┌────────────────────────────────────────────┐
 4.5┤                                            │
    │                                         •  │
    │                                      •••   │
 3.0┤                                    •••     │
    │                                  •••       │
    │                                •••         │
    │                             ••••           │
 1.5┤                           ••••             │
    │                         ••••               │
    │                       ••••                 │
 0.0┤                     ••••                   │
    │                   ••••                     │
    │                 ••••                       │
    │               ••••                         │
-1.5┤             ••••                           │
    │           ••••                             │
    │         •••                                │
-3.0┤       •••                                  │
    │     •••                                    │
    │   •••                                      │
    │  •                                         │
-4.5┤                                            │
    └┬──────────┬──────────┬─────────┬──────────┬┘
   -4.5       -2.3        0.0       2.3       4.5 

Circle model saved as 'qwen3vl_text_decoder_layer.q.circle'

mhs4670go · 2026-03-23T14:10:11Z

@dvsav Seems CI test failed.

This change introduces QuantQwen3VLTextDecoderLayer wrapper to support post-training quantization of Qwen3VLTextDecoderLayer operation. TICO-DCO-1.0-Signed-off-by: d.savchenkov <d.savchenkov@partner.samsung.com>

dvsav · 2026-03-24T15:30:36Z

@dvsav Seems CI test failed.

Hi @mhs4670go,
Yes, indeed. I've fixed it now.

mhs4670go · 2026-03-24T23:07:28Z

tico/quantization/wrapq/examples/qwen/quantize_text_decoder_layer.py

+    # Convert to quantized version
+    quantized_model = tico.quantization.convert(prepared_model, inplace=True)
+
+    # Compute PEIR (Peak Error-to-Input Ratio) between quantized model and original model


Peak Error to Interval ratio. Seems other codes have mistakes. Let's fix them at once in another PR.

mhs4670go

LGTM

dvsav force-pushed the quant_text_decoder_layer branch from 613c233 to be903b6 Compare March 19, 2026 13:10

dvsav mentioned this pull request Mar 19, 2026

Qwen3-VL: Implement quantization wrappers #483

Open

dvsav force-pushed the quant_text_decoder_layer branch from be903b6 to cae5c77 Compare March 19, 2026 13:22

dvsav marked this pull request as ready for review March 19, 2026 13:32

dvsav force-pushed the quant_text_decoder_layer branch 2 times, most recently from 9795d38 to be93970 Compare March 23, 2026 06:53

[quantization] Introduce wrapper for Qwen3VLTextDecoderLayer

8da4c37

This change introduces QuantQwen3VLTextDecoderLayer wrapper to support post-training quantization of Qwen3VLTextDecoderLayer operation. TICO-DCO-1.0-Signed-off-by: d.savchenkov <d.savchenkov@partner.samsung.com>

dvsav force-pushed the quant_text_decoder_layer branch from be93970 to 8da4c37 Compare March 24, 2026 08:42

dvsav mentioned this pull request Mar 24, 2026

[quantization] Introduce wrappers for Qwen3VLTextDecoderLayer and Qwen3VLTextModel #535

Draft

mhs4670go reviewed Mar 24, 2026

View reviewed changes

mhs4670go approved these changes Mar 24, 2026

View reviewed changes

mhs4670go merged commit 8a8e7d8 into Samsung:main Mar 24, 2026
7 checks passed

dvsav deleted the quant_text_decoder_layer branch March 25, 2026 06:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[quantization] Introduce wrapper for Qwen3VLTextDecoderLayer#566

[quantization] Introduce wrapper for Qwen3VLTextDecoderLayer#566
mhs4670go merged 1 commit intoSamsung:mainfrom
dvsav:quant_text_decoder_layer

dvsav commented Mar 19, 2026

Uh oh!

mhs4670go commented Mar 23, 2026

Uh oh!

dvsav commented Mar 24, 2026

Uh oh!

mhs4670go Mar 24, 2026

Uh oh!

mhs4670go left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dvsav commented Mar 19, 2026

Why?

What

Unit Tests

Script for testing quantization and conversion to Circle

Uh oh!

mhs4670go commented Mar 23, 2026

Uh oh!

dvsav commented Mar 24, 2026

Uh oh!

mhs4670go Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

mhs4670go left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants