
[quantization] Support convolutions in SensitivityCalibrator #581

Merged
mhs4670go merged 1 commit into Samsung:main from stamalakhov:sens_for_convs
Mar 25, 2026

Conversation

@stamalakhov
Contributor

This PR:

  1. adds support for convolutions with related tests
  2. brings support for multiple inputs
  3. adds the option to hide progress to SensitivityCalibrator.
./ccex test --include-internal -k quantization.algorithm.test_gptq.GPTQTest

RUN unit tests with -k quantization.algorithm.test_gptq.GPTQTest ...
test_groupwise_conv1d (quantization.algorithm.test_gptq.GPTQTest) ... ok
test_groupwise_conv2d (quantization.algorithm.test_gptq.GPTQTest) ... ok
test_model (quantization.algorithm.test_gptq.GPTQTest) ... <frozen importlib._bootstrap>:241: DeprecationWarning: builtin type SwigPyPacked has no __module__ attribute
<frozen importlib._bootstrap>:241: DeprecationWarning: builtin type SwigPyObject has no __module__ attribute
You are using the default legacy behaviour of the <class 'transformers.models.llama.tokenization_llama_fast.LlamaTokenizerFast'>. This is expected, and simply means that the `legacy` (previous) behavior will be used so nothing changes for you. If you want to use the new behaviour, set `legacy=False`. This should only be set if you understand what it means, and thoroughly read the reason why this was added as explained in https://github.com/huggingface/transformers/pull/24565 - if you loaded a llama tokenizer from a GGUF file you can ignore this message.
ok
test_net (quantization.algorithm.test_gptq.GPTQTest) ... No specialized wrapper found for ModuleList; applying recursive wrapping.
ok
test_net_on_zero_inputs (quantization.algorithm.test_gptq.GPTQTest) ... ok
test_normconv1d (quantization.algorithm.test_gptq.GPTQTest) ... ok
test_normconv1d_with_logits (quantization.algorithm.test_gptq.GPTQTest) ... ok
test_normconv2d (quantization.algorithm.test_gptq.GPTQTest) ... ok
test_normconv2d_on_zero_inputs (quantization.algorithm.test_gptq.GPTQTest) ... ok
test_normconv2d_with_logits (quantization.algorithm.test_gptq.GPTQTest) ... ok
test_normconv3d (quantization.algorithm.test_gptq.GPTQTest) ... ok
test_normconv3d_on_zero_inputs (quantization.algorithm.test_gptq.GPTQTest) ... ok
test_normconv3d_with_logits (quantization.algorithm.test_gptq.GPTQTest) ... ok
test_paddednormconv2d (quantization.algorithm.test_gptq.GPTQTest) ... ok
test_paddednormconv3d (quantization.algorithm.test_gptq.GPTQTest) ... ok
test_transposed_conv2d (quantization.algorithm.test_gptq.GPTQTest) ... ok
test_transposed_conv2d_with_logits (quantization.algorithm.test_gptq.GPTQTest) ... ok

----------------------------------------------------------------------
Ran 17 tests in 61.068s

OK

Draft: #559
Related: #548

TICO-DCO-1.0-Signed-off-by: s.malakhov s.malakhov@partner.samsung.com

@stamalakhov stamalakhov self-assigned this Mar 25, 2026
if show_progress is True:
print("Computing calibration set")
for prompt in tqdm.tqdm(dataset, disable=not show_progress):
if isinstance(prompt, torch.Tensor):
Contributor Author

Let's process multiple inputs as well.
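The quoted snippet gates all progress output behind a single show_progress flag. A torch/tqdm-free sketch of the same pattern, with illustrative names that are assumptions rather than code from the PR:

```python
def run_calibration(dataset, show_progress=False):
    """Collect prompts, optionally printing progress (pure-Python sketch)."""
    if show_progress:
        print("Computing calibration set")
    collected = []
    for i, prompt in enumerate(dataset, start=1):
        if show_progress:
            # stand-in for tqdm's progress bar
            print(f"\rprompt {i}/{len(dataset)}", end="")
        collected.append(prompt)
    if show_progress:
        print()
    return collected
```

With show_progress=False (the new option), the loop runs silently, which is what `disable=not show_progress` achieves for tqdm in the real code.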

Comment on lines +136 to +141
self.calibrated_types = [
    torch.nn.Linear,
    torch.nn.Conv2d,
    torch.nn.Conv1d,
    torch.nn.Conv3d,
    torch.nn.ConvTranspose2d,
Contributor Author

Calibrate convolutions as well.
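The selection itself is plain isinstance filtering against this type list. A self-contained sketch with stand-in classes (FakeLinear, FakeConv2d, and FakeReLU are assumptions standing in for torch.nn layers, not the real API):

```python
# Stand-ins for torch.nn layer classes, for illustration only.
class FakeLinear: pass
class FakeConv2d: pass
class FakeReLU: pass

# Mirrors the role of self.calibrated_types in the quoted diff.
CALIBRATED_TYPES = (FakeLinear, FakeConv2d)

def modules_to_calibrate(named_modules):
    # isinstance accepts a tuple of types, so a single check suffices
    return {name: mod for name, mod in named_modules.items()
            if isinstance(mod, CALIBRATED_TYPES)}
```

Extending support to new layer kinds then amounts to appending their classes to the calibrated-types list, which is exactly what this PR does for the Conv and ConvTranspose variants.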

inp_ids = inputs.view(-1, inputs.shape[-1])
logits = model(inp_ids.to(model.device)).logits
if isinstance(inputs, torch.Tensor):
    inp_ids = inputs.squeeze(0)  # remove redundant batch dimension
Contributor Author

Remove the redundant batch dimension instead of reshaping the input to a 2D shape.
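The difference matters for convolutions: view(-1, shape[-1]) always forces rank 2, while squeeze(0) only drops a size-1 leading dimension and preserves the rank that Conv1d/2d/3d inputs require. A small shape-level illustration (the helper names are made up for this sketch):

```python
def flattened_2d_shape(shape, last_dim):
    # resulting shape of inputs.view(-1, inputs.shape[-1]): always rank 2
    total = 1
    for d in shape:
        total *= d
    return (total // last_dim, last_dim)

def squeezed_shape(shape):
    # resulting shape of inputs.squeeze(0): drops a leading size-1 batch
    # dimension and otherwise leaves the shape (and rank) unchanged
    return tuple(shape[1:]) if shape and shape[0] == 1 else tuple(shape)
```

For a batched image tensor of shape (1, 3, 8, 8), the view path would yield (24, 8), which a Conv2d cannot consume, while the squeeze path yields (3, 8, 8).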

Comment on lines +173 to +176
for item in inputs:
    inputs[item] = inputs[item].to(model.device).squeeze(0)

logits = model(**inputs).logits
Contributor Author

The same as above, but for multiple inputs.
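For dict-style inputs the same squeeze is applied per key. A sketch operating on shape tuples instead of tensors (the device transfer from the real code is omitted, and the function name is illustrative):

```python
def squeeze_batch_dims(inputs):
    """Drop a leading size-1 batch dimension from every entry of a dict
    of shape tuples, mirroring the per-key squeeze in the quoted loop."""
    return {key: (val[1:] if val and val[0] == 1 else val)
            for key, val in inputs.items()}
```

This keeps keyword-argument models (model(**inputs)) working with the same single-sample calibration convention as the tensor path.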

This PR:
1. adds support for convolutions with related tests
2. brings support for multiple inputs
3. adds the option to hide progress to SensitivityCalibrator.

TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>
@stamalakhov stamalakhov requested a review from a team March 25, 2026 07:29
for type in self.calibrated_types:
    if isinstance(module, type):
        modules_to_process[name] = module
        name_of_module[module] = name
Contributor

This is not related to this PR, but name_of_module is not used anywhere.

Contributor Author

Ahh. You're right! Thank you! I'll remove it.

Contributor

@mhs4670go mhs4670go left a comment

LGTM

@mhs4670go mhs4670go merged commit 976b8a9 into Samsung:main Mar 25, 2026
7 checks passed
@stamalakhov stamalakhov deleted the sens_for_convs branch March 25, 2026 09:41