[quantization] [draft] GPTQ for VLM by stamalakhov · Pull Request #559 · Samsung/TICO

stamalakhov · 2026-03-17T11:47:03Z

This PR is the first try out for full quantization of VLM model by GPTQ+PTQ.

TODO:

m.b. make it less resource intensive (right now it makes inference for the whole model, not in layerwise fashion)
support PTQ quantization
synchronize GPTQ/PTQ Conv3d quantization
support convert to circle

model	orig_accuracy_vqav2	minmax_quantize_accuracy_vqav2	GPTQ_mse_accuracy_vqav2	GPTQ_smse_accuracy_vqav2
Qwen2_2B	0.8900	0.8260	0.8450	0.8740
Qwen3_2B	0.8570	0.7970	0.8470	0.8390
Qwen3_4B	0.8950	0.8450	0.8910	0.8820

some_details

all models above were quantized using GPTQ+mse/GPTQ+smse:

weights of torch.nn.Linear, torch.nn.Conv2D, torch.nn.Conv1D, nn.ConvTranspose2d to 4 bits,
activations were left at float32
accuracy was computed on the first 1000 samples on vqav2

for 256:

model	vqav2_on_1000_samples
Qwen2_2B_original	0.8900
Qwen2_2B_GPTQ_mse_256_qsamples	0.8630
Qwen2_2B_GPTQ_smse_256_qsamples	0.8780
Qwen3_2B_original	0.8570
Qwen3_2B_GPTQ_mse_256_qsamples	0.8430
Qwen3_2B_GPTQ_smse_256_qsamples	0.8520

Support Conv3d ([quantization] Support torch.nn.Conv3d in GPTQ #577)
Support all Convs in SensitivityCalibrator ([quantization] Suppport convolutions in SensitivityCalibrator #581)
Add example script ([quantization] Fallback for complex models #583)

Related: #548

TICO-DCO-1.0-Signed-off-by: s.malakhov s.malakhov@partner.samsung.com

tico/quantization/algorithm/gptq/gptq.py

This PR is the first try-out for full quantization of VLM model by GPTQ+PTQ. TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>

stamalakhov · 2026-03-26T12:15:19Z

I believe we can close this draft. Everything was merged.

stamalakhov self-assigned this Mar 17, 2026

stamalakhov force-pushed the gptq_forVLM branch 7 times, most recently from 156b2e2 to 44de20c Compare March 20, 2026 08:45

stamalakhov mentioned this pull request Mar 20, 2026

[quantization] Support valid padding #568

Merged

mhs4670go reviewed Mar 23, 2026

View reviewed changes

tico/quantization/algorithm/gptq/gptq.py Outdated Show resolved Hide resolved

stamalakhov force-pushed the gptq_forVLM branch 2 times, most recently from 201366c to d373df3 Compare March 23, 2026 07:01

stamalakhov mentioned this pull request Mar 23, 2026

[quantization] Fix for multiple gpu #569

Merged

stamalakhov force-pushed the gptq_forVLM branch from d373df3 to b85bd43 Compare March 24, 2026 08:54

stamalakhov mentioned this pull request Mar 24, 2026

[quantization] Support torch.nn.Conv3d in GPTQ #577

Merged

stamalakhov force-pushed the gptq_forVLM branch 2 times, most recently from b474690 to 142d97d Compare March 25, 2026 07:00

stamalakhov mentioned this pull request Mar 25, 2026

[quantization] Suppport convolutions in SensitivityCalibrator #581

Merged

[quantization] [draft] GPTQ for VLM

f142bd7

This PR is the first try-out for full quantization of VLM model by GPTQ+PTQ. TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>

stamalakhov force-pushed the gptq_forVLM branch from 142d97d to f142bd7 Compare March 25, 2026 09:57

stamalakhov mentioned this pull request Mar 25, 2026

[quantization] Fallback for complex models #583

Merged

stamalakhov closed this Mar 26, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[quantization] [draft] GPTQ for VLM#559

[quantization] [draft] GPTQ for VLM#559
stamalakhov wants to merge 1 commit intoSamsung:mainfrom
stamalakhov:gptq_forVLM

stamalakhov commented Mar 17, 2026 •

edited

Loading

Uh oh!

Uh oh!

stamalakhov commented Mar 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

stamalakhov commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

stamalakhov commented Mar 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

stamalakhov commented Mar 17, 2026 •

edited

Loading