Skip to content

Commit 021953e

Browse files
authored
[quantization] Disable Conv1d for GPTQFPI/ GPTQ (#409)
This PR disables `conv1d` nodes for further investigation. TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>
1 parent 9a18813 commit 021953e

File tree

2 files changed

+2
-6
lines changed

2 files changed

+2
-6
lines changed

tico/quantization/algorithm/fpi_gptq/quantizer.py

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -76,9 +76,7 @@ def convert(self, model):
7676
)
7777
):
7878
# 1) Identify quantizable submodules within the layer
79-
full = find_layers(
80-
layer, layers=[torch.nn.Linear, torch.nn.Conv2d, torch.nn.Conv1d]
81-
)
79+
full = find_layers(layer, layers=[torch.nn.Linear, torch.nn.Conv2d])
8280
# filter out depthwise convolutions and alike
8381
full = {
8482
key: full[key]

tico/quantization/algorithm/gptq/quantizer.py

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -193,9 +193,7 @@ def convert(self, model):
193193
)
194194
):
195195
# 1) Identify quantizable submodules within the layer
196-
full = find_layers(
197-
layer, layers=[torch.nn.Linear, torch.nn.Conv2d, torch.nn.Conv1d]
198-
)
196+
full = find_layers(layer, layers=[torch.nn.Linear, torch.nn.Conv2d])
199197
# filter out depthwise convolutions and alike
200198
full = {
201199
key: full[key]

0 commit comments

Comments
 (0)