Skip to content

Commit 45d9b43

Browse files
author
ssjia
committed
Update on "[ET-VK] Implemement linear_dq8ta_q4gsw"
Title says it all! Build upon the support for quantized linear introduced in the previous diffs to enable dynamically quantized linear. Also included in this diff is a cleanup of the glslh files used across quantized linear implementations. Differential Revision: [D81931060](https://our.internmc.facebook.com/intern/diff/D81931060/) [ghstack-poisoned]
1 parent 7e75e5f commit 45d9b43

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

backends/vulkan/patterns/quantized_linear.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -181,6 +181,9 @@ def is_input_dynamic_perchannel_quantized(self) -> bool:
181181
if self.quantize_input_node is None:
182182
return False
183183

184+
if not isinstance(self.input_scales_node, torch.fx.Node):
185+
return False
186+
184187
# For dynamic quantization, input scale node should be a getitem operator
185188
# retrieving the output of a choose_qparams op
186189
if self.input_scales_node.target != operator.getitem:

0 commit comments

Comments
 (0)