Skip to content

Commit ec7aa63

Browse files
authored
nvidia awq only use QuantFormat.QDQ quant format (#22429)
nvidia awq only use QuantFormat.QDQ quant format
1 parent 6e5e320 commit ec7aa63

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

onnxruntime/python/tools/quantization/matmul_4bits_quantizer.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -280,7 +280,7 @@ def __init__(
280280

281281
super().__init__(
282282
algorithm="nvidia_awq",
283-
quant_format=quant_format,
283+
quant_format=QuantFormat.QDQ,
284284
op_types_to_quantize=None, # Assuming op_types_to_quantize is handled elsewhere
285285
quant_axes=None, # Assuming quant_axes is handled elsewhere
286286
)

0 commit comments

Comments
 (0)