Commit f1c5093

up
1 parent 53240c6 commit f1c5093

1 file changed: +2 -0 lines changed

src/diffusers/quantizers/torchao/torchao_quantizer.py

Lines changed: 2 additions & 0 deletions
@@ -295,6 +295,8 @@ def get_cuda_warm_up_factor(self):
             - Use a division factor of 4 for int8 weights
         """
         # Original mapping for non-AOBaseConfig types
+        # For the uint types, this is a best guess. Once these types become more used
+        # we can look into their nuances.
         map_to_target_dtype = {"int4_*": 8, "int8_*": 4, "uint*": 8, "float8*": 4}
         quant_type = self.quantization_config.quant_type
         for pattern, target_dtype in map_to_target_dtype.items():
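
The hunk ends at the loop header, so the loop body is not visible here. As a rough, standalone sketch of how such a pattern table could be resolved, the snippet below assumes glob-style matching with Python's standard `fnmatch` module. The function name `warm_up_factor`, the example quant type strings, and the `ValueError` fallback are hypothetical and only serve the illustration; they are not taken from the commit.

```python
from fnmatch import fnmatch

# Same mapping as in the diff: glob pattern -> division factor.
# The "uint*" entry is described in the commit as a best guess.
MAP_TO_TARGET_DTYPE = {"int4_*": 8, "int8_*": 4, "uint*": 8, "float8*": 4}


def warm_up_factor(quant_type: str) -> int:
    """Return the factor of the first pattern that matches quant_type.

    Hypothetical helper: assumes glob-style matching via fnmatch.
    """
    for pattern, target_dtype in MAP_TO_TARGET_DTYPE.items():
        # fnmatch requires the whole string to match, so "uint4wo"
        # matches "uint*" but not "int4_*".
        if fnmatch(quant_type, pattern):
            return target_dtype
    # Assumed fallback for unmatched strings; the hunk does not show
    # what the real method does in that case.
    raise ValueError(f"Unsupported quant_type: {quant_type!r}")


if __name__ == "__main__":
    for name in ("int4_weight_only", "int8_weight_only", "uint4wo", "float8wo"):
        print(name, warm_up_factor(name))
```

Because dictionaries preserve insertion order, the first matching pattern wins, so more specific patterns would need to be listed before broader ones if they ever overlapped.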

0 commit comments