-
Notifications
You must be signed in to change notification settings - Fork 453
Open
Labels
enhancementNew feature or requestNew feature or requestqwenFor any PR / issue related to Qwen supportFor any PR / issue related to Qwen supportsmoothquantFor any issue / PR related to SmoothQuant supportFor any issue / PR related to SmoothQuant support
Description
Hi there,
I’m planning to quantize the Qwen-Next 80B model to W8A8 using a combination of SmoothQuant and GPTQ. However, I noticed that SmoothQuant does not currently support this model.
In this situation, would it be acceptable to use the default mapping method for SmoothQuant? If not, are there any recommended alternatives or best practices for achieving stable and accurate W8A8 quantization on this model?
Thanks in advance for your help!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or requestqwenFor any PR / issue related to Qwen supportFor any PR / issue related to Qwen supportsmoothquantFor any issue / PR related to SmoothQuant supportFor any issue / PR related to SmoothQuant support