Regarding the W8A8 Quantization Scheme for the Qwen-Next 80B Model #2202

@vonchenplus

Description

Hi there,

I’m planning to quantize the Qwen-Next 80B model to W8A8 using a combination of SmoothQuant and GPTQ. However, I noticed that SmoothQuant does not currently support this model.

In this situation, would it be acceptable to use the default mapping method for SmoothQuant? If not, are there any recommended alternatives or best practices for achieving stable and accurate W8A8 quantization on this model?
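For context on what the mapping affects: SmoothQuant migrates activation outliers into the weights via a per-input-channel scale, and the "mapping" determines which preceding module (e.g. a norm layer) that scale is folded into. Below is a minimal NumPy sketch of the core transform; the shapes, the migration strength `alpha`, and the outlier pattern are illustrative assumptions, not taken from any particular implementation.

```python
import numpy as np

rng = np.random.default_rng(0)
# Toy activations with a few outlier channels (channels 1 and 4 are inflated)
X = rng.normal(size=(4, 8)) * np.array([1, 10, 1, 1, 50, 1, 1, 1.0])
# Toy linear weight, shape (out_features, in_features)
W = rng.normal(size=(16, 8))

# Per-input-channel smoothing scale: s_j = max|X_j|^alpha / max|W_:,j|^(1-alpha)
alpha = 0.5  # migration strength, a common default; tune per model
s = np.abs(X).max(axis=0) ** alpha / np.abs(W).max(axis=0) ** (1 - alpha)

X_smooth = X / s  # activation outliers shrink, easier to quantize to INT8
W_smooth = W * s  # the scale is folded into the weight's input channels

# The linear layer's output is mathematically unchanged by the rewrite
assert np.allclose(X @ W.T, X_smooth @ W_smooth.T)
```

In a real model the division by `s` cannot be applied to the activation tensor at runtime; it has to be fused into whichever module produces that activation, which is exactly what the per-architecture mapping specifies. A default mapping can therefore only be safe if the model's layer layout matches the pattern the default assumes, which is worth verifying for a new architecture like Qwen-Next.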

Thanks in advance for your help!

Metadata

Labels

enhancement (New feature or request), qwen (For any PR / issue related to Qwen support), smoothquant (For any issue / PR related to SmoothQuant support)
