Labels: awq (AWQ support), compressed-tensors, enhancement, good first issue, good follow-up issue, wNa16 (weight-only int-quantized support)
Description
Is your feature request related to a problem? Please describe.
- Add a conversion tool to compressed-tensors that, given an AutoAWQ checkpoint, converts the model to the compressed-tensors format using the `pack_quantized` compressor.
- The converted checkpoint should also contain an updated `quantization_config` in its `config.json` with metadata about the format. See this example: https://huggingface.co/nm-testing/Qwen3-Coder-30B-A3B-Instruct-W4A16-awq/blob/main/config.json
- This can be done by representing the quantization parameters of the AutoAWQ model with `QuantizationArgs` and then applying the `ModelCompressor` to the model, as sketched below.
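A minimal sketch of that flow, assuming the compressed-tensors APIs (`QuantizationArgs`, `QuantizationScheme`, `QuantizationConfig`, `ModelCompressor`); exact signatures may differ across versions, and `dequantize_awq_checkpoint` is a hypothetical helper standing in for the real work of unpacking AutoAWQ's `qweight`/`qzeros`/`scales` tensors:

```python
# Sketch only: signatures may differ across compressed-tensors versions.
from transformers import AutoModelForCausalLM
from compressed_tensors.quantization import (
    QuantizationArgs,
    QuantizationConfig,
    QuantizationScheme,
)
from compressed_tensors.compressors import ModelCompressor

model = AutoModelForCausalLM.from_pretrained("path/to/awq-checkpoint")

# Hypothetical helper: unpack AutoAWQ's qweight/qzeros/scales and attach the
# weight / weight_scale / weight_zero_point tensors the compressor expects.
dequantize_awq_checkpoint(model)

# AWQ defaults: asymmetric 4-bit, group-wise, weight-only quantization
weight_args = QuantizationArgs(
    num_bits=4,
    type="int",
    symmetric=False,
    strategy="group",
    group_size=128,  # read group_size from the AutoAWQ config, don't hard-code
)
scheme = QuantizationScheme(targets=["Linear"], weights=weight_args)
quant_config = QuantizationConfig(
    config_groups={"group_0": scheme},
    format="pack-quantized",  # selects the pack_quantized compressor
)

compressor = ModelCompressor(quantization_config=quant_config)
compressed_state_dict = compressor.compress(model)
model.save_pretrained("path/to/output", state_dict=compressed_state_dict)
compressor.update_config("path/to/output")  # writes quantization_config to config.json
```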
Describe the solution you'd like
- A tool that, given an AutoAWQ checkpoint, produces a compressed-tensors formatted model.
- The converted model should run in vLLM without any drop in accuracy.
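As a quick smoke test (not a substitute for an accuracy evaluation), the converted model should load and generate in vLLM, which reads the quantization format from `quantization_config` in `config.json`:

```python
from vllm import LLM, SamplingParams

# Load the converted compressed-tensors checkpoint and generate a few tokens.
llm = LLM(model="path/to/output")
outputs = llm.generate(["Hello, world"], SamplingParams(max_tokens=32))
print(outputs[0].outputs[0].text)
```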