Overview
The near-term focus for LLM Compressor will center on performance improvements across core workflows, targeted enhancements to NVFP4, stabilizing and hardening MXFP4 support, and broad improvements to modifier functionality. These efforts aim to improve efficiency, robustness, and overall usability, enabling more reliable and scalable model compression workflows.
In addition, we will continue to expand quantization support for the latest model releases to ensure timely compatibility with newly introduced architectures and checkpoints, including adopting transformers v5.0.
We will also focus on improving the quality of our documentation, examples, and CI/CD for easier access and understanding.
Q1 Roadmap
Performance Refactor - Enable Distributed Quantization Support
Status: In Progress
RFC: #2180
Issues:
- [Performance Refactor] Replace accelerate’s offloading with distributed-friendly implementations #2215
- [Performance Refactor] Implement a user interface for loading distributed/offloaded models #2216
- [Performance Refactor] Add data parallelism support to LLM Compressor #2217
Enable Modifier-Specific Support:
- [Performance Refactor] Extend modifiers to support weight-parallel optimization - GPTQModifier #2218
- [Performance Refactor] Extend modifiers to support weight-parallel optimization - AWQModifier #2219
- [Performance Refactor] Extend modifiers to support weight-parallel optimization - QuantizationModifier #2220
- [AutoRound] Add DDP Support and Example #2411
MXFP4 vLLM Integration / Validation
Status: In Progress
- MXFP4A16 Support
- MXFP4 Support - Activation Quantization Validation (move examples out of the experimental folder)
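For context on what MXFP4 support entails, the sketch below illustrates the MX block format per the OCP Microscaling spec: elements are grouped into blocks of 32, each block shares a power-of-two (E8M0-style) scale, and elements are stored in FP4 E2M1 (representable magnitudes 0, 0.5, 1, 1.5, 2, 3, 4, 6). This is an illustrative toy, not LLM Compressor's implementation; function names are hypothetical.

```python
import math

# Representable magnitudes of FP4 E2M1, the MXFP4 element format.
FP4_VALUES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]

def quantize_mxfp4_block(block):
    """Quantize one block (nominally 32 floats) to MXFP4:
    a shared power-of-two scale plus FP4 E2M1 elements.
    Illustrative only; real kernels pack bits and handle edge cases."""
    amax = max(abs(v) for v in block)
    if amax == 0.0:
        return 1.0, [0.0] * len(block)
    # Shared exponent per the OCP MX recipe: floor(log2(amax)) minus the
    # element format's max exponent (2 for E2M1, whose max normal is 6).
    scale = 2.0 ** (math.floor(math.log2(amax)) - 2)
    quantized = []
    for v in block:
        mag = min(abs(v) / scale, 6.0)  # clip into the FP4 range
        nearest = min(FP4_VALUES, key=lambda q: abs(q - mag))
        quantized.append(math.copysign(nearest, v))
    return scale, quantized

def dequantize_mxfp4_block(scale, quantized):
    """Reconstruct approximate values from scale + FP4 elements."""
    return [scale * q for q in quantized]
```

Values that already sit on the FP4 grid round-trip exactly; everything else lands on the nearest representable point, which is the error GPTQ-style modifiers try to compensate for.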
MXFP8 Support
Status: In Progress
AWQ, GPTQ Improvements and Benchmarking
Status: In Progress
- [AWQ] Modifier Speedups #2265
- [AWQ] Option to disable quantization #2206
- [AWQ] Add Token Masking Support for Calibration #2250
- [NVFP4][GPTQ] Fix GPTQModifier + NVFP4 Support #2294
- [MXFP4][GPTQ] Extend rounding to support FP32 (compressed-tensors#551)
- [MXFP4][GPTQ] Add GPTQ + MXFP4A16 Example #2304
NVFP4 Improvements
Status: Not Yet Started
- vLLM Support for NVFP4 + micro-rotations: [RFC]: MR-GPTQ (GPTQ+NVFP4) #2006
- General benchmarking with AutoRound, QuantizationModifier, and AWQ
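For contrast with MXFP4 above: NVFP4 keeps FP4 E2M1 elements but uses smaller blocks of 16 with an FP8 E4M3 per-block scale (plus a per-tensor FP32 scale), giving finer-grained scaling than MXFP4's power-of-two exponents. The sketch below shows just the scale computation; names are hypothetical, and E4M3 range/denormal handling is omitted for brevity.

```python
import math

def round_to_e4m3(x):
    """Round a nonnegative float to the nearest FP8 E4M3 value
    (3 mantissa bits; saturation and denormals omitted for brevity)."""
    if x == 0.0:
        return 0.0
    m, e = math.frexp(x)       # x = m * 2**e, with 0.5 <= m < 1
    m = round(m * 16) / 16     # keep 3 mantissa bits after the leading 1
    return math.ldexp(m, e)

def nvfp4_block_scale(block, max_fp4=6.0):
    """Per-block NVFP4 scale: map the block's amax onto the largest FP4
    magnitude (6.0), then round the ratio to an FP8 E4M3 value. Unlike
    MXFP4, the scale is not restricted to powers of two."""
    amax = max(abs(v) for v in block)
    return round_to_e4m3(amax / max_fp4) if amax > 0 else 1.0
```

The finer scale granularity is one reason NVFP4 benchmarking against AutoRound, QuantizationModifier, and AWQ (above) is of interest.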
Transformers v5 Support
Status: Not Yet Started
- Support for updated MoE Calibration: [upstream] Expecting future huggingface/transformer incompatibility #2036
Quantized Model Support
Status: In Progress
- FP8 Block + NVFP4 DSR1 Support
- GLM Support: Added GLM Modeling #2170
- MiniMax-M2 Support: Minimax-M2 / M2.1 calibration #2171
- Qwen3.5 Support:
- Calibration Support: [Qwen3.5] Calibration support and NVFP4 Example #2482
- VL Examples: [Examples] Add Qwen3.5-27B NVFP4A16 and MXFP4A16 quantization examples #2467
CI/CD Buildkite Migration
Status: In Progress
- Migrate CI/CD to Buildkite