Pull requests: vllm-project/compressed-tensors

Switch test runners to use the vllm runners
#496 opened Oct 15, 2025 by dhuangnm

Update neuralmagic --> vllm-project for links
#495 opened Oct 15, 2025 by mgoin

[Attention] Support FP4 attention quantization
#491 opened Oct 14, 2025 by kylesayrs

Tensor Group Validation
#490 opened Oct 14, 2025 by kylesayrs

[Attention] R3 Attention Transform
#485 opened Oct 8, 2025 by kylesayrs

[MXFP4] Add calibration support
#440 opened Aug 28, 2025 by dsikka (Draft)

[MXFP4] Support MXFp4 Format
#439 opened Aug 28, 2025 by dsikka (Draft)

[Transform] Attention/Cache transforms
#436 opened Aug 26, 2025 by kylesayrs

[KV Cache] support kv cache int8 per channel quant
#398 opened Jul 19, 2025 by Eviannn

Optimize sparse 2:4 compression performance
#358 opened Jun 16, 2025 by rahul-tuli (Draft, 8 tasks done)

relax setuptools_scm version requirement
#343 opened Jun 6, 2025 by envolution