Skip to content

Turn off 2:4 sparse compression until supported in vllm#1092

Merged
dsikka merged 1 commit intomainfrom
update-24-example
Jan 23, 2025
Merged

Turn off 2:4 sparse compression until supported in vllm#1092
dsikka merged 1 commit intomainfrom
update-24-example

Conversation

@rahul-tuli
Copy link
Copy Markdown
Collaborator

@rahul-tuli rahul-tuli commented Jan 23, 2025

This PR temporarily disables the newly added Sparse24 compression feature in example script, as support for this feature is not yet available in vLLM.

Support for Sparse24 compression is being added in vLLM via this PR. Once that PR is merged, this change will be reverted to re-enable the feature.

@github-actions
Copy link
Copy Markdown

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

Signed-off-by: Rahul Tuli <rahul@neuralmagic.com>
@rahul-tuli rahul-tuli marked this pull request as ready for review January 23, 2025 18:54
@rahul-tuli rahul-tuli self-assigned this Jan 23, 2025
@dsikka dsikka added the ready When a PR is ready for review label Jan 23, 2025
@dsikka dsikka merged commit 7610854 into main Jan 23, 2025
@dsikka dsikka deleted the update-24-example branch January 23, 2025 20:48
rahul-tuli added a commit that referenced this pull request Jan 28, 2025
This PR temporarily disables the newly added Sparse24 compression
feature in example script, as support for this feature is not yet
available in vLLM.

Support for Sparse24 compression is being added in vLLM via [this
PR](vllm-project/vllm#12097). Once that PR is
merged, this change will be reverted to re-enable the feature.

Signed-off-by: Rahul Tuli <rahul@neuralmagic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready When a PR is ready for review

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants