-
-
Notifications
You must be signed in to change notification settings - Fork 9.9k
[V1][Mamba] - Enable V1 by default for Mamba Models #23650
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
tdoublep
merged 18 commits into
vllm-project:main
from
Josephasafg:default_mamba_v1_support
Aug 27, 2025
Merged
Changes from 14 commits
Commits
Show all changes
18 commits
Select commit
Hold shift + click to select a range
49bdc08
feat: Default Support for Mamba models in V1
Josephasafg 4418048
Merge branch 'main' into default_mamba_v1_support
Josephasafg cde0a25
revert: docs due to conflicts
Josephasafg bac0a47
Merge branch 'default_mamba_v1_support' of github.com:Josephasafg/vll…
Josephasafg 4088cb9
Merge branch 'main' into default_mamba_v1_support
Josephasafg 7cf56bb
Merge branch 'main' into default_mamba_v1_support
Josephasafg ae83562
Merge remote-tracking branch 'upstream/main' into default_mamba_v1_su…
Josephasafg 6e2b2cf
fix: Updated tests to use v0
Josephasafg 6e4de9b
fix: Lint
Josephasafg 05e1c00
fix: Removed with
Josephasafg fb1755b
Merge branch 'main' into default_mamba_v1_support
Josephasafg 18a6ae7
fix: test_batching to test only first items in ssm and hybrid
Josephasafg de89dda
feat: Added FalconMambaForCausalLM to config map
Josephasafg 37c61ba
fix: Removed enable_prefix_caching=False
Josephasafg 2a9db64
fix: test_batching list
Josephasafg db941f8
fix: Removed flashinfer env
Josephasafg 1b46348
fix: Tests
Josephasafg ef82b62
fix: Removed full_cuda_graph param
Josephasafg File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -417,4 +417,5 @@ def verify_and_update_config(cls, vllm_config: "VllmConfig") -> None: | |
"GptOssForCausalLM": GptOssForCausalLMConfig, | ||
"MambaForCausalLM": MambaModelConfig, | ||
"Mamba2ForCausalLM": MambaModelConfig, | ||
"FalconMambaForCausalLM": MambaModelConfig, | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @tdoublep fine for this PR but I think this line makes vLLM not that plugable to new models. |
||
} |
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.