[core] MLA performance boost for AMD GPUs and tuned MoE config for MI…#13439
[core] MLA performance boost for AMD GPUs and tuned MoE config for MI…#13439qli88 wants to merge 3 commits intovllm-project:mainfrom
Conversation
|
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can either: Add 🚀 |
7e481ee to
7a76f70
Compare
mgoin
left a comment
There was a problem hiding this comment.
LGTM with a small fix needed, cc @LucasWilkinson
70a9795 to
4f2422a
Compare
…300X Signed-off-by: qli88 <qiang.li2@amd.com>
Signed-off-by: qli88 <qiang.li2@amd.com>
Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Signed-off-by: qli88 <qiang.li2@amd.com>
c131305 to
148e877
Compare
|
cc @houseroad |
| BLOCK = 64 | ||
| if is_hip_: | ||
| BLOCK = 8 | ||
|
|
There was a problem hiding this comment.
| BLOCK = 64 | |
| if is_hip_: | |
| BLOCK = 8 | |
| BLOCK = 64 if not is_hip_ else 8 |
|
why do we close the PR? |
@houseroad I created a new PR to adapt the commit landed yesterday (#12639). Please take a look at that one (#13718) |
Uh oh!
There was an error while loading. Please reload this page.