-
-
Notifications
You must be signed in to change notification settings - Fork 11.5k
[Build/CI][DP/EP] Add QWen/Qwen3-30B-A3B-FP8 + EPLB tests to Nightly H100 and B200 #29195
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
[Build/CI][DP/EP] Add QWen/Qwen3-30B-A3B-FP8 + EPLB tests to Nightly H100 and B200 #29195
Conversation
|
cc @robertgshaw2-redhat @mgoin @tlrmchlsmth . PTAL 🤚 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code Review
This pull request enables EPLB for QWen/Qwen3-30B-A3B-FP8 in nightly tests on H100 and adds a new, similar test for B200. The changes are functionally correct and achieve the stated goal. I have one suggestion to improve the maintainability of the Buildkite pipeline configuration by reducing duplication between the H100 and B200 test jobs.
Signed-off-by: Varun Sundar Rabindranath <[email protected]>
Signed-off-by: Varun Sundar Rabindranath <[email protected]>
Signed-off-by: Varun Sundar Rabindranath <[email protected]>
c4eca46 to
8a1d253
Compare
| num_gpus: 2 | ||
| working_dir: "/vllm-workspace" | ||
| commands: | ||
| - bash .buildkite/scripts/scheduled_integration_test/qwen30b_a3b_fp8_block_ep_eplb.sh 0.8 200 8020 2 1 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
dp-size = 2 and tp-size 1
| working_dir: "/vllm-workspace" | ||
| commands: | ||
| - bash .buildkite/scripts/scheduled_integration_test/qwen30b_a3b_fp8_block_ep.sh 0.8 200 8020 | ||
| - bash .buildkite/scripts/scheduled_integration_test/qwen30b_a3b_fp8_block_ep_eplb.sh 0.8 200 8020 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
dp-size=2 tp-size=2
Purpose
Enable EPLB for
QWen/Qwen3-30B-A3B-FP8H100 Nightly tests. Add this test to B200 nightly as well.QWen/Qwen3-30B-A3B-FP8closely mirrors DeepSeek block FP8 models. This is useful so we can catch bugs like #29162 early.Test Plan
Run
.buildkite/scripts/scheduled_integration_test/qwen30b_a3b_fp8_block_ep_eplb.shscript locally on B200.Test Result
Test passes with fix #29162