Skip to content

Conversation

@varun-sundar-rabindranath
Copy link
Contributor

@varun-sundar-rabindranath varun-sundar-rabindranath commented Nov 21, 2025

Purpose

Enable EPLB for QWen/Qwen3-30B-A3B-FP8 H100 Nightly tests. Add this test to B200 nightly as well.
QWen/Qwen3-30B-A3B-FP8 closely mirrors DeepSeek block FP8 models. This is useful so we can catch bugs like #29162 early.

Test Plan

Run .buildkite/scripts/scheduled_integration_test/qwen30b_a3b_fp8_block_ep_eplb.sh script locally on B200.

Test Result

Test passes with fix #29162

@varun-sundar-rabindranath
Copy link
Contributor Author

cc @robertgshaw2-redhat @mgoin @tlrmchlsmth . PTAL 🤚

@mergify mergify bot added ci/build qwen Related to Qwen models labels Nov 21, 2025
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request enables EPLB for QWen/Qwen3-30B-A3B-FP8 in nightly tests on H100 and adds a new, similar test for B200. The changes are functionally correct and achieve the stated goal. I have one suggestion to improve the maintainability of the Buildkite pipeline configuration by reducing duplication between the H100 and B200 test jobs.

Varun Sundar Rabindranath added 3 commits November 21, 2025 19:32
Signed-off-by: Varun Sundar Rabindranath <[email protected]>
Signed-off-by: Varun Sundar Rabindranath <[email protected]>
Signed-off-by: Varun Sundar Rabindranath <[email protected]>
num_gpus: 2
working_dir: "/vllm-workspace"
commands:
- bash .buildkite/scripts/scheduled_integration_test/qwen30b_a3b_fp8_block_ep_eplb.sh 0.8 200 8020 2 1
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

dp-size = 2 and tp-size 1

working_dir: "/vllm-workspace"
commands:
- bash .buildkite/scripts/scheduled_integration_test/qwen30b_a3b_fp8_block_ep.sh 0.8 200 8020
- bash .buildkite/scripts/scheduled_integration_test/qwen30b_a3b_fp8_block_ep_eplb.sh 0.8 200 8020
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

dp-size=2 tp-size=2

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci/build qwen Related to Qwen models

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants