Closed
Release Checklist
Release Version: v0.9.1rc3
Release Branch: v0.9.1-dev
Release Date: 2025/08/20
Release Manager: @shen-shanshan
Prepare Release Note
- Create a new issue for release feedback: [v0.9.1rc3] FAQ / Feedback | 问题/反馈 #2410
- Write the release note PR: [0.9.1][Doc] Add release note for v0.9.1rc3 #2431
- Update the feedback issue link in docs/source/faqs.md
- Add release note to docs/source/user_guide/release_notes.md
- Update version info in docs/source/community/versioning_policy.md
- Update contributor info in docs/source/community/contributors.md
- Update package version in docs/conf.py
PRs to Merge
- [Bugfix] Fix grammar_bitmask IndexError caused by outdated apply_grammar_bitmask method #2314
- [Bugfix] Fix the bug that qwen3 moe doesn't work with aclgraph #2478
Functional Test
- DeepSeek W8A8 MTP with V1 Scheduler (A3 DP4 TP4) @Potabk
- In DeepSeek-R1 W8A8 PD disaggregated Decode instance, using pure DP, with lmhead_tensor_parallel_size=8 @zhangxinyuehfad
- DeepSeek with V1 scheduler (with chunked prefill enabled) @MengqingCao
--additional_config={"lmhead_tensor_parallel_size": 8}
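
The `--additional_config` value above is a JSON object, so it usually needs shell quoting when passed on a command line. A minimal sketch of building that argument follows; the helper name `additional_config_arg` is hypothetical and not part of vllm-ascend, and only the flag name and the `lmhead_tensor_parallel_size` key come from this checklist.

```python
import json
import shlex


def additional_config_arg(config: dict) -> str:
    """Build a shell-safe --additional_config argument (hypothetical helper)."""
    # Serialize the dict to JSON, then quote it so the shell passes it as one token.
    payload = json.dumps(config)
    return "--additional_config=" + shlex.quote(payload)


# The key and value below are the ones listed in the functional test item.
arg = additional_config_arg({"lmhead_tensor_parallel_size": 8})
print(arg)
```

Quoting with `shlex.quote` keeps the braces and double quotes intact under POSIX shells; other shells may need different quoting.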
Doc Test
- Tutorial is updated.
- User Guide is updated.
- Developer Guide is updated.
Prepare Artifacts
- Docker image is ready.
- Wheel package is ready.
Release Step
- Release note PR is merged.
- Post the release on GitHub release page.
- Generate official doc page on https://app.readthedocs.org/dashboard/
- Wait for the wheel package to be available on https://pypi.org/project/vllm-ascend
- Wait for the docker image to be available on https://quay.io/ascend/vllm-ascend
- Upload 310p wheel to GitHub release page
- Broadcast the release news (by message, blog, etc.)
- Close this issue
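
The "wait for the wheel package" step can be checked by script instead of by refreshing the page. The sketch below assumes PyPI's project JSON endpoint (`https://pypi.org/pypi/<project>/json`) and its `releases` mapping of version strings to uploaded files. The function names are hypothetical, and stripping the leading `v` assumes the PyPI version is published as `0.9.1rc3`.

```python
import json
import urllib.request

# Assumed endpoint: PyPI's per-project JSON metadata.
PYPI_JSON_URL = "https://pypi.org/pypi/{project}/json"


def version_published(metadata: dict, version: str) -> bool:
    """Return True if the version has at least one uploaded file."""
    # Release tags here use a leading "v"; PyPI versions normally do not.
    if version.startswith("v"):
        version = version[1:]
    files = metadata.get("releases", {}).get(version, [])
    return len(files) > 0


def fetch_metadata(project: str) -> dict:
    """Download the project metadata from PyPI (requires network access)."""
    url = PYPI_JSON_URL.format(project=project)
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)
```

For this release, a call would look like `version_published(fetch_metadata("vllm-ascend"), "v0.9.1rc3")`. An entry with an empty file list is treated as not yet published.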