Skip to content

Commit a43e2f6

Browse files
authored
[CI] Update vLLM to v0.11.0 (#3315)
### What this PR does / why we need it? There are 3 step to upgrade vllm-ascend to newest vllm. We'll create 3 PR - [x] Upgrade vllm to v0.11.0 to make CI happy first . - [ ] Move deepseek v3.2 to vllm way - [ ] Then we'll add a new PR to add vllm main support. ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.11.0 Signed-off-by: wangxiyuan <[email protected]>
1 parent f12f76d commit a43e2f6

15 files changed

+20
-17
lines changed

.github/workflows/accuracy_test.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -112,7 +112,7 @@ jobs:
112112
uses: actions/checkout@v4
113113
with:
114114
repository: vllm-project/vllm
115-
ref: v0.11.0rc3
115+
ref: v0.11.0
116116
path: ./vllm-empty
117117

118118
- name: Install vllm-project/vllm from source

.github/workflows/format_pr_body.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,7 @@ jobs:
3636

3737
- name: Get vLLM version
3838
run: |
39-
VLLM_COMMIT=releases/v0.11.0
39+
VLLM_COMMIT=v0.11.0
4040
echo "VLLM_COMMIT=https://github.com/vllm-project/vllm/commit/$VLLM_COMMIT" >> $GITHUB_ENV
4141
4242
- name: Checkout repository

.github/workflows/nightly_benchmarks.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -51,7 +51,7 @@ jobs:
5151
strategy:
5252
matrix:
5353
include:
54-
- vllm_branch: v0.11.0rc3
54+
- vllm_branch: v0.11.0
5555
vllm_ascend_branch: main
5656
vllm_use_v1: 1
5757
max-parallel: 1

.github/workflows/vllm_ascend_dist.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,7 @@ jobs:
4343
strategy:
4444
matrix:
4545
os: [linux-aarch64-a3-8]
46-
vllm_version: [v0.11.0rc3]
46+
vllm_version: [v0.11.0]
4747
name: vLLM Ascend test
4848
runs-on: ${{ matrix.os }}
4949
container:

.github/workflows/vllm_ascend_test.yaml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -42,7 +42,7 @@ jobs:
4242
lint:
4343
uses: ./.github/workflows/pre-commit.yml
4444
with:
45-
vllm: releases/v0.11.0
45+
vllm: v0.11.0
4646

4747
changes:
4848
runs-on: ubuntu-latest
@@ -83,7 +83,7 @@ jobs:
8383
VLLM_USE_MODELSCOPE: True
8484
strategy:
8585
matrix:
86-
vllm_version: [releases/v0.11.0, v0.11.0rc3]
86+
vllm_version: [v0.11.0]
8787
steps:
8888
- name: Install packages
8989
run: |
@@ -145,7 +145,7 @@ jobs:
145145
name: e2e-light
146146
strategy:
147147
matrix:
148-
vllm_version: [releases/v0.11.0, v0.11.0rc3]
148+
vllm_version: [v0.11.0]
149149
# Note (yikun): If CI resource are limited we can split job into two chain jobs
150150
needs: [lint, changes]
151151
# only trigger e2e test after lint passed and the change is e2e related with pull request.

.github/workflows/vllm_ascend_test_310p.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -53,7 +53,7 @@ jobs:
5353
max-parallel: 2
5454
matrix:
5555
os: [linux-aarch64-310p-1, linux-aarch64-310p-4]
56-
vllm_version: [v0.11.0rc3]
56+
vllm_version: [v0.11.0]
5757
name: 310p e2e test
5858
runs-on: ${{ matrix.os }}
5959
container:

.github/workflows/vllm_ascend_test_full.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -68,7 +68,7 @@ jobs:
6868
name: e2e-full
6969
strategy:
7070
matrix:
71-
vllm_version: [releases/v0.11.0, v0.11.0rc3]
71+
vllm_version: [v0.11.0]
7272
needs: [changes]
7373
if: ${{ needs.changes.outputs.e2e_tracker == 'true' }}
7474
uses: ./.github/workflows/_e2e_test.yaml

.github/workflows/vllm_ascend_test_full_vllm_0.11.0.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -45,7 +45,7 @@ jobs:
4545
e2e-test:
4646
uses: ./.github/workflows/_e2e_test.yaml
4747
with:
48-
vllm: releases/v0.11.0
48+
vllm: v0.11.0
4949
runner: linux-aarch64-a2
5050
image: swr.cn-southwest-2.myhuaweicloud.com/base_image/ascend-ci/cann:8.2.rc1-910b-ubuntu22.04-py3.11
5151
type: full

Dockerfile

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,7 @@ RUN pip config set global.index-url ${PIP_INDEX_URL}
3737

3838
# Install vLLM
3939
ARG VLLM_REPO=https://github.com/vllm-project/vllm.git
40-
ARG VLLM_TAG=v0.11.0rc3
40+
ARG VLLM_TAG=v0.11.0
4141
RUN git clone --depth 1 $VLLM_REPO --branch $VLLM_TAG /vllm-workspace/vllm
4242
# In x86, triton will be installed by vllm. But in Ascend, triton doesn't work correctly. we need to uninstall it.
4343
RUN VLLM_TARGET_DEVICE="empty" python3 -m pip install -v -e /vllm-workspace/vllm/ --extra-index https://download.pytorch.org/whl/cpu/ && \

Dockerfile.310p

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,7 @@ RUN pip config set global.index-url ${PIP_INDEX_URL}
3737

3838
# Install vLLM
3939
ARG VLLM_REPO=https://github.com/vllm-project/vllm.git
40-
ARG VLLM_TAG=v0.11.0rc3
40+
ARG VLLM_TAG=v0.11.0
4141
RUN git clone --depth 1 $VLLM_REPO --branch $VLLM_TAG /vllm-workspace/vllm
4242
# In x86, triton will be installed by vllm. But in Ascend, triton doesn't work correctly. we need to uninstall it.
4343
RUN VLLM_TARGET_DEVICE="empty" python3 -m pip install -v -e /vllm-workspace/vllm/ --extra-index https://download.pytorch.org/whl/cpu/ && \

0 commit comments

Comments
 (0)