Build VLLM CUDA from AIPCC wheels #86
Conversation
…d-hat-data-services#85)
* Update Dockerfile.ubi to install vllm-cuda using a wheel from the RHEL AI team. The install script is located in payload/run.sh. An args file was also added with the custom parameters and is referenced in the Tekton pipeline.
* Update payload/run.sh to use the bot token.
* Add a trap to guarantee run.sh deletion.
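The actual payload/run.sh is not shown in this PR; the following is only a minimal sketch of what the commit description implies (fetch the wheel bundle, install it, and trap EXIT so the script deletes itself). Variable names, the temp-dir layout, and the bot-token auth header are assumptions.

```sh
#!/bin/bash
# Hypothetical sketch of payload/run.sh: install vLLM CUDA wheels from the
# RHEL AI wheel bundle, and remove this script no matter how it exits.
set -euo pipefail

# "Trap to guarantee run.sh deletion": the script removes itself on exit so the
# payload never lingers in the final image layer.
trap 'rm -f "$0"' EXIT

WHEEL_RELEASE="${WHEEL_RELEASE:?must be set}"
WHEEL_RELEASE_ARTIFACTS="https://gitlab.com/api/v4/projects/68045055/packages/generic/rhelai-wheels/${WHEEL_RELEASE}/wheels-${WHEEL_RELEASE}.tar.gz"

workdir=$(mktemp -d)
# Assumption: the bot token is passed as a GitLab PRIVATE-TOKEN header.
curl -fsSL --header "PRIVATE-TOKEN: ${BOT_TOKEN:?must be set}" \
    -o "${workdir}/wheels.tar.gz" "${WHEEL_RELEASE_ARTIFACTS}"
tar -xzf "${workdir}/wheels.tar.gz" -C "${workdir}"

# Install every wheel found in the extracted bundle.
find "${workdir}" -name '*.whl' -exec pip install --no-cache-dir {} +
rm -rf "${workdir}"
```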
Let's hold on merging this until we figure out if we can get a vLLM v0.7.3 tag in the midstream and a build from the AIPCC team.
@dtrifiro should I leave the 2.19 branch the way it is?
Yeah, let's wait until we come up with a definitive strategy.
Sounds good, I will draft this for now and will not make any changes to 2.19 (which already has this change merged in there).
WHEEL_RELEASE_ARTIFACTS="https://gitlab.com/api/v4/projects/68045055/packages/generic/rhelai-wheels/${WHEEL_RELEASE}/wheels-${WHEEL_RELEASE}.tar.gz"
# NOTE - ensure that flashinfer is included in the wheel bundle
Looks like it's included? https://issues.redhat.com/browse/AIPCC-49
@tarukumar fyi
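One quick way to confirm this locally, assuming the wheels-${WHEEL_RELEASE}.tar.gz bundle from the diff above has already been downloaded (a sketch, not part of this PR):

```sh
# Check the wheel bundle itself for a flashinfer wheel...
tar -tzf "wheels-${WHEEL_RELEASE}.tar.gz" | grep -i flashinfer
# ...or, after the image build, check the installed environment.
pip list 2>/dev/null | grep -i flashinfer
```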
## CUDA Base ###################################################################
- FROM python-install as cuda-base
+ FROM python-install AS cuda-base
Does this cuda-base layer include all of the rpm dependencies required by the wheel set? Should be enough to check with ldd (?)
If I was able to get a green build, does that mean that all the required dependencies are there? Or do I need to do additional checks?
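A green build mainly proves the wheels installed; it does not necessarily exercise every native extension. A minimal sketch of the ldd check suggested above, run inside the built image (the path discovery is illustrative):

```sh
# Scan the installed packages' native extensions for unresolved shared libraries.
site_packages=$(python3 -c 'import sysconfig; print(sysconfig.get_paths()["purelib"])')
find "${site_packages}" -name '*.so*' | while read -r so; do
    if ldd "${so}" 2>/dev/null | grep -q 'not found'; then
        echo "missing dependencies for ${so}:"
        ldd "${so}" | grep 'not found'
    fi
done
```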
Force-pushed from 9059191 to 77c398d
/build
/build-from-rhelai-wheels
## Base Layer ##################################################################
- FROM registry.access.redhat.com/ubi9/ubi-minimal:${BASE_UBI_IMAGE_TAG} as base
+ FROM registry.access.redhat.com/ubi9/ubi-minimal:${BASE_UBI_IMAGE_TAG} AS base
could you please try with a base image built by AIPCC: http://quay.io/aipcc/base-images/cuda
if your builder doesn't have access to that image, you can ping me on Slack and I'll send you the procedure to get access.
also for reference, this is the dockerfile for RHAIIS: https://gitlab.com/redhat/rhel-ai/rhaiis/containers/-/blob/main/Containerfile.cuda-ubi9?ref_type=heads
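For illustration, the suggested swap might look roughly like this, assuming the AIPCC image takes a tag build argument analogous to BASE_UBI_IMAGE_TAG (the ARG name and tag value here are assumptions):

```dockerfile
## Base Layer ##################################################################
# Illustrative: replace the ubi-minimal base with the AIPCC-built CUDA base image.
ARG CUDA_BASE_IMAGE_TAG=latest
FROM quay.io/aipcc/base-images/cuda:${CUDA_BASE_IMAGE_TAG} AS base
```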