Skip to content

Conversation

@lukaszstolarczuk
Copy link
Contributor

@lukaszstolarczuk lukaszstolarczuk commented Nov 17, 2025

and fix building "Intel Drivers" images, as they currently can't be built because of "No space left on device" error.

@lukaszstolarczuk lukaszstolarczuk changed the title [CI] First build Base and Build containers, then Intel Drivers images [CI] First build Base and Build images, then Intel Drivers Nov 17, 2025
@lukaszstolarczuk lukaszstolarczuk marked this pull request as ready for review November 17, 2025 11:44
@lukaszstolarczuk lukaszstolarczuk requested a review from a team as a code owner November 17, 2025 11:44
@lukaszstolarczuk
Copy link
Contributor Author

lukaszstolarczuk commented Nov 17, 2025

example CI failure in nightly job last weekend: https://github.com/intel/llvm/actions/runs/19381290080/job/55460338609#step:3:914

extra cleanup step upgrades from 19G free to 52G free space available

@lukaszstolarczuk
Copy link
Contributor Author

Finally extra cleanup takes 1m 21s and gives extra 27GB of free space.
Currently cleanup seems to be required only for "Intel Drivers" images, but I kept it in both jobs for unity...?

@intel/dpcpp-devops-reviewers please review.

Comment on lines 88 to 78
# Then build "Intel Drivers" images that depend on previous images.
# Note: Building these images on PR means using old "Base" and "Build" images,
# as the ones above were not yet pushed.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Isn't it a deal breaker for us? @sarnex , @uditagarwal97

Copy link
Contributor

@sarnex sarnex Nov 17, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Probably yes, but if I understand correctly that the problem is hard drive space can't we just move the job from the GitHub hosted runners to a self-hosted one which has enough space?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1 to just moving docker container build jobs to self-hosted runners

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@aelovikov-intel, what is the deal breaker?

Just to be on the same page, there are 2 issues here. While I tried fixing issue that "intel drivers" are based on old Base/Build images I noticed the issue that "intel drivers" images can't be build (because they are too big/too litle space is available).

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

if you want to move building dockers to self-hosted runners, pls let me know which label should I used, at best.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

While I tried fixing issue that "intel drivers" are based on old Base/Build images

Are you saying that it's the same limitation when running this workflow in pre-commit (whenever image-related tasks/code are being updated) before/after PR?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I just created the docker-builder label and put it on some random Linux machine, can you try using that? Probably CI will fail because the OS isn't set up right but I'll set it up once I see the errors.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

docker-builder used.

Are you saying that it's the same limitation when running this workflow in pre-commit (whenever image-related tasks/code are being updated) before/after PR?

I actually only checked the nightly build - we fixed a dependency for benchmarks, but it was still failing on "nigthly" image, which is based on "intel drivers". Once I re-built "intel drivers" and "nightly" it all worked.

Anyway, I believe if we use only a single runner for building dockers the issue may be gone, because we have these dockers cached in the system...?

side note: we'd have to regularly clean/prune old images from that runner (to free up the disk space).

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

side note: we'd have to regularly clean/prune old images from that runner (to free up the disk space).

I already have to do this lol

Intel Drivers images were hitting 'No space left on device' error on public runners
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants