
Conversation

@rudiservo
Contributor

Issue #12500: CUDA Docker images crash on old GPUs (and possibly some more recent ones) because token_embd.weight is processed by the CPU backend; since BMI2 support was added, this crashes the program on hardware that lacks the instruction set.

It was recommended to add all CPU backend variants to all GPU images, since GPU images would also benefit from broader CPU compatibility.
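For context, llama.cpp exposes this through CMake options; a rough sketch of what a GPU image build enabling runtime CPU dispatch might look like (the flag names `GGML_BACKEND_DL`, `GGML_CPU_ALL_VARIANTS`, and `GGML_CUDA` come from llama.cpp's CMake options, but the exact invocation here is an illustrative assumption, not the repo's Dockerfile):

```shell
# Sketch: build a CUDA image with all runtime-dispatched CPU variants.
# GGML_BACKEND_DL loads backends as shared libraries at runtime;
# GGML_CPU_ALL_VARIANTS builds one CPU backend per feature level
# (e.g. with and without BMI2), so the loader can pick one the host supports.
cmake -B build \
    -DGGML_BACKEND_DL=ON \
    -DGGML_CPU_ALL_VARIANTS=ON \
    -DGGML_CUDA=ON
cmake --build build --config Release
```

With this, a host without BMI2 simply loads a CPU variant compiled without BMI2 instructions instead of crashing with an illegal instruction.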

@rudiservo rudiservo requested a review from ngxson as a code owner April 4, 2025 13:34
@github-actions github-actions bot added the devops improvements to build systems and github actions label Apr 4, 2025
@ngxson ngxson requested a review from slaren April 4, 2025 13:53
Member

@slaren slaren left a comment


Looks good, but if any of these images are intended to be used on Arm (maybe Vulkan?), it would need the same logic as the CPU image to disable GGML_BACKEND_DL on Arm builds.
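For reference, the CPU image gates this on the build architecture; a hedged sketch of what the same logic could look like in a Dockerfile (`TARGETARCH` is Docker BuildKit's standard automatic build arg, but the conditional below is an assumption for illustration, not the repo's actual Dockerfile):

```dockerfile
# Sketch only: disable GGML_BACKEND_DL on Arm, as the CPU image does,
# and fall back to a single natively-tuned CPU backend there.
ARG TARGETARCH
RUN if [ "$TARGETARCH" = "arm64" ]; then \
        cmake -B build -DGGML_BACKEND_DL=OFF -DGGML_NATIVE=ON; \
    else \
        cmake -B build -DGGML_BACKEND_DL=ON -DGGML_CPU_ALL_VARIANTS=ON; \
    fi \
    && cmake --build build --config Release
```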

@rudiservo
Contributor Author

As far as I can tell, only the CPU images are built for ARM; all GPU images are built for AMD64.
If you want GPU builds for ARM, I can add them in a later PR, maybe? I don't have an ARM machine with a GPU to test on, so I'd need someone else to test it.

@slaren slaren merged commit b0091ec into ggml-org:master Apr 9, 2025
2 checks passed
colout pushed a commit to colout/llama.cpp that referenced this pull request Apr 21, 2025
timwu pushed a commit to timwu/llama.cpp that referenced this pull request May 5, 2025
@aubinkure

I was running into the same segfault when running llama-quantize in the Docker container as in these issues:
#11683
#12564
#11196

I traced it back to this PR. Any thoughts on how to fix it?

Separately, it seems like there's no CI on the Docker images? A few short tests would be quite helpful, I think.
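As a starting point, a smoke test could run right after each image build in CI; a minimal sketch as a GitHub Actions step (the job layout, image tag, and invocation are illustrative assumptions, not the project's actual workflow):

```yaml
# Sketch: smoke-test a freshly built image on the runner's own CPU.
# Simply starting the binary would have surfaced the BMI2
# illegal-instruction crash before the image was published.
- name: Smoke-test Docker image
  run: |
    docker run --rm local/llama.cpp:server-cuda --version
```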

@rudiservo
Contributor Author

@aubinkure how can you trace it back to this PR if this PR is more recent than the issues you referred to?
You traced it forward, m8; either this solves it or it doesn't. The only thing this PR does is add CPU support to all GPU images.
If anything it's an issue with BMI2, so not this issue or PR.

@aubinkure

You're right, it seems this PR isn't related to the issues I mentioned. That said, I double-checked, and this PR is definitely breaking quantization for me inside a Docker container. I'll open a new issue with steps to reproduce.

