Skip to content

Commit 72e89c3

Browse files
committed
fix: add /usr/bin symlinks for nvidia-ctk and nvidia-cdi-hook
The gpu-operator device plugin generates CDI specs with hooks pointing to /usr/bin/nvidia-ctk and /usr/bin/nvidia-cdi-hook (hardcoded defaults in NVIDIA/k8s-device-plugin and NVIDIA/nvidia-container-toolkit). Talos extensions install these binaries under /usr/local/bin/, causing pods requesting nvidia.com/gpu resource limits to fail. Add symlinks from /usr/bin/nvidia-ctk and /usr/bin/nvidia-cdi-hook to their /usr/local/bin/ counterparts, following the same pattern as the existing /usr/bin/ldconfig symlink. This eliminates the need for users to set NVIDIA_CDI_HOOK_PATH in the gpu-operator values. Requires siderolabs/talos#13021 validator allowlist update. Signed-off-by: David Orman <ormandj@corenode.com>
1 parent 20f2cac commit 72e89c3

File tree

1 file changed

+4
-0
lines changed
  • nvidia-gpu/nvidia-container-toolkit/nvidia-container-runtime

1 file changed

+4
-0
lines changed

nvidia-gpu/nvidia-container-toolkit/nvidia-container-runtime/pkg.yaml

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -35,6 +35,10 @@ steps:
3535
cd container-toolkit
3636
3737
find . -maxdepth 1 -type f -executable -exec cp {} /rootfs/usr/local/bin/ \;
38+
- |
39+
mkdir -p /rootfs/usr/bin
40+
ln -s /usr/local/bin/nvidia-ctk /rootfs/usr/bin/nvidia-ctk
41+
ln -s /usr/local/bin/nvidia-cdi-hook /rootfs/usr/bin/nvidia-cdi-hook
3842
- |
3943
mkdir -p /rootfs/etc/cri/conf.d
4044
cp /pkg/10-nvidia-container-runtime.part /rootfs/etc/cri/conf.d/10-nvidia-container-runtime.part

0 commit comments

Comments
 (0)