Skip to content

CUDA image fails to build on Rocky Linux 9 #685

@priteau

Description

@priteau

We need to bump the timestamp of the EPEL repository to get a newer dkms. Alternatively it seems that 575-open installs fine?

    openstack.openhpc: TASK [cuda : Install nvidia drivers] *******************************************
    openstack.openhpc: task path: /home/rocky/cuda-bump/ansible/roles/cuda/tasks/install.yml:38
    openstack.openhpc: fatal: [default]: FAILED! => {
    openstack.openhpc:     "changed": true,
    openstack.openhpc:     "cmd": [
    openstack.openhpc:         "dnf",
    openstack.openhpc:         "module",
    openstack.openhpc:         "install",
    openstack.openhpc:         "-y",
    openstack.openhpc:         "nvidia-driver"
    openstack.openhpc:     ],
    openstack.openhpc:     "delta": "0:00:01.078464",
    openstack.openhpc:     "end": "2025-05-27 20:10:27.156802",
    openstack.openhpc:     "rc": 1,
    openstack.openhpc:     "start": "2025-05-27 20:10:26.078338"
    openstack.openhpc: }
    openstack.openhpc:
    openstack.openhpc: STDOUT:
    openstack.openhpc:
    openstack.openhpc: Last metadata expiration check: 0:00:05 ago on Tue 27 May 2025 08:10:21 PM UTC.
    openstack.openhpc: (try to add '--skip-broken' to skip uninstallable packages or '--nobest' to use not only best candidate packages)
    openstack.openhpc:
    openstack.openhpc:
    openstack.openhpc: STDERR:
    openstack.openhpc:
    openstack.openhpc: Error:
    openstack.openhpc:  Problem 1: cannot install the best candidate for the job
    openstack.openhpc:   - nothing provides dkms >= 3.1.8 needed by kmod-nvidia-open-dkms-3:570.148.08-1.el9.noarch from cuda-rhel9-x86_64
    openstack.openhpc:  Problem 2: package nvidia-kmod-common-3:570.148.08-1.el9.noarch from cuda-rhel9-x86_64 requires nvidia-kmod = 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - cannot install the best candidate for the job
    openstack.openhpc:   - package kmod-nvidia-latest-dkms-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 is filtered out by modular filtering
    openstack.openhpc:   - nothing provides dkms >= 3.1.8 needed by kmod-nvidia-latest-dkms-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64
    openstack.openhpc:   - nothing provides dkms >= 3.1.8 needed by kmod-nvidia-open-dkms-3:570.148.08-1.el9.noarch from cuda-rhel9-x86_64
    openstack.openhpc:  Problem 3: package nvidia-driver-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 requires nvidia-kmod-common = 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - package nvidia-kmod-common-3:570.148.08-1.el9.noarch from cuda-rhel9-x86_64 requires nvidia-kmod = 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - cannot install the best candidate for the job
    openstack.openhpc:   - package kmod-nvidia-latest-dkms-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 is filtered out by modular filtering
    openstack.openhpc:   - nothing provides dkms >= 3.1.8 needed by kmod-nvidia-latest-dkms-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64
    openstack.openhpc:   - nothing provides dkms >= 3.1.8 needed by kmod-nvidia-open-dkms-3:570.148.08-1.el9.noarch from cuda-rhel9-x86_64
    openstack.openhpc:  Problem 4: package nvidia-driver-cuda-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 requires nvidia-kmod-common = 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - package nvidia-kmod-common-3:570.148.08-1.el9.noarch from cuda-rhel9-x86_64 requires nvidia-kmod = 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - cannot install the best candidate for the job
    openstack.openhpc:   - package kmod-nvidia-latest-dkms-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 is filtered out by modular filtering
    openstack.openhpc:   - nothing provides dkms >= 3.1.8 needed by kmod-nvidia-latest-dkms-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64
    openstack.openhpc:   - nothing provides dkms >= 3.1.8 needed by kmod-nvidia-open-dkms-3:570.148.08-1.el9.noarch from cuda-rhel9-x86_64
    openstack.openhpc:  Problem 5: package nvidia-driver-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 requires nvidia-kmod-common = 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - package nvidia-settings-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 requires nvidia-driver(x86-64) = 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - package nvidia-kmod-common-3:570.148.08-1.el9.noarch from cuda-rhel9-x86_64 requires nvidia-kmod = 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - cannot install the best candidate for the job
    openstack.openhpc:   - package kmod-nvidia-latest-dkms-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 is filtered out by modular filtering
    openstack.openhpc:   - nothing provides dkms >= 3.1.8 needed by kmod-nvidia-latest-dkms-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64
    openstack.openhpc:   - nothing provides dkms >= 3.1.8 needed by kmod-nvidia-open-dkms-3:570.148.08-1.el9.noarch from cuda-rhel9-x86_64
    openstack.openhpc:  Problem 6: package nvidia-driver-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 requires nvidia-kmod-common = 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - package xorg-x11-nvidia-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 requires nvidia-driver(x86-64) = 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - package nvidia-kmod-common-3:570.148.08-1.el9.noarch from cuda-rhel9-x86_64 requires nvidia-kmod = 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - cannot install the best candidate for the job
    openstack.openhpc:   - package kmod-nvidia-latest-dkms-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 is filtered out by modular filtering
    openstack.openhpc:   - nothing provides dkms >= 3.1.8 needed by kmod-nvidia-latest-dkms-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64
    openstack.openhpc:   - nothing provides dkms >= 3.1.8 needed by kmod-nvidia-open-dkms-3:570.148.08-1.el9.noarch from cuda-rhel9-x86_64
    openstack.openhpc:  Problem 7: package xorg-x11-nvidia-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 requires nvidia-driver(x86-64) = 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - package nvidia-driver-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 requires nvidia-kmod-common = 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - package nvidia-xconfig-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 requires xorg-x11-nvidia(x86-64) >= 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - package nvidia-kmod-common-3:570.148.08-1.el9.noarch from cuda-rhel9-x86_64 requires nvidia-kmod = 3:570.148.08, but none of the providers can be installed
    openstack.openhpc:   - cannot install the best candidate for the job
    openstack.openhpc:   - package kmod-nvidia-latest-dkms-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64 is filtered out by modular filtering
    openstack.openhpc:   - nothing provides dkms >= 3.1.8 needed by kmod-nvidia-latest-dkms-3:570.148.08-1.el9.x86_64 from cuda-rhel9-x86_64
    openstack.openhpc:   - nothing provides dkms >= 3.1.8 needed by kmod-nvidia-open-dkms-3:570.148.08-1.el9.noarch from cuda-rhel9-x86_64
    openstack.openhpc:   - package xorg-x11-nvidia-3:575.51.03-1.el9.x86_64 from cuda-rhel9-x86_64 is filtered out by modular filtering

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions