Skip to content
Merged
Show file tree
Hide file tree
Changes from 3 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions gpu-operator/getting-started.rst
Original file line number Diff line number Diff line change
Expand Up @@ -513,6 +513,8 @@ For Rancher Kubernetes Engine 2 (RKE2), refer to
`Deploy NVIDIA Operator <https://docs.rke2.io/advanced#deploy-nvidia-operator>`__
in the RKE2 documentation.

Refer to the :ref:`v24.9.0-known-limitations`.

MicroK8s
========

Expand Down
13 changes: 13 additions & 0 deletions gpu-operator/release-notes.rst
Original file line number Diff line number Diff line change
Expand Up @@ -53,6 +53,8 @@ New Features

For more information regarding NVIDIA security vulnerability remediation policies, refer to https://www.nvidia.com/en-us/security/psirt-policies/.

For Rancher RKE2 and K3s, refer to the :ref:`v24.9.0-known-limitations`.

* Added support for the NVIDIA Data Center GPU Driver version 550.127.05.
Refer to the :ref:`GPU Operator Component Matrix`
on the platform support page.
Expand Down Expand Up @@ -127,6 +129,17 @@ Fixed Issues
Previously, the toolkit could configure the ``runc`` handler with the ``nvidia`` runtime handler even if ``runc`` was not the default runtime and cause CRI-O to crash.
In this release, the toolkit determines the default runtime by running ``crio status config`` and configures that runtime with the ``nvidia`` runtime handler.

.. _v24.9.0-known-limitations:

Known Limitations
-----------------

* On Rancher RKE2 and K3s, NVIDIA Container Toolkit v1.17.0 fails to start.
The toolkit attempts to use the ``containerd`` command to determine the container runtime configuration on the host.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To be specific , it attempts to run the following commands depending on the container runtime used for the cluster

i. containerd config dump
ii. crio status config

On these platforms, the ``containerd`` executable is not on the PATH and results in an error.

NVIDIA recommends installing v1.17.1 of the toolkit when you install or upgrade the Operator.
You can specify the ``--set toolkit.version=v1.17.1-ubuntu20.04`` or ``v1.17.1-ubi8`` argument to Helm.

.. _v24.6.2:

Expand Down