Commit 9df46f2

[Nvidia] Remove 'no-cc-version-check' from installation of nvidia driver.
We remove it because:
1/ it is no longer supported by the NVIDIA installer;
2/ it was an unsafe workaround introduced in 3.8.0 (aws#2404), which was meant to be short-term but ended up staying for a long time;
3/ in 3.12.0 we introduced logic to install NVIDIA drivers using the gcc version used to compile the kernel (aws#2852).
1 parent d6ed357 commit 9df46f2
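
Point 3 of the commit message relies on knowing which gcc built the running kernel. A minimal sketch of that idea follows, assuming the version is read from the text of `/proc/version`. The helper names `kernel_gcc_major_version` and `compiler_prefix`, the regex, and the `/usr/bin/gcc-<major>` path layout are all hypothetical and are not taken from aws#2852.

```ruby
# Hypothetical helper, not taken from the cookbook: extract the major version
# of the gcc that built the running kernel from the text of /proc/version.
def kernel_gcc_major_version(proc_version)
  # Handles both "gcc version 7.3.1 ..." and "gcc (Ubuntu 11.4.0-...) 11.4.0",
  # including a suffixed binary name such as "x86_64-linux-gnu-gcc-12".
  match = proc_version.match(/gcc(?:-\d+)?\s+(?:version\s+|\([^)]*\)\s+)(\d+)\.\d+/)
  match && match[1].to_i
end

# Hypothetical helper: build the "CC=..." prefix placed before ./nvidia.run.
# The /usr/bin/gcc-<major> layout is an assumption; distributions differ.
# Returns an empty string when no version could be detected.
def compiler_prefix(proc_version)
  major = kernel_gcc_major_version(proc_version)
  major ? "CC=/usr/bin/gcc-#{major}" : ''
end

# On a real node the input would come from File.read('/proc/version').
# The string below is an illustrative sample, not output from a real system.
sample = 'Linux version 5.15.0 (buildd@host) (gcc (Ubuntu 11.4.0-1ubuntu1~22.04) 11.4.0, GNU ld 2.38) #1 SMP'
puts compiler_prefix(sample)
```

With a compiler selected this way, the installer's own compiler check can pass without being bypassed, which is why the flag is no longer needed.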

3 files changed: +5 -5 lines changed

CHANGELOG.md

Lines changed: 1 addition & 0 deletions
@@ -21,6 +21,7 @@ This file is used to list changes made in each version of the AWS ParallelCluste
 - Upgrade ARM PL to version 24.10 (from 23.10).
 - Remove generation of DSA keys for login nodes as DSA, which became unsupported in OpenSSH 9.7+.
 - Set instance ID and instance type information in Slurm upon compute nodes launch.
+- Install NVIDIA drivers without the option 'no-cc-version-check', which is now deprecated in the NVIDIA installer.
 
 3.12.0
 ------

cookbooks/aws-parallelcluster-platform/resources/nvidia_driver/partial/_nvidia_driver_common.rb

Lines changed: 1 addition & 2 deletions
@@ -73,14 +73,13 @@
   end
 
   # Install driver
-  # TODO remove --no-cc-version-check when we can update ubuntu 22 images
   bash 'nvidia.run advanced' do
     user 'root'
     group 'root'
     cwd '/tmp'
     code <<-NVIDIA
     set -e
-    #{compiler_path} ./nvidia.run --silent --dkms --disable-nouveau --no-cc-version-check -m=#{nvidia_kernel_module}
+    #{compiler_path} ./nvidia.run --silent --dkms --disable-nouveau -m=#{nvidia_kernel_module}
     rm -f /tmp/nvidia.run
     NVIDIA
     creates '/usr/bin/nvidia-smi'

cookbooks/aws-parallelcluster-platform/spec/unit/resources/nvidia_driver_spec.rb

Lines changed: 3 additions & 3 deletions
@@ -244,7 +244,7 @@ def self.setup(chef_run, nvidia_driver_version: nil)
             cwd: '/tmp',
             creates: '/usr/bin/nvidia-smi'
           )
-          .with_code(%r{CC=/usr/bin/gcc10-gcc ./nvidia.run --silent --dkms --disable-nouveau --no-cc-version-check -m=#{kernel_module}})
+          .with_code(%r{CC=/usr/bin/gcc10-gcc ./nvidia.run --silent --dkms --disable-nouveau -m=#{kernel_module}})
           .with_code(%r{rm -f /tmp/nvidia.run})
       end
     elsif platform == 'ubuntu' && version == '22.04'
@@ -272,7 +272,7 @@ def self.setup(chef_run, nvidia_driver_version: nil)
             cwd: '/tmp',
             creates: '/usr/bin/nvidia-smi'
           )
-          .with_code(%r{#{compiler_path} ./nvidia.run --silent --dkms --disable-nouveau --no-cc-version-check -m=#{kernel_module}})
+          .with_code(%r{#{compiler_path} ./nvidia.run --silent --dkms --disable-nouveau -m=#{kernel_module}})
       end
     else
       it "doesn't install gcc10" do
@@ -286,7 +286,7 @@ def self.setup(chef_run, nvidia_driver_version: nil)
             cwd: '/tmp',
             creates: '/usr/bin/nvidia-smi'
           )
-          .with_code(%r{./nvidia.run --silent --dkms --disable-nouveau --no-cc-version-check -m=#{kernel_module}})
+          .with_code(%r{./nvidia.run --silent --dkms --disable-nouveau -m=#{kernel_module}})
           .with_code(%r{rm -f /tmp/nvidia.run})
       end
     end
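
The updated spec patterns also act as a regression guard: because `--disable-nouveau` must now be followed directly by `-m=`, a script that still carries the removed flag no longer matches. The standalone sketch below illustrates this with plain Ruby regex matching, assuming the `with_code` matcher applies the regex to the rendered script text. The `kernel_module` value and both script strings are placeholders written for this example, not values taken from the spec.

```ruby
# Placeholder value; the spec defines its own kernel_module.
kernel_module = 'kernel-open'

# Pattern copied from the updated spec (no --no-cc-version-check).
pattern = %r{./nvidia.run --silent --dkms --disable-nouveau -m=#{kernel_module}}

# Illustrative script bodies before and after this commit.
old_code = "set -e\n./nvidia.run --silent --dkms --disable-nouveau --no-cc-version-check -m=#{kernel_module}\nrm -f /tmp/nvidia.run\n"
new_code = "set -e\n./nvidia.run --silent --dkms --disable-nouveau -m=#{kernel_module}\nrm -f /tmp/nvidia.run\n"

matches_old = pattern.match?(old_code)
matches_new = pattern.match?(new_code)

puts "old script matches: #{matches_old}" # false
puts "new script matches: #{matches_new}" # true
```

Note that the dots in the pattern are unescaped, so they match any character; this is harmless here but means the pattern is slightly looser than a literal comparison.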
