Investigate CUDA 13 `--compress-mode size`

CUDA 12.8 introduced the `nvcc` option [`--compress-mode`](https://docs.nvidia.com/cuda/archive/13.0.0/cuda-compiler-driver-nvcc/index.html#compress-mode-default-size-speed-balance-none-compress-mode), which allows the fatbin compression behaviour to be tuned.

The default value changed with CUDA 13, leading to smaller binaries (90MB vs 160MB for our Python wheels, although other differences between the builds will also contribute). PyPI has a limit of 100MB per wheel and 10GB per project ([source](https://docs.pypi.org/project-management/storage-limits)).
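As a rough aid for tracking this, the sketch below reports how much headroom each built wheel has under the per-file limit. The `dist` directory, the helper names, and reading "100MB" as 100 * 10^6 bytes (the stricter interpretation) are assumptions made for illustration, not something taken from the PyPI documentation.

```python
from pathlib import Path

# Assumption: "100MB" is read as 100 * 10**6 bytes, the stricter interpretation.
PYPI_WHEEL_LIMIT_BYTES = 100 * 10**6


def wheel_headroom(size_bytes: int, limit: int = PYPI_WHEEL_LIMIT_BYTES) -> int:
    """Return the bytes remaining under the limit (negative if over it)."""
    return limit - size_bytes


def check_wheels(dist_dir: str = "dist") -> dict[str, int]:
    """Map each wheel file name in dist_dir to its headroom in bytes."""
    wheels = sorted(Path(dist_dir).glob("*.whl"))
    return {whl.name: wheel_headroom(whl.stat().st_size) for whl in wheels}
```

Calling `check_wheels("dist")` after a build gives a quick view of which wheels are over the limit (negative headroom) and by how much.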

With CUDA 13, the default value is `balance` according to a [blog post](https://developer.nvidia.com/blog/whats-new-and-important-in-cuda-toolkit-13-0/#cuda_130_improves_default_compression_with_zstandard_zstd%C2%A0), although the [docs](https://docs.nvidia.com/cuda/archive/13.0.1/cuda-compiler-driver-nvcc/index.html#compress-mode-default-size-speed-balance-none-compress-mode) state that it is still `speed`.


For our Python wheels, we might want to choose `size` instead, depending on how much it reduces wheel size and how much it slows down fatbin decompression.
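One possible way to start the size half of this investigation is to compile the same source to a fatbin under each mode and compare the file sizes. The sketch below assumes `nvcc` is on `PATH`. The helper names, the `sm_80` target, and the choice of `speed` as the baseline are illustrative assumptions. The mode names are taken from the anchor of the linked documentation. It does not measure decompression cost, which would need separate timing of module load at runtime.

```python
import shutil
import subprocess
from pathlib import Path

# Mode names as listed in the linked nvcc documentation anchor.
COMPRESS_MODES = ("default", "size", "speed", "balance", "none")


def nvcc_fatbin_command(source, output, mode, arch="sm_80"):
    """Build an nvcc command line that emits a fatbin with the given mode.

    The arch value is an illustrative assumption; use the project's real targets.
    """
    if mode not in COMPRESS_MODES:
        raise ValueError(f"unknown compress mode: {mode!r}")
    return ["nvcc", "--fatbin", f"--compress-mode={mode}",
            f"-arch={arch}", "-o", str(output), str(source)]


def fatbin_sizes(source, out_dir, modes=COMPRESS_MODES):
    """Compile source once per mode and return {mode: fatbin size in bytes}."""
    if shutil.which("nvcc") is None:
        raise RuntimeError("nvcc not found on PATH")
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    sizes = {}
    for mode in modes:
        output = out_dir / f"{Path(source).stem}.{mode}.fatbin"
        subprocess.run(nvcc_fatbin_command(source, output, mode), check=True)
        sizes[mode] = output.stat().st_size
    return sizes


def relative_sizes(sizes, baseline="speed"):
    """Express each size as a fraction of the baseline mode's size."""
    return {mode: size / sizes[baseline] for mode, size in sizes.items()}
```

For example, `relative_sizes(fatbin_sizes("kernel.cu", "build/fatbins"))` (with a hypothetical `kernel.cu`) would show how much smaller `size` is than `speed` for a representative translation unit.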

Investigate CUDA 13 `--compress-mode size` #1349
