CUDA: correct the lowest Maxwell supported by CUDA 12 #11984

PureJourney · 2025-02-20T21:03:30Z

The lowest architecture supported by CUDA 12 is Maxwell,
and 5.0 is the lowest compute capability in the Maxwell family.

JohannesGaessler

You're right, I thought CUDA 12 needed compute capability 5.2 or higher but it is indeed 5.0 or higher.

ggml/src/ggml-cuda/CMakeLists.txt

Co-authored-by: Johannes Gäßler <[email protected]>

LostRuins · 2025-02-24T03:50:57Z

Are there any downsides to targeting the lower compute capability 5.0, e.g. for a GTX 970? Performance should still be the same as it was for 5.2, right?

JohannesGaessler · 2025-02-24T08:03:38Z

I think with this configuration CMake will produce PTX and PTXAS for compute capability 5.0. PTX is the CUDA equivalent of assembly, PTXAS is the binary code that can be run directly. So for a CC 5.2 GPU there will be just-in-time compilation of PTX to PTXAS and the startup will take longer. Other than that there should be no downsides.
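To make the PTX versus binary distinction concrete, here is a minimal CMake sketch. It is not the diff from this pull request: the project name `arch_demo`, the target `demo_kernels`, and the file `kernels.cu` are hypothetical. It relies only on the documented `CUDA_ARCHITECTURES` target property, where a bare number requests both the binary ("real") and PTX ("virtual") variants, and the `-real` / `-virtual` suffixes request just one of them.

```cmake
# Hypothetical stand-alone example, not taken from ggml/src/ggml-cuda/CMakeLists.txt.
cmake_minimum_required(VERSION 3.18)  # CUDA_ARCHITECTURES was introduced in CMake 3.18
project(arch_demo LANGUAGES CXX CUDA)

# kernels.cu is a placeholder source file for illustration.
add_library(demo_kernels STATIC kernels.cu)

# A bare "50" asks for both device binary code for compute capability 5.0
# and compute capability 5.0 PTX embedded in the same object.
set_target_properties(demo_kernels PROPERTIES CUDA_ARCHITECTURES "50")

# Variants one could try instead (assumptions for experimentation):
#   "50-real"            binary code only, no PTX embedded
#   "50-virtual"         PTX only, always compiled just-in-time at load
#   "50;52-real"         5.0 binary + PTX, plus a dedicated 5.2 binary
```

If startup time on a specific newer GPU matters, adding a `-real` entry for that GPU's compute capability is one way to avoid relying on just-in-time compilation, at the cost of a larger binary and longer build.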

* CUDA: correct the lowest Maxwell supported by CUDA 12 --------- Co-authored-by: Johannes Gäßler <[email protected]>

CUDA: correct the lowest Maxwell supported by CUDA 12

0f01b29

github-actions bot added the "Nvidia GPU" and "ggml" labels Feb 20, 2025

JohannesGaessler reviewed Feb 20, 2025


ggml/src/ggml-cuda/CMakeLists.txt (outdated)

Apply suggestions from code review

ac4e437

Co-authored-by: Johannes Gäßler <[email protected]>

JohannesGaessler approved these changes Feb 21, 2025


JohannesGaessler merged commit ecc8e3a into ggml-org:master Feb 21, 2025
46 checks passed

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Feb 26, 2025

CUDA: correct the lowest Maxwell supported by CUDA 12 (ggml-org#11984)

19b54ba

* CUDA: correct the lowest Maxwell supported by CUDA 12 --------- Co-authored-by: Johannes Gäßler <[email protected]>

mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025

CUDA: correct the lowest Maxwell supported by CUDA 12 (ggml-org#11984)

c0b282a

* CUDA: correct the lowest Maxwell supported by CUDA 12 --------- Co-authored-by: Johannes Gäßler <[email protected]>
