Skip to content

WIP: Reintegrate CUDA support#247

Draft
dfgHiatus wants to merge 1 commit intomainfrom
feat/cuda-for-all
Draft

WIP: Reintegrate CUDA support#247
dfgHiatus wants to merge 1 commit intomainfrom
feat/cuda-for-all

Conversation

@dfgHiatus
Copy link
Copy Markdown
Collaborator

This PR has a bit of a story attached to it.

As of the latest release, inference on Windows is accelerated through the DirectML execution provider, and there is no acceleration on Linux. The AMD backends, ROCM and MiGraphX, do not have any NuGet packages available, and the former is deprecated. The effort to maintain our own execution providers is not worth it at this time.

On the flip side, CUDA has been a more interesting. For us to have proper CUDA support, we need to bundle a parts of the CUDNN runtime with the application, as the publicly available NuGet package does not do this by default (and hasn't for a while, by the looks of some things online). This process is literally dropping in a single missing file, as has been my experience on Windows. I can't speak for Linux, at least on WSL Ubuntu 24.04.

I learned about this after modifying how I was appending the execution provider to CUDA. Dropping AppendExecutionProvider_CUDA for the longer code block has made things work for me on a 5060ti and CUDA 12-13. This is the only card I have to reliably test against, I can't speak against other hardware.

There are workarounds. This could be done in the actions build step, or we could copy the file over on the local user machine. These implementations appear fragile - This hasn't even factored in instructing the user to install compatible versions of CUDA and CUDNN on their system, build size, etc.

In the spirit of transparency, I'm going to push up what I have and see what others make and or think of it. Cheers, and good luck

@dfgHiatus dfgHiatus self-assigned this Mar 20, 2026
@dfgHiatus dfgHiatus changed the title WIP: Reintegrate proper CUDA support WIP: Reintegrate CUDA support Mar 22, 2026
@dfgHiatus dfgHiatus added enhancement New feature or request help wanted Extra attention is needed labels Mar 22, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request help wanted Extra attention is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant