Skip to content

Conversation

dferre97
Copy link
Contributor

Added fast w_mul_xj propagate CUDA support for sparse graphs using SpMM.
Added benchmarks to compare with gather/scatter approach, speedup from 40x to 300x on my machine, huge memory allocation benefits (up to 1000x less or more depending on size of graph and sparsity level).
CUDA tests on GraphNeuralNetworks passed.

@CarloLucibello
Copy link
Member

Build kite Cuda issues doesn't seem related to this pr, but we have to fix it at some point.

@CarloLucibello CarloLucibello merged commit 9b327d6 into JuliaGraphs:master Jul 16, 2025
5 of 7 checks passed
@dferre97 dferre97 deleted the df/wmulxj-sparse-cuda branch July 17, 2025 05:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants