
Conversation

@kshyatt
Member

@kshyatt kshyatt commented Aug 13, 2025

This generalizes the CUDA wrappers to also support AMD GPUs. I moved some of the higher-level GPU logic into src so that we don't have to duplicate code, and provided "fallbacks" for the lower-level LAPACK-like methods (ROCm and CUDA implement the same APIs), so the extensions should only need to extend the LAPACK API rather than higher-level functions such as qr_null!. I also added CI support for AMD.

Since I don't have an AMD GPU to test on, this is going to be "CI based debugging" for a bit 😅 . I did check locally that the CUDA tests pass.
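The fallback pattern described above could be sketched roughly like this (a minimal illustration only; the internal entry point `_geqrf!` is a hypothetical name, not the package's actual API):

```julia
# In src/: the high-level routine is written once against a small
# LAPACK-like internal API, so the GPU logic is not duplicated.
function qr_null!(A::AbstractMatrix, N, alg)
    # shared logic: factorize via the low-level entry point, then
    # assemble the null space from the factors
    tau = _geqrf!(A)            # hypothetical LAPACK-like entry point
    # ... build N from A and tau ...
    return N
end

# In ext/MatrixAlgebraKitCUDAExt: only the low-level method is added.
_geqrf!(A::StridedCuMatrix) = CUSOLVER.geqrf!(A)

# In ext/MatrixAlgebraKitAMDGPUExt: same entry point, ROCm backend.
_geqrf!(A::StridedROCMatrix) = rocSOLVER.geqrf!(A)
```

With this layout, adding a new GPU backend means extending only the handful of LAPACK-like methods in the corresponding extension.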

@kshyatt kshyatt requested review from Jutho and lkdvos August 13, 2025 12:07
@kshyatt kshyatt force-pushed the ksh/amd branch 17 times, most recently from 43ee4d2 to fd5e540 Compare August 13, 2025 16:51
@codecov

codecov bot commented Aug 13, 2025

Codecov Report

❌ Patch coverage is 89.57346% with 22 lines in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| ext/MatrixAlgebraKitAMDGPUExt/yarocsolver.jl | 86.66% | 10 Missing ⚠️ |
| src/implementations/svd.jl | 89.70% | 7 Missing ⚠️ |
| src/implementations/qr.jl | 93.87% | 3 Missing ⚠️ |
| ...ixAlgebraKitAMDGPUExt/MatrixAlgebraKitAMDGPUExt.jl | 84.61% | 2 Missing ⚠️ |

| Files with missing lines | Coverage | Δ |
|---|---|---|
| ...MatrixAlgebraKitCUDAExt/MatrixAlgebraKitCUDAExt.jl | 85.71% <100.00%> | (+10.71%) ⬆️ |
| ext/MatrixAlgebraKitCUDAExt/yacusolver.jl | 92.74% <ø> | (ø) |
| src/MatrixAlgebraKit.jl | 100.00% <ø> | (ø) |
| ...ixAlgebraKitAMDGPUExt/MatrixAlgebraKitAMDGPUExt.jl | 84.61% <84.61%> | (ø) |
| src/implementations/qr.jl | 96.31% <93.87%> | (-1.05%) ⬇️ |
| src/implementations/svd.jl | 93.13% <89.70%> | (-1.72%) ⬇️ |
| ext/MatrixAlgebraKitAMDGPUExt/yarocsolver.jl | 86.66% <86.66%> | (ø) |

... and 1 file with indirect coverage changes


Member

@lkdvos lkdvos left a comment

Looks great!

I had one question about the way you define things for AbstractMatrix and Union{CUAlg, ROCAlg}, just to check whether this causes any ambiguities. One of the goals of this package was to allow overloading based on your own "operator", so it tends to be easier to only use concrete types for the algorithms. That being said, from what I can tell it should be fine here, since you probably always want to define things for both at the same time anyway; I just wanted to hear if you have any thoughts on that.

@kshyatt
Member Author

kshyatt commented Aug 14, 2025

> I had one question about the way you define things for AbstractMatrix and Union{CUAlg, ROCAlg}, which is just to check if this causes any ambiguities or not - one of the goals of this package was to allow overloading based on your own "operator", and therefore it tends to be easier to only use concrete types for the algorithms. This being said, from what I can tell it should be fine here since you probably always want to define things for both at the same time anyways, I just wanted to hear if you have any thoughts on that.

If I understood the question correctly, I think this should avoid ambiguities, since in the package extensions the functions only accept the appropriate Strided*Array, where * can be Cu or ROC -- so if you pass a CUDA array to the ROCm wrapper, it will fall back to the AbstractArray case and throw a MethodError. Another option would be to define a new abstract type GPUSVDPolar <: AbstractAlgorithm and have the CUDA/AMD concrete types subtype this.
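A minimal sketch of the dispatch behaviour being discussed (all type and function names here are illustrative, not the package's real ones):

```julia
# The algorithm types stay concrete; the union only groups them
# for the shared high-level method.
struct CUSOLVERAlg end
struct ROCSOLVERAlg end
const GPUAlg = Union{CUSOLVERAlg, ROCSOLVERAlg}

# Shared method in src, defined once for both algorithms:
svd_polar!(A::AbstractMatrix, out, alg::GPUAlg) = _gpu_svd!(A, out, alg)

# Each extension only accepts its own strided GPU array type:
function _gpu_svd!(A::StridedCuMatrix, out, alg::CUSOLVERAlg)
    # ... call into CUSOLVER here (CUDA extension) ...
end
function _gpu_svd!(A::StridedROCMatrix, out, alg::ROCSOLVERAlg)
    # ... call into rocSOLVER here (AMDGPU extension) ...
end
```

Passing a ROCArray together with the CUDA algorithm (or vice versa) matches no `_gpu_svd!` method and throws a MethodError, so the union on the algorithm type cannot silently pick the wrong backend.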

@lkdvos
Member

lkdvos commented Aug 14, 2025

I think I would prefer not to have too many abstract types added, so the const union is probably fine. Thinking it through a bit more, I think you are right: either we hit the MethodError or we hit the appropriate Strided*Array implementation, so we should be good!

@kshyatt
Member Author

kshyatt commented Aug 14, 2025

Should I maybe add an explanatory comment there about this, since it was confusing enough that you had to ask?

@lkdvos
Member

lkdvos commented Aug 14, 2025

I'm fine with either; I think it's not necessarily confusing. I tend to over-worry a bit about ambiguities, since there is a large number of downstream packages for which this would be very annoying to work around.

@kshyatt
Member Author

kshyatt commented Aug 14, 2025 via email

@lkdvos
Member

lkdvos commented Aug 14, 2025

We definitely should; unfortunately these are still a bit in development 😀 Thinking of TensorKit, BlockSparseArrays, etc. right now

@kshyatt
Member Author

kshyatt commented Aug 14, 2025

In that case, are we good to merge?

@lkdvos
Member

lkdvos commented Aug 14, 2025

I think so!
(It might be me, but I can't see the updates from the resolved conversations, is this just not pushed yet?)

@kshyatt kshyatt merged commit 8877460 into main Aug 14, 2025
10 checks passed
@kshyatt kshyatt deleted the ksh/amd branch August 14, 2025 10:17