WIP: native algorithms #90

Jutho · 2025-10-31T23:53:18Z

This is currently just a proof of principle, but I think it is not too hard to implement native algorithms by building on my old (never published) attempt at a generic linear algebra library. It currently only has a QR (which is easy), but its performance is, surprisingly, virtually identical to that of GenericLinearAlgebra.

Hence, this is an alternative to #87, but we can easily have both, as it would take some time to bring everything up to date.
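
To illustrate why a generic QR is the easy case, here is a minimal sketch of an unblocked Householder QR that works for any floating-point element type. It is not the code in src/common/householder.jl: `householder_qr` is a hypothetical name, and the sketch forms the full square Q explicitly rather than storing reflectors compactly.

```julia
using LinearAlgebra

# Hypothetical helper for illustration only (not the PR's implementation):
# unblocked Householder QR for real or complex floating-point element types.
function householder_qr(A::AbstractMatrix{T}) where {T<:Union{AbstractFloat,Complex{<:AbstractFloat}}}
    m, n = size(A)
    R = Matrix{T}(A)          # working copy, reduced to upper triangular form
    Q = Matrix{T}(I, m, m)    # accumulates the product of the reflectors
    for k in 1:min(m - 1, n)
        x = R[k:m, k]
        normx = norm(x)
        iszero(normx) && continue    # nothing to eliminate in this column
        # Pick the phase of x[1] so that v[1] = x[1] + phase * normx avoids cancellation
        phase = iszero(x[1]) ? one(T) : x[1] / abs(x[1])
        v = copy(x)
        v[1] += phase * normx
        v ./= norm(v)
        # Apply H = I - 2 v v' from the left to R and from the right to Q
        R[k:m, :] .-= 2 .* v .* (v' * R[k:m, :])
        Q[:, k:m] .-= 2 .* (Q[:, k:m] * v) .* v'
    end
    return Q, triu!(R)        # triu! clears rounding residue below the diagonal
end

# Usage with an element type that LAPACK cannot handle
A = BigFloat[4 1 2; 2 3 1; 1 2 5; 3 1 1]
Q, R = householder_qr(A)
norm(Q * R - A)               # on the order of eps(BigFloat) times norm(A)
```

Note that this sketch does not normalize the diagonal of R to be positive, and it makes no attempt at blocking or avoiding allocations.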

codecov · 2025-11-01T00:01:33Z

Codecov Report

❌ Patch coverage is 0% with 238 lines in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| src/common/householder.jl | 0.00% | 110 Missing ⚠️ |
| src/implementations/lq.jl | 0.00% | 62 Missing ⚠️ |
| src/implementations/qr.jl | 0.00% | 62 Missing ⚠️ |
| src/interface/lq.jl | 0.00% | 2 Missing ⚠️ |
| src/interface/qr.jl | 0.00% | 2 Missing ⚠️ |

| Files with missing lines | Coverage Δ | |
|---|---|---|
| src/MatrixAlgebraKit.jl | `100.00% <ø> (ø)` | |
| src/interface/decompositions.jl | `100.00% <ø> (ø)` | |
| src/interface/lq.jl | `18.75% <0.00%> (-31.25%)` | ⬇️ |
| src/interface/qr.jl | `18.18% <0.00%> (-48.49%)` | ⬇️ |
| src/implementations/lq.jl | `31.72% <0.00%> (-67.21%)` | ⬇️ |
| src/implementations/qr.jl | `33.82% <0.00%> (-62.79%)` | ⬇️ |
| src/common/householder.jl | `0.00% <0.00%> (ø)` | |

... and 29 files with indirect coverage changes

kshyatt · 2025-11-02T09:23:45Z

I think having the native AD can be helpful for testing since the compilers in both Mooncake and Enzyme are pretty good and the guidance is to only write a custom rule if you really have to -- such as when you have a call to a foreign library like BLAS.

Jutho · 2025-11-02T09:42:07Z

This now contains a fully functional QR and LQ with tests, and could in principle be merged as is, with other factorizations being done in separate PRs. However, going forward, this raises a number of interesting questions for @lkdvos and @kshyatt:

1. Currently, the Native_HouseholderQR/LQ algorithm is not registered as a default. Should it be the default for AbstractMatrix{BigFloat} and AbstractMatrix{Complex{BigFloat}}, with other scalar types (e.g. https://github.com/JuliaMath/DoubleFloats.jl) then needing separate registration (possibly in package extensions)? Or do we register the native algorithms as the default for all AbstractMatrix, hoping that the LAPACK / GPU implementations are always correctly selected through their more specific default registration signatures? Also, how should a non-strided AbstractMatrix{Float64} be handled in a non-mutating method, where it is copied anyway?
2. In principle, we can natively AD through these implementations. Is this interesting for testing purposes? Or do we even want to have native AD as the default behavior?

lkdvos · 2025-11-03T11:59:11Z

For 1, I think that since the arbitrary element types are the primary candidates for these native implementations, it would make sense to just register the native implementations as the defaults. I would prefer to avoid having to define overloads for every possible combination of weird element type and weird array type, and this seems like a solution that might often just work?

For 2, I don't really know. We don't really have a good interface for selecting different AD modes right now (nor for selecting tolerances etc.), so while I don't mind having the option to use native AD, I would also just want to see what the performance is like before we invest in trying to make this accessible?
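
The catch-all registration discussed for question 1 relies on Julia choosing the most specific applicable method. A minimal sketch of that mechanism follows; `pick_qr_alg`, `NativeQR` and `LapackQR` are hypothetical names used only for illustration and are not the package's actual API.

```julia
using LinearAlgebra: BlasFloat

# Hypothetical algorithm tags, for illustration only.
struct NativeQR end
struct LapackQR end

# Catch-all: any matrix type and element type falls back to the native algorithm.
pick_qr_alg(::AbstractMatrix) = NativeQR()

# More specific method: strided storage with a BLAS element type goes to LAPACK.
pick_qr_alg(::StridedMatrix{<:BlasFloat}) = LapackQR()

A = ones(4, 4)
pick_qr_alg(A)                    # LapackQR(): strided Float64
pick_qr_alg(big.(A))              # NativeQR(): BigFloat is not a BLAS type
pick_qr_alg(Float16.(A))          # NativeQR(): Float16 is not a BLAS type
pick_qr_alg(view(A, [1, 3], :))   # NativeQR(): Float64, but not strided
```

In this sketch the non-strided Float64 view ends up on the native path. Whether it should instead be copied into a strided array and handed to LAPACK is exactly the open question raised above.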

kshyatt · 2025-11-03T12:58:08Z

test/lq.jl


-eltypes = (Float32, Float64, ComplexF32, ComplexF64)
+lapack_eltypes = (Float32, Float64, ComplexF32, ComplexF64)
+native_eltypes = (lapack_eltypes..., BigFloat, Complex{BigFloat})


Should we also try Float16 for the native eltypes?

kshyatt · 2025-11-03T12:58:45Z

Since this provides LQ and QR, could we also test left_orth and left_null, and their right-sided versions, using these?

kshyatt · 2025-11-03T13:00:46Z

Also, it looks like we are not yet testing the GPU support for these and I suspect we'll get scalar indexing errors -- should we add in GPU tests to this PR?

Jutho · 2025-11-03T14:17:05Z

This is definitely not meant for GPUs (which I still do not know anything about). Do people ever want to do non-native scalar types on GPUs?

Adding Float16 to the tests is a worthwhile suggestion.

So should I then add a catch-all default_qr_algorithm(::AbstractMatrix)? I assume there is no way to exclude anything that actually lives on the GPU?

Finally, regarding AD: I assume that our default setup will already automatically select our custom pullback rules also for these native QR and LQ implementations. How difficult is it to circumvent these registered custom pullback rules and switch to native AD'ing, e.g. for testing purposes?

kshyatt · 2025-11-03T14:20:23Z

> Do people ever want to do non-native scalar types on GPUs?

Well, Sander might ;). But people definitely do Float16 and in theory one might want to do Float128 types even if arbitrary precision won't work because it may not be isbits.

lkdvos · 2025-11-03T15:24:21Z

Since we are intercepting the AD at the level of qr_compact!(A, F, alg), we could just try and AD through _native_qr, which won't be intercepted, if the goal is to just test things out.

native_qr

2f9a915

add lq and tests

75e0d03

kshyatt reviewed Nov 3, 2025

sanderdemeyer mentioned this pull request Nov 3, 2025

add support for BigFloats via a new extension #87

Merged

add defaults and more tests

4dcbaf6
