feat(autogram): Add DiagonalSparseTensor. #466
base: dev-new-engine
Conversation
…ctions (mean and sum).
Force-pushed 87b4da0 to efa8019
    …hould be implemented differently)
…used over the constructor of `DST`
I think `square` is failing because it is implemented as …
…ake `data` and `v_to_p` public.
…e, and make `data` and `v_to_p` public." This reverts commit 85c8e41.
…k-diagonal-tensor
…ungrouped_dims, remove print
…irtual dimension that uses a physical dimension multiple times.
* The result is the same as before.
* Before, we only iterated over the pdims used by each virtual dim, and summed them when a pdim was present multiple times.
* Now the new stride is already the sum of the old strides when a pdim is present multiple times in a vdim. We iterate over all dimensions because, for dimensions not present in the vdim, the stride is simply 0.
* There's probably a more efficient implementation.
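The stride computation described in this commit amounts to the following (a minimal sketch with a hypothetical helper name; the PR's `strides_v2` may differ in details): one stride per physical dimension, 0 for pdims the vdim does not use, and summed strides for repeated pdims.

```python
def strides_sketch(vdim_pdims: list[int], pshape: list[int]) -> list[int]:
    # One stride per physical dim; 0 for pdims this virtual dim does not use.
    out = [0] * len(pshape)
    step = 1
    # Row-major composition: the last pdim listed varies fastest.
    for p in reversed(vdim_pdims):
        out[p] += step  # a pdim repeated in the vdim accumulates its strides
        step *= pshape[p]
    return out

# strides_sketch([0, 0], [4]) == [5]: on a diagonal, the flattened virtual
# index advances by 4 + 1 for each physical step.
```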
* Now that we iterate over `all_pdims` instead of the pdims of the current virtual dimension, the result of `torch.stack([p_indices_grid[d] for d in all_pdims], dim=-1)` is always the same, and is simply equal to `torch.stack(p_indices_grid, dim=-1)`. So we directly stack `p_indices_grid` when creating it, and use the already-stacked `p_indices_grid` in the for-loop.
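Concretely, the pre-stacked grid can be built once at creation time (a hedged sketch; `pshape` stands in for the physical shape):

```python
import torch

pshape = (3, 4)
# One index grid per physical dim, stacked once with dim=-1 as described
# above; the for-loop can then reuse this tensor instead of re-stacking
# a per-vdim selection of grids.
p_indices_grid = torch.stack(
    torch.meshgrid(*(torch.arange(s) for s in pshape), indexing="ij"),
    dim=-1,
)  # shape (*pshape, n_pdims) == (3, 4, 2)
```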
* In the long term I'll try to rely mostly or even only on them instead of `v_to_ps`, so it makes sense to pre-compute them.
… make it directly as a tuple
* This makes lines shorter
…cannot have `strides` on CUDA because `addmm_cuda` (required for `tensordot`) does not support Long tensors (but the CPU version does).
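A minimal reproduction of that constraint (the exact error text may vary across PyTorch versions):

```python
import torch

strides = torch.tensor([[1, 0], [0, 1]])  # int64 ("Long")
grid = torch.stack(torch.meshgrid(torch.arange(3), torch.arange(4), indexing="ij"))

torch.tensordot(strides, grid, dims=1)  # fine: CPU matmul supports Long
# The same call on CUDA fails, roughly:
#   torch.tensordot(strides.cuda(), grid.cuda(), dims=1)
#   RuntimeError: "addmm_cuda" not implemented for 'Long'
```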
```python
self.physical = physical
self.v_to_ps = v_to_ps
pshape = list(self.physical.shape)
self.strides = tensor([strides_v2(pdims, pshape) for pdims in self.v_to_ps])
```
Could we make `strides_v2` return a Tensor directly? That way, if we change from `v_to_ps` to a pure stride description, we can already assume it is an array (we could have a function to build the strides from a `v_to_ps` and `physical.shape`).
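A sketch of what that could look like (hypothetical `build_strides` helper, not in the PR):

```python
import torch
from torch import Tensor

def build_strides(v_to_ps: list[list[int]], pshape: list[int]) -> Tensor:
    """Build the (n_vdims, n_pdims) stride matrix directly as a Tensor,
    so a pure stride description could later be passed through unchanged."""
    rows = []
    for pdims in v_to_ps:
        row = [0] * len(pshape)
        step = 1
        for p in reversed(pdims):  # row-major over this vdim's pdims
            row[p] += step
            step *= pshape[p]
        rows.append(row)
    return torch.tensor(rows)
```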
```python
# addmm_cuda not implemented for Long tensors => gotta have these tensors on cpu
v_indices_grid = tensordot(self.strides, p_indices_grid, dims=1)
res = zeros(self.shape, device=self.device, dtype=self.dtype)
res[tuple(v_indices_grid)] = self.physical
```
I would be really surprised if the cast to `tuple` were necessary (but I did not check).
```diff
- res[tuple(v_indices_grid)] = self.physical
+ res[v_indices_grid] = self.physical
```
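Worth checking before applying: for a stacked 2-D index tensor, PyTorch gives the two forms different semantics, e.g.:

```python
import torch

res = torch.zeros(3, 3)
idx = torch.tensor([[0, 1, 2], [0, 1, 2]])  # one row of indices per dim

res[tuple(idx)].shape  # torch.Size([3])       -> one element per (i, j) column
res[idx].shape         # torch.Size([2, 3, 3]) -> idx indexes dim 0 only
```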
Some prototype of diagonal sparse Tensors.

TODO:

- `contiguous_data`/`v_to_ps` match up to joint `move_dim`.

Problems and questions:

- With `t = DST(..., v_to_p=[[0], [0]])` and slicing `t[2:8, 2:8]`, we would want the result to be a `DST(..., v_to_p=[[0], [0]])`, but this cannot be done easily by slicing iteratively on each dimension.
- Use `aten` on the inner Tensor rather than higher-level operators from `torch`? It feels like `aten` does not accept any Tensor, and we would want to be able to compose with any. However, doing this might result in some extra overhead.
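To make the prototype concrete, here is a minimal self-contained sketch consistent with the diff above (class name, constructor signature, and helpers are illustrative, not the PR's API):

```python
import math
import torch
from torch import Tensor

class DSTSketch:
    """Toy diagonal sparse tensor: dense `physical` data plus a map
    `v_to_ps` from each virtual dim to the physical dims it reads;
    a physical dim shared across virtual dims encodes a diagonal."""

    def __init__(self, physical: Tensor, v_to_ps: list[list[int]]):
        self.physical = physical
        self.v_to_ps = v_to_ps
        pshape = list(physical.shape)
        self.shape = tuple(
            math.prod(pshape[p] for p in pdims) for pdims in v_to_ps
        )
        rows = []
        for pdims in v_to_ps:
            row = [0] * len(pshape)     # 0 stride for unused physical dims
            step = 1
            for p in reversed(pdims):
                row[p] += step          # repeated pdims accumulate strides
                step *= pshape[p]
            rows.append(row)
        # Kept on CPU: Long matmul has no CUDA kernel (see the diff above).
        self.strides = torch.tensor(rows)

    def to_dense(self) -> Tensor:
        pshape = self.physical.shape
        grids = torch.meshgrid(*(torch.arange(s) for s in pshape), indexing="ij")
        p_indices_grid = torch.stack(grids)  # (n_pdims, *pshape)
        v_indices_grid = torch.tensordot(self.strides, p_indices_grid, dims=1)
        res = torch.zeros(self.shape, dtype=self.physical.dtype)
        res[tuple(v_indices_grid)] = self.physical
        return res

# DSTSketch(torch.tensor([1., 2., 3.]), v_to_ps=[[0], [0]]).to_dense()
# -> a 3x3 matrix with [1., 2., 3.] on the diagonal.
```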