Use cumsum from flox #10987

Illviljan · 2025-12-06T13:44:27Z

Closes cumsum drops index coordinates #6528
Tests added
User visible changes (including notable bug fixes) are documented in whats-new.rst

The non-flox version reduces chunksizes significantly:

x = xr.DataArray([1, 1, 1, 1, 1], name="x").chunk()
grp_idx = xr.DataArray([-1, 0, 0, -1, 1])
with xr.set_options(use_flox=False):
    print(x.groupby(grp_idx).cumsum())
<xarray.DataArray 'x' (dim_0: 5)> Size: 40B
dask.array<getitem, shape=(5,), dtype=int64, chunksize=(2,), chunktype=numpy.ndarray>
Dimensions without coordinates: dim_0

With flox the chunksize is retained:

x = xr.DataArray([1, 1, 1, 1, 1], name="x").chunk()
grp_idx = xr.DataArray([-1, 0, 0, -1, 1])
with xr.set_options(use_flox=True):
    print(x.groupby(grp_idx).cumsum())
<xarray.DataArray 'x' (dim_0: 5)> Size: 40B
dask.array<_finalize_scan, shape=(5,), dtype=int64, chunksize=(5,), chunktype=numpy.ndarray>
Dimensions without coordinates: dim_0

Other changes:

Changes DataArray.cumsum/Dataset.cumsum/DataTree.cumsum/DataArray.groupby.cumsum/Dataset.groupby.cumsum etc.
Coordinates are now retained

Notes
groupby_scan was added in: https://github.com/xarray-contrib/flox/releases/tag/v0.9.9
cumsum was added in: https://github.com/xarray-contrib/flox/releases/tag/v0.10.5

for more information, see https://pre-commit.ci

…o cumsum_flox

for more information, see https://pre-commit.ci

…o cumsum_flox

for more information, see https://pre-commit.ci

xarray/core/groupby.py

…o cumsum_flox

for more information, see https://pre-commit.ci

…o cumsum_flox

for more information, see https://pre-commit.ci

…o cumsum_flox

for more information, see https://pre-commit.ci

xarray/core/groupby.py

Co-authored-by: Deepak Cherian <[email protected]>

Illviljan · 2026-01-07T22:24:09Z

@dcherian, this is ready for a another review now. It was only changes in tests since the last time.

dcherian · 2026-01-08T05:18:35Z

Are you able to address the extra testing requested in #10987 (comment)?

If you're too busy, we can just merge. This is a good improvement.

for more information, see https://pre-commit.ci

This reverts commit eff5561.

for more information, see https://pre-commit.ci

Illviljan · 2026-01-10T19:15:54Z

Writing down these flox issues before I forget:

ds = xr.Dataset(
    {
        "foo": (
            ("test", "time"),
            [[7, 2, 0, 1, 2, np.nan], [1, 1, 1, 1, 1, 1], [2, 2, 2, 2, 2, 2]],
        )
    },
    coords={
        "time": [0, 1 / 6, 2 / 6, 3 / 6, 4 / 6, 5 / 6],
        "test": ["a", "b", "b"],
        "group_idx": ("time", [0, 0, 1, 1, 2, 2]),
        "group_idx2": ("time", [0, 1, 1, 1, 1, 1]),
    },
)

# group_idx along 1 dim and cumsum dim along another fails with flox:
ds.groupby("group_idx").cumsum("test") 

# cumsum along multple dims fails with flox:
ds.groupby("group_idx").cumsum(...)

for more information, see https://pre-commit.ci

…o cumsum_flox

for more information, see https://pre-commit.ci

use cumsum from flox

776bc5a

github-actions bot added the topic-groupby label Dec 6, 2025

pre-commit-ci bot and others added 13 commits December 6, 2025 13:44

[pre-commit.ci] auto fixes from pre-commit.com hooks

ae27632

for more information, see https://pre-commit.ci

Update groupby.py

a5f9326

Update groupby.py

50ccca4

[pre-commit.ci] auto fixes from pre-commit.com hooks

f55531e

for more information, see https://pre-commit.ci

Update groupby.py

06ac372

Merge branch 'cumsum_flox' of https://github.com/Illviljan/xarray int…

31244e6

…o cumsum_flox

Update groupby.py

dd47536

[pre-commit.ci] auto fixes from pre-commit.com hooks

e867f12

for more information, see https://pre-commit.ci

Update groupby.py

88e0ebc

[pre-commit.ci] auto fixes from pre-commit.com hooks

181d4a3

for more information, see https://pre-commit.ci

use apply_ufunc for dataset and dataarray handling

a82ec39

Merge branch 'cumsum_flox' of https://github.com/Illviljan/xarray int…

6c6abed

…o cumsum_flox

[pre-commit.ci] auto fixes from pre-commit.com hooks

24c3f1d

for more information, see https://pre-commit.ci

dcherian reviewed Dec 6, 2025

View reviewed changes

xarray/core/groupby.py Show resolved Hide resolved

dcherian reviewed Dec 6, 2025

View reviewed changes

xarray/core/groupby.py Show resolved Hide resolved

Illviljan and others added 11 commits December 6, 2025 16:21

Update groupby.py

d8d0eaa

Merge branch 'cumsum_flox' of https://github.com/Illviljan/xarray int…

55ff46a

…o cumsum_flox

[pre-commit.ci] auto fixes from pre-commit.com hooks

33d1360

for more information, see https://pre-commit.ci

sync protocols with each other

c97ae98

Merge branch 'cumsum_flox' of https://github.com/Illviljan/xarray int…

06b52ae

…o cumsum_flox

typing

84f9b44

[pre-commit.ci] auto fixes from pre-commit.com hooks

2978877

for more information, see https://pre-commit.ci

add dataset and version requirement

0a9adee

Merge branch 'cumsum_flox' of https://github.com/Illviljan/xarray int…

ae9a3d8

…o cumsum_flox

[pre-commit.ci] auto fixes from pre-commit.com hooks

c056d1f

for more information, see https://pre-commit.ci

Update _aggregations.py

d4873b9

dcherian reviewed Dec 6, 2025

View reviewed changes

xarray/core/groupby.py Outdated Show resolved Hide resolved

Update xarray/core/groupby.py

21cbde2

Co-authored-by: Deepak Cherian <[email protected]>

Update whats-new.rst

b986776

Illviljan added 2 commits January 10, 2026 11:40

Add ASV test

1019c2d

Motivate use Dataset.func further

08a4d88

github-actions bot added the topic-performance label Jan 10, 2026

Illviljan added the run-benchmark Run the ASV benchmark workflow label Jan 10, 2026

Illviljan and others added 15 commits January 10, 2026 14:37

test refactor

7764a81

[pre-commit.ci] auto fixes from pre-commit.com hooks

eff5561

for more information, see https://pre-commit.ci

Revert "[pre-commit.ci] auto fixes from pre-commit.com hooks"

d73e97e

This reverts commit eff5561.

Update test_groupby.py

8324ea1

Update test_groupby.py

a93ce0e

Update test_groupby.py

093e47c

Update test_groupby.py

c98f99d

Update test_groupby.py

55dfdef

nd-array tests and different dims

3cb0981

Update test_groupby.py

0ee5db4

Update test_groupby.py

d3d3ed8

[pre-commit.ci] auto fixes from pre-commit.com hooks

d5e554e

for more information, see https://pre-commit.ci

Update test_groupby.py

0eb869a

Update test_groupby.py

e62db4c

Update test_groupby.py

43d53f6

Illviljan and others added 7 commits January 10, 2026 20:41

more tests

cf3a2d9

Update test_groupby.py

784d7ca

[pre-commit.ci] auto fixes from pre-commit.com hooks

6f8e5f1

for more information, see https://pre-commit.ci

Update test_groupby.py

d108969

Merge branch 'cumsum_flox' of https://github.com/Illviljan/xarray int…

5e6e0eb

…o cumsum_flox

[pre-commit.ci] auto fixes from pre-commit.com hooks

4001a39

for more information, see https://pre-commit.ci

Update test_groupby.py

ae74a27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Use cumsum from flox #10987

Use cumsum from flox #10987

Illviljan commented Dec 6, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Illviljan commented Jan 7, 2026

Uh oh!

dcherian commented Jan 8, 2026

Uh oh!

Illviljan commented Jan 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Use cumsum from flox #10987

Are you sure you want to change the base?

Use cumsum from flox #10987

Conversation

Illviljan commented Dec 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Illviljan commented Jan 7, 2026

Uh oh!

dcherian commented Jan 8, 2026

Uh oh!

Illviljan commented Jan 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Illviljan commented Dec 6, 2025 •

edited

Loading