Refactor `_resampled_scene()` and `_reduce_data()` methods of the Scene class (#3178)
base: main
Conversation
Codecov Report: ❌ Patch coverage is …

Additional details and impacted files:

```
@@           Coverage Diff            @@
##            main    #3178    +/-   ##
========================================
- Coverage   96.28%   96.28%   -0.01%
========================================
  Files         436      436
  Lines       57830    57940    +110
========================================
+ Hits        55681    55785    +104
- Misses       2149     2155      +6
========================================
```

Flags with carried forward coverage won't be shown. View full report in Codecov by Sentry.
I'll do some more refactoring to …
Pull Request Test Coverage Report for Build 16566762203 (details)

💛 Coveralls
Thanks! I've merged this into #3168 but there is still an issue with missing test coverage.
```python
replace_anc(res, pres)
```

```python
@classmethod
def _get_new_datasets_from_parent(self, new_datasets, parent_dataset):
```
The `self` should be `cls` since these are classmethods now, but given that they are classmethods (or could be staticmethods), how about we move these outside of the Scene class? And could they (or should they) be moved to the satpy.resample subpackage?
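A minimal sketch of the two options being discussed, using hypothetical simplified names and logic rather than satpy's actual code:

```python
# Hypothetical, simplified sketch of the two options above. Neither version
# actually uses its first argument, which is why a module-level function
# works just as well as a classmethod.

class Scene:
    @classmethod
    def _get_resampled_parent(cls, new_datasets, parent_key):
        # A classmethod's first argument is conventionally `cls`, not `self`.
        if parent_key is not None:
            return new_datasets[parent_key]

def get_resampled_parent(new_datasets, parent_key):
    # Same logic as a plain function outside the Scene class (e.g. in a
    # resample-related module).
    if parent_key is not None:
        return new_datasets[parent_key]

print(Scene._get_resampled_parent({"a": "resampled"}, "a"))  # → resampled
print(get_resampled_parent({"a": "resampled"}, None))        # → None
```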
The diff replaces the inline reduction logic:

```python
try:
    if reduce_data:
        key = source_area
        try:
            (slice_x, slice_y), source_area = reductions[key]
        except KeyError:
            if resample_kwargs.get("resampler") == "gradient_search":
                factor = resample_kwargs.get("shape_divisible_by", 2)
            else:
                factor = None
            try:
                slice_x, slice_y = source_area.get_area_slices(
                    destination_area, shape_divisible_by=factor)
            except TypeError:
                slice_x, slice_y = source_area.get_area_slices(
                    destination_area)
            source_area = source_area[slice_y, slice_x]
            reductions[key] = (slice_x, slice_y), source_area
        dataset = self._slice_data(source_area, (slice_x, slice_y), dataset)
    else:
        LOG.debug("Data reduction disabled by the user")
```

with a call to the new helper:

```python
slice_x, slice_y = self._get_source_dest_slices(source_area, destination_area, reductions, resample_kwargs)
source_area = source_area[slice_y, slice_x]
reductions[source_area] = (slice_x, slice_y), source_area
dataset = self._slice_data(source_area, (slice_x, slice_y), dataset)
```
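The `except TypeError` branch in the hunk above uses a common version-compatibility pattern: try the newer keyword argument first, and retry without it if the installed version's signature does not accept it. A simplified, self-contained illustration (the function names here are stand-ins, not pyresample's real API):

```python
# Stand-in for a newer get_area_slices() that accepts shape_divisible_by.
def get_area_slices(dest, shape_divisible_by=None):
    return slice(0, 4), slice(0, 4)

# Stand-in for an older signature without the keyword.
def legacy_get_area_slices(dest):
    return slice(0, 3), slice(0, 3)

def call_with_fallback(func, dest, factor):
    try:
        # Prefer the newer keyword argument...
        return func(dest, shape_divisible_by=factor)
    except TypeError:
        # ...and fall back if this version's signature rejects it.
        return func(dest)

print(call_with_fallback(get_area_slices, "dest", 2))         # keyword accepted
print(call_with_fallback(legacy_get_area_slices, "dest", 2))  # falls back
```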
I'm curious if the `_get_source_dest_slices` operation is the only step that raises `NotImplementedError` or if `_slice_data` also does it? If the former, then maybe `_slice_data` should be moved outside of the try/except. Thoughts?
```python
@classmethod
def _get_new_datasets_from_parent(self, new_datasets, parent_dataset):
    if parent_dataset is not None:
        return new_datasets[DataID.from_dataarray(parent_dataset)]
```
`DataID.from_dataarray` returns a single `DataID`, right? I think I'd prefer a different name for this method. I think the purpose of this chunk of code is to say "if we've resampled the parent already, use the resampled parent", right? Or rather, if the current dataset has a parent, it should have been resampled already, so we should use the resampled version of the parent. I think?

In addition to renaming, it seems that `parent_dataset` is only used in the later steps to check for `is None`. I'm wondering if we can remove the use of `parent_dataset` in favor of `pres` and "bundle" this method's operation with the `dataset_walker` to be something like:

```python
for ds_id, dataset, resampled_parent in resampled_dataset_walker(datasets, new_datasets):
```

Or something like that.

...and if that is done, then there might be an argument for putting `_replace_anc_for_new_datasets` and `_update_area` into the for loop generator too. This changes the purpose of the for loop to be "what datasets do we need to resample" and then the inside of the for loop logic is just "reduce data", "resample data", "store result".

I'll admit the code was ugly and the logic of `new_scn._datasets` and `new_datasets` really isn't helping that.
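The generator idea above could be sketched roughly like this, using hypothetical names (`resampled_dataset_walker`, plain dicts standing in for satpy's `DatasetDict` and `DataID` machinery):

```python
# Hypothetical sketch: the walker yields each dataset together with the
# already-resampled version of its parent, if it has one.

def resampled_dataset_walker(datasets, new_datasets):
    for ds_id, dataset in datasets.items():
        parent_id = dataset.get("parent")  # stand-in for the ancillary lookup
        resampled_parent = new_datasets.get(parent_id)
        yield ds_id, dataset, resampled_parent

datasets = {"b": {"parent": "a"}, "c": {"parent": None}}
new_datasets = {"a": "resampled-a"}  # "a" was resampled earlier

for ds_id, dataset, resampled_parent in resampled_dataset_walker(datasets, new_datasets):
    print(ds_id, resampled_parent)
# b resampled-a
# c None
```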
Sorry for all the comments and no regular review. I just keep brainstorming.

One other idea: what if only `new_datasets` gets modified in the for loop and assigning to `new_scn._datasets` is left for a second for loop (e.g. `for ds_id, new_data_arr in new_datasets.items():`)? Maybe that would clean up the code inside the loop. I feel like a lot of this ugliness is caused by `new_scn` (or rather the `DatasetDict` inside) copying the `DataArray` and/or making modifications to it.
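A minimal sketch of that two-loop shape, with a stub `resample` and plain dicts standing in for the Scene's `DatasetDict`:

```python
# Hypothetical illustration: the first loop only builds new_datasets, and a
# second pass copies the results into the scene's container.

def resample(data):
    return [x * 2 for x in data]  # stand-in for the real resampling step

datasets = {"chan1": [1, 2], "chan2": [3, 4]}
new_scn_datasets = {}  # stand-in for new_scn._datasets

new_datasets = {}
for ds_id, dataset in datasets.items():
    new_datasets[ds_id] = resample(dataset)  # first loop: compute results only

for ds_id, new_data_arr in new_datasets.items():
    new_scn_datasets[ds_id] = new_data_arr   # second loop: store into the scene

print(new_scn_datasets)  # → {'chan1': [2, 4], 'chan2': [6, 8]}
```

Keeping mutation of the scene out of the main loop avoids interleaving "compute" and "store" concerns, which is where the copying behavior of the real `DatasetDict` makes the code hard to follow.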
This PR refactors `Scene._resampled_scene()`, which @gerritholl reported to be complicated in #3168 (comment). As there was a change to `_reduce_data()`, I did some additional refactoring to it too.

- [ ] Add your name to AUTHORS.md if not there already