Add minimal examples to API documentation by nevencaplar · Pull Request #1243 · astronomy-commons/lsdb

nevencaplar · 2026-02-02T22:36:09Z

Closes #1240.

dougbrn

Nice, I have one larger structural comment, which is that these examples will try to be run by doctest I believe, so anything that produces output will need output alongside the input. That may even result in failed tests, though we wouldn't know for sure until the CI decides to actually do something (not sure what's going on there).

It's also good for these to have output for usefulness, check out map_rows for an example of that: https://docs.lsdb.io/en/latest/reference/api/lsdb.catalog.Catalog.map_rows.html

Generally, running these code blocks on command line will get you a copy-pastable output to put in the docstring.

I'm not sure what the best practice would be for the examples that generate plots...

src/lsdb/catalog/catalog.py

github-actions · 2026-02-03T00:43:18Z

Before [`e1de1de`]	After [`a9f12d4`]	Ratio	Benchmark (Parameter)
6.89±0.03s	6.95±0.05s	1.01	benchmarks.time_create_large_catalog
1.04±0.01s	1.05±0.02s	1.01	benchmarks.time_create_midsize_catalog
27.7±0.2ms	27.7±0.4ms	1	benchmarks.time_box_filter_on_partition
19.5±0s	19.5±0.02s	1	benchmarks.time_save_big_catalog
8.19±0.02s	8.07±0.01s	0.99	benchmarks.time_lazy_crossmatch_many_columns_all_suffixes
8.19±0.09s	8.03±0.04s	0.98	benchmarks.time_lazy_crossmatch_many_columns_overlapping_suffixes
3.78±0.03s	3.70±0.02s	0.98	benchmarks.time_open_many_columns_all
370±10ms	362±3ms	0.98	benchmarks.time_open_many_columns_default
165±4ms	161±1ms	0.98	benchmarks.time_open_many_columns_list
105±2ms	100±2ms	0.96	benchmarks.time_kdtree_crossmatch

Click here to view all benchmarks.

nevencaplar · 2026-02-03T01:08:38Z

I did not add output for plotting function or for write_catalog (for write catalog I even have dummy output PATH in the example, i.e., the code cant work automatically the way it is written). Not sure what to do there?

I am also a bit worried about formatting. Is that a problem? My examples look like

                            ra        dec    id
   _healpix_29                                   
  118362963675428450  52.696686  39.675892  8154

while your example in map_rows looks like(difference is _healpix_29 is is in the same row as other columns, index is in front):

      _healpix_29  plus_one  minus_one
 0  1372475556631677955      21.1       20.9

dougbrn · 2026-02-03T19:52:16Z

@nevencaplar the formatting does matter for the CI, you can lean on the actual failed test results on this PR to see what it expects:

Differences (unified diff with -expected +actual):
    @@ -1,3 +1,3 @@
    -                        ra        dec    id
    +                           ra        dec    id
     _healpix_29                                   
     118362963675428450  52.696686  39.675892  8154

In cases where the code is actually non-operable, at least for CI, I think it's okay to let CI skip it, by adding # doctest: +SKIP as a comment to the output producing line(s)

dougbrn · 2026-02-03T19:54:39Z

Additionally, with LSDB's current setup pre-commit and doctest disagree on formatting, you will probably want to add # doctest: +NORMALIZE_WHITESPACE to any lines executing code that produce a dataframe to make both of them happy. See an example in from_astropy: https://github.com/astronomy-commons/lsdb/blob/main/src/lsdb/loaders/dataframe/from_astropy.py#L94

codecov · 2026-02-03T23:41:20Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 96.66%. Comparing base (fbb3fea) to head (acb6232).
⚠️ Report is 2 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #1243   +/-   ##
=======================================
  Coverage   96.66%   96.66%           
=======================================
  Files          46       46           
  Lines        2877     2877           
=======================================
  Hits         2781     2781           
  Misses         96       96

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

nevencaplar · 2026-02-04T00:05:30Z

I believe that all tests are passing, through combination of SKIP , NORMALIZE_WHITESPACE, and reduction of width of output pandas dataframes.

dougbrn

Looks great, thank you!

nevencaplar added 12 commits February 2, 2026 11:49

Modify random sample to cover large part of the sky

db3e664

Add API example for from_dataframe

d690d59

Add clarification about highest order being main limiting factor

15c6fc9

Add query example

2cf510a

Add example for map_partitions

9d9beb7

Add example for cone_search

c1159c1

Add join example

60d41d5

Add xmatch example

5686171

Add write_catalog example

c094ff2

Add get_partition example

76a5dd2

Add examples for plotting

b46f7d9

Remove from_dataframe in tutorials directory.

993e738

nevencaplar requested a review from dougbrn February 2, 2026 22:46

dougbrn reviewed Feb 2, 2026

View reviewed changes

src/lsdb/catalog/catalog.py Outdated Show resolved Hide resolved

Modify compute.head to .head

cfd37b5

nevencaplar added 7 commits February 2, 2026 16:44

Add output for map_partitions example

ae5d881

Add output for query example

88edae6

Add output for from_dataframe example

0398d18

Add output for coneSearch example

328ffd6

Add output for join example

6d0b3c2

Add output for crossmatch example

fa3f433

Add output for get_partition example

63543d8

nevencaplar requested a review from dougbrn February 3, 2026 01:08

nevencaplar added 3 commits February 3, 2026 13:51

Remove whitespace behind _healpix_29

e03450c

Remove trailing whitespace in from_dataframe example

6d515c1

Fix length problem in xmatch example

e705ff8

nevencaplar added 3 commits February 3, 2026 15:04

Add doctest exceptions

c3195b7

Skip test of output for plotting

cc4e6ba

Reduce number of columns in wide output dfs

9e57c7b

Avoid long lines in modified examples

acb6232

dougbrn approved these changes Feb 4, 2026

View reviewed changes

nevencaplar merged commit f9af042 into main Feb 4, 2026
12 checks passed

nevencaplar deleted the lsdb_1240 branch February 4, 2026 00:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add minimal examples to API documentation#1243

Add minimal examples to API documentation#1243
nevencaplar merged 27 commits intomainfrom
lsdb_1240

nevencaplar commented Feb 2, 2026

Uh oh!

dougbrn left a comment

Uh oh!

Uh oh!

github-actions bot commented Feb 3, 2026 •

edited

Loading

Uh oh!

nevencaplar commented Feb 3, 2026 •

edited

Loading

Uh oh!

dougbrn commented Feb 3, 2026

Uh oh!

dougbrn commented Feb 3, 2026

Uh oh!

codecov bot commented Feb 3, 2026 •

edited

Loading

Uh oh!

nevencaplar commented Feb 4, 2026

Uh oh!

dougbrn left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

nevencaplar commented Feb 2, 2026

Uh oh!

dougbrn left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions bot commented Feb 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nevencaplar commented Feb 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dougbrn commented Feb 3, 2026

Uh oh!

dougbrn commented Feb 3, 2026

Uh oh!

codecov bot commented Feb 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

nevencaplar commented Feb 4, 2026

Uh oh!

dougbrn left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

github-actions bot commented Feb 3, 2026 •

edited

Loading

nevencaplar commented Feb 3, 2026 •

edited

Loading

codecov bot commented Feb 3, 2026 •

edited

Loading