You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: tutorials/parquet-catalog-demos/irsa-hats-with-lsdb.md
+8-42Lines changed: 8 additions & 42 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -11,7 +11,7 @@ kernelspec:
11
11
name: python3
12
12
---
13
13
14
-
# Access IRSA HATS collections using lsdb
14
+
# Access HATS Collections Using LSDB: Euclid Q1 and ZTF DR23
15
15
16
16
+++
17
17
@@ -246,13 +246,9 @@ These can be useful for our **column filters** — the columns we want to SELECT
246
246
We can also filter the schema DataFrame by name, unit, type, etc., to identify columns most relevant for our **row filters** — WHERE rows satisfy conditions on column values for our query.
247
247
For example, let's explore the columns that are part of the PHZ (photometric redshift) catalog to identify photometric redshifts and source types:
248
248
249
-
```{code-cell} ipython3
250
-
euclid_schema_df[euclid_schema_df["name"].str.startswith("phz_")] # phz_ prefix is for PHZ catalog columns in this merged catalog
251
-
```
252
-
253
249
```{code-cell} ipython3
254
250
euclid_schema_df[
255
-
euclid_schema_df["name"].str.startswith("phz_")
251
+
euclid_schema_df["name"].str.startswith("phz_") # phz_ prefix is for PHZ catalog columns in this merged catalog
256
252
& euclid_schema_df["type"].str.contains("int") # to see flag type columns
ztf_schema_df[ztf_schema_df["unit"].str.contains("mag")] # to identify magnitude quantities
327
-
```
328
-
329
-
You can explore the schema further to identify other columns of interest.
321
+
You can filter the schema further by units, type, etc. to identify other columns of interest.
330
322
It's also useful to go through the [ZTF DR23 release notes](https://irsa.ipac.caltech.edu/data/ZTF/docs/releases/ztf_release_notes_latest) and [explanatory supplement](https://irsa.ipac.caltech.edu/data/ZTF/docs/ztf_explanatory_supplement.pdf) at IRSA for more details on column selections and caveats.
331
323
332
324
For this tutorial, the following columns are most relevant to us:
333
325
334
326
```{code-cell} ipython3
335
327
ztf_columns = ztf_schema_df["name"].tolist()[:6]
336
-
ztf_columns
337
-
```
338
-
339
-
```{code-cell} ipython3
340
328
ztf_columns.extend([
341
329
'fid', 'filtercode',
342
330
'ngoodobsrel',
@@ -436,7 +424,9 @@ with Client(n_workers=get_nworkers(euclid_x_ztf),
436
424
euclid_x_ztf_df
437
425
```
438
426
439
-
[Optional] Let's purify the crossmatched catalog by analyzing the distance between matched sources and removing the matches that don't meet a quality cut on percentile.
427
+
### 5.3 [Optional] Filter the crossmatched catalog
428
+
429
+
Let's purify the crossmatched catalog by analyzing the distance between matched sources and removing the matches that don't meet a quality cut on percentile.
440
430
We also keep the matches that are outside this cutoff but are still within the same 19th order HEALPix tile.
441
431
442
432
```{code-cell} ipython3
@@ -467,7 +457,7 @@ Going forward, we will use this purified crossmatched catalog `euclid_x_ztf_filt
467
457
468
458
+++
469
459
470
-
### 5.3 Identify objects of interest from the crossmatch
460
+
### 5.4 Identify objects of interest from the crossmatch
0 commit comments