You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: tutorials/parquet-catalog-demos/euclid-hats-parquet.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -21,15 +21,15 @@ This notebook introduces the [Euclid Q1](https://irsa.ipac.caltech.edu/data/Eucl
21
21
22
22
By the end of this tutorial, you will:
23
23
24
-
- Query the dataset to find and create figures for galaxies, QSOs, and stars with quality fluxes, redshifts, and morphology.
24
+
- Query the dataset for galaxies, QSOs, and stars with quality fluxes, redshifts, and morphology.
25
25
- Understand the format and schema of this dataset.
26
26
- Learn how to work with this HATS Parquet product using the PyArrow Python library.
27
27
28
28
+++
29
29
30
30
## 1. Introduction
31
31
32
-
The Collection includes a HATS Catalog (main data product), Margin Cache (10 arcsec), and Index Table (object_id).
32
+
The Collection includes a HATS Catalog (main data product), Margin Cache (10 arcsec), and Index Table (object_id) (described in [HATS partitioning and HATS Collections](https://irsa.ipac.caltech.edu/docs/parquet_catalogs/#hats)).
33
33
The Catalog includes the twelve Euclid Q1 tables listed below, joined on the column 'object_id' into a single Parquet dataset with 1,594 columns.
34
34
There are 29,953,430 rows, one per Euclid MER Object, and the total data volume is 400 GB.
35
35
The data includes several different redshift measurements, several flux measurements for each Euclid band, and flux measurements for bands from several ground-based observatories -- in addition to morphological and other measurements.
It was generated by Phosphoros, a fully Bayesian template-fitting code.
282
282
It was computed for all MER objects, but the input models assumed galaxy.
283
283
The model grid was built spanning the parameters: redshift (z in [0, 6]), galaxy SED, intrinsic reddening curve, intrinsic attenuation.
284
-
Phosphoros should be better for cosmology than ML algorithms (which are more typical) due to the scarcity of spectroscopic "truth" training data above z ~ 1.
284
+
Phosphoros should be better for cosmology than ML algorithms (which are more typical) due to the scarcity of spectroscopic "truth" training data above z ~ 1 (see Tucci).
285
285
286
286
"physparam_phz_pp_median_redshift" is the median of the photometric redshift PDF that was produced for galaxy-classed objects by the physical-properties branch of the PHZ pipeline (i.e., non-cosmology).
0 commit comments