Skip to content

Commit f1e90ef

Browse files
authored
Update 2024-12-03-mmu.md
1 parent c9113cb commit f1e90ef

File tree

1 file changed

+212
-29
lines changed

1 file changed

+212
-29
lines changed

_posts/2024-12-03-mmu.md

Lines changed: 212 additions & 29 deletions
Original file line numberDiff line numberDiff line change
@@ -19,7 +19,7 @@ Astronomy has always been a data-rich science — but in recent years, the sheer
1919
That’s why we’re excited to have partnered with the **Multimodal Universe** collaboration to introduce a new, large-scale curated collection of standardized data designed to accelerate ML research in astronomy. If you’ve ever dreamed of having a unified resource that seamlessly ties together images, spectra, time-series, and more, from multiple surveys, we built the Multimodal Universe with you in mind.
2020

2121
<p align="center">
22-
<img src="/images/blog/mmu_dset_examples.png" alt="Examples of data in the MMU dataset" width="95%" style="mix-blend-mode: darken;">
22+
<img src="/images/blog/mmu_img.png" alt="Overview of the cross-matching scheme" width="95%" style="mix-blend-mode: darken;">
2323
</p>
2424

2525

@@ -29,36 +29,218 @@ That’s why we’re excited to have partnered with the **Multimodal Universe**
2929

3030
#### What’s in the Multimodal Universe?
3131

32+
<p align="center">
33+
<img src="/images/blog/mmu_dset_examples.png" alt="Examples of data in the MMU dataset" width="95%" style="mix-blend-mode: darken;">
34+
</p>
35+
3236
We’ve combined publicly available data from **major astronomical surveys** into one consistently cross-matched framework, summarized in the table below. Images, spectra, hyperspectral data cubes, time-series data… they’re all in here! Each dataset has been carefully pre-processed, documented, and aligned to play nicely with one another right out of the box.
3337

34-
| Modality | Source Survey | N_c | Shape | Number of samples | Main science |
35-
|---------------------|-------------------------|-------|------------|-------------------|----------------------|
36-
| Images | Legacy Surveys DR10 | 4 | 160x160 | 124M | Galaxies |
37-
| Images | Legacy Surveys North | 3 | 152x152 | 15M | Galaxies |
38-
| Images | HSC | 5 | 160x160 | 477K | Galaxies |
39-
| Images | BTS | 3 | 63x63 | 400K | Supernovae |
40-
| Images | JWST | 6-7 | 96x96 | 300K | Galaxies |
41-
| Spectra | Gaia BP/RP | - | 110 | 220M | Stars |
42-
| Spectra | SDSS-II | - | Variable | 4M | Galaxies, Stars |
43-
| Spectra | DESI | - | 7081 | 1M | Galaxies |
44-
| Spectra | APOGEE SDSS-III | - | 7514 | 716k | Stars |
45-
| Spectra | GALAH | - | Variable | 325k | Stars |
46-
| Spectra | Chandra | - | Variable | 129K | Galaxies, Stars |
47-
| Spectra | VIPERS | - | 557 | 91K | Galaxies |
48-
| Hyperspectral Image | MaNGA SDSS-IV | 4563 | 96x96 | 12k | Galaxies |
49-
| Time Series | PLAsTiCC | 6 | Variable | 3.5M | Time-varying objects |
50-
| Time Series | TESS | 1 | Variable | 1M | Exoplanets, Stars |
51-
| Time Series | CfA Sample | 5-11 | Variable | 1K | Supernovae |
52-
| Time Series | YSE | 6 | Variable | 2K | Supernovae |
53-
| Time Series | PS1 SNe Ia | 4 | Variable | 369 | Supernovae |
54-
| Time Series | DES Y3 SNe Ia | 4 | Variable | 248 | Supernovae |
55-
| Time Series | SNLS | 4 | Variable | 239 | Supernovae |
56-
| Time Series | Foundation | 4 | Variable | 180 | Supernovae |
57-
| Time Series | CSP SNe Ia | 9 | Variable | 134 | Supernovae |
58-
| Time Series | Swift SNe Ia | 6 | Variable | 117 | Supernovae |
59-
| Tabular | Gaia | - | - | 220M | Stars |
60-
| Tabular | PROVABGS | - | - | 221K | Galaxy |
61-
| Tabular | Galaxy10 DECaLS | - | - | 15K | Galaxy |
38+
<table>
39+
<thead>
40+
<tr>
41+
<th>Modality</th>
42+
<th>Source Survey</th>
43+
<th>N<sub>c</sub></th>
44+
<th>Shape</th>
45+
<th>Number of samples</th>
46+
<th>Main science</th>
47+
</tr>
48+
</thead>
49+
<tbody>
50+
<!-- Images -->
51+
<tr>
52+
<td rowspan="5">Images</td>
53+
<td>Legacy Surveys DR10</td>
54+
<td>4</td>
55+
<td>160×160</td>
56+
<td>124M</td>
57+
<td>Galaxies</td>
58+
</tr>
59+
<tr>
60+
<td>Legacy Surveys North</td>
61+
<td>3</td>
62+
<td>152×152</td>
63+
<td>15M</td>
64+
<td>Galaxies</td>
65+
</tr>
66+
<tr>
67+
<td>HSC</td>
68+
<td>5</td>
69+
<td>160×160</td>
70+
<td>477K</td>
71+
<td>Galaxies</td>
72+
</tr>
73+
<tr>
74+
<td>BTS</td>
75+
<td>3</td>
76+
<td>63×63</td>
77+
<td>400K</td>
78+
<td>Supernovae</td>
79+
</tr>
80+
<tr>
81+
<td>JWST</td>
82+
<td>6-7</td>
83+
<td>96×96</td>
84+
<td>300K</td>
85+
<td>Galaxies</td>
86+
</tr>
87+
<!-- Spectra -->
88+
<tr>
89+
<td rowspan="7">Spectra</td>
90+
<td>Gaia BP/RP</td>
91+
<td>-</td>
92+
<td>110 [1]</td>
93+
<td>220M</td>
94+
<td>Stars</td>
95+
</tr>
96+
<tr>
97+
<td>SDSS-II</td>
98+
<td>-</td>
99+
<td>Variable</td>
100+
<td>4M</td>
101+
<td>Galaxies, Stars</td>
102+
</tr>
103+
<tr>
104+
<td>DESI</td>
105+
<td>-</td>
106+
<td>7081</td>
107+
<td>1M</td>
108+
<td>Galaxies</td>
109+
</tr>
110+
<tr>
111+
<td>APOGEE SDSS-III</td>
112+
<td>-</td>
113+
<td>7514</td>
114+
<td>716k</td>
115+
<td>Stars</td>
116+
</tr>
117+
<tr>
118+
<td>GALAH</td>
119+
<td>-</td>
120+
<td>Variable</td>
121+
<td>325k</td>
122+
<td>Stars</td>
123+
</tr>
124+
<tr>
125+
<td>Chandra</td>
126+
<td>-</td>
127+
<td>Variable</td>
128+
<td>129K</td>
129+
<td>Galaxies, Stars</td>
130+
</tr>
131+
<tr>
132+
<td>VIPERS</td>
133+
<td>-</td>
134+
<td>557</td>
135+
<td>91K</td>
136+
<td>Galaxies</td>
137+
</tr>
138+
<!-- Hyperspectral Image -->
139+
<tr>
140+
<td>Hyperspectral Image</td>
141+
<td>MaNGA SDSS-IV</td>
142+
<td>4563</td>
143+
<td>96×96</td>
144+
<td>12k</td>
145+
<td>Galaxies</td>
146+
</tr>
147+
<!-- Time Series -->
148+
<tr>
149+
<td rowspan="10">Time Series</td>
150+
<td>PLAsTiCC [2]</td>
151+
<td>6</td>
152+
<td>Variable</td>
153+
<td>3.5M</td>
154+
<td>Time-varying objects</td>
155+
</tr>
156+
<tr>
157+
<td>TESS</td>
158+
<td>1</td>
159+
<td>Variable</td>
160+
<td>1M</td>
161+
<td>Exoplanets, Stars</td>
162+
</tr>
163+
<tr>
164+
<td>CfA Sample</td>
165+
<td>5-11</td>
166+
<td>Variable</td>
167+
<td>1K</td>
168+
<td>Supernovae</td>
169+
</tr>
170+
<tr>
171+
<td>YSE</td>
172+
<td>6</td>
173+
<td>Variable</td>
174+
<td>2K</td>
175+
<td>Supernovae</td>
176+
</tr>
177+
<tr>
178+
<td>PS1 SNe Ia</td>
179+
<td>4</td>
180+
<td>Variable</td>
181+
<td>369</td>
182+
<td>Supernovae</td>
183+
</tr>
184+
<tr>
185+
<td>DES Y3 SNe Ia</td>
186+
<td>4</td>
187+
<td>Variable</td>
188+
<td>248</td>
189+
<td>Supernovae</td>
190+
</tr>
191+
<tr>
192+
<td>SNLS</td>
193+
<td>4</td>
194+
<td>Variable</td>
195+
<td>239</td>
196+
<td>Supernovae</td>
197+
</tr>
198+
<tr>
199+
<td>Foundation</td>
200+
<td>4</td>
201+
<td>Variable</td>
202+
<td>180</td>
203+
<td>Supernovae</td>
204+
</tr>
205+
<tr>
206+
<td>CSP SNe Ia</td>
207+
<td>9</td>
208+
<td>Variable</td>
209+
<td>134</td>
210+
<td>Supernovae</td>
211+
</tr>
212+
<tr>
213+
<td>Swift SNe Ia</td>
214+
<td>6</td>
215+
<td>Variable</td>
216+
<td>117</td>
217+
<td>Supernovae</td>
218+
</tr>
219+
<!-- Tabular -->
220+
<tr>
221+
<td rowspan="3">Tabular</td>
222+
<td>Gaia</td>
223+
<td>-</td>
224+
<td>-</td>
225+
<td>220M</td>
226+
<td>Stars</td>
227+
</tr>
228+
<tr>
229+
<td>PROVABGS</td>
230+
<td>-</td>
231+
<td>-</td>
232+
<td>221K</td>
233+
<td>Galaxy</td>
234+
</tr>
235+
<tr>
236+
<td>Galaxy10 DECaLS</td>
237+
<td>-</td>
238+
<td>-</td>
239+
<td>15K</td>
240+
<td>Galaxy</td>
241+
</tr>
242+
</tbody>
243+
</table>
62244

63245
Up-to-date instructions on how to download the data, plus details about cross-matching and referencing the original sources, can be found on the [Multimodal Universe GitHub](https://github.com/MultimodalUniverse/MultimodalUniverse/).
64246

@@ -97,6 +279,7 @@ We host the Multimodal Universe dataset in full at the Flatiron Institute, with
97279
We envision this living dataset as a **central hub** for ML-driven astronomy, drastically cutting down on the data-engineering overhead that has historically slowed progress.
98280

99281
#### Getting Started
282+
Below is a brief overview on how to jumpstart your research with MMU:
100283

101284
1. **Visit the Landing Page**
102285
Head to the [Multimodal Universe GitHub](https://github.com/MultimodalUniverse/MultimodalUniverse/) for the latest version, plus scripts for data retrieval and usage.

0 commit comments

Comments
 (0)