You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: index.qmd
+26-5Lines changed: 26 additions & 5 deletions
Original file line number
Diff line number
Diff line change
@@ -3,14 +3,35 @@ title: "NSIDC Data Cookbook"
3
3
author: NSIDC
4
4
---
5
5
6
-
## Welcome
7
-
8
-
Welcome to the NSIDC Data Cookbook!
9
-
10
6
<!-- prettier-ignore -->
11
7
> [!WARNING]
12
8
> This cookbook is under active development. Major changes to the structure of
13
9
> the book and its content are expected. We are striving to develop content
14
10
> that is well-tested and peer-reviewed, but nothing contained here should be
15
11
> expected to work correctly (or at all!) in this early phase. Many sections contain only an outline of the content.
16
-
> These sections will have content added at the project develops.
12
+
> These sections will have content added as the project develops.
13
+
14
+
## Welcome to the NSIDC Data Cookbook!
15
+
16
+
This Cookbook for National Snow and Ice Data Center Distributed Active Archive Center ([NSIDC DAAC](https://nsidc.org/data/data-programs/nsidc-daac)) Data is more Julia Child’s “Mastering the Art of French Cooking” than just a collection of recipes (or data tutorials). The aim is to not only provide easy to follow recipes for working with data but also to provide an understanding of the data managed by NSIDC and the tools available to work with the data. It is not an in-depth guide to the many datasets housed by NSIDC but instead is an introduction to the common types of data, file formats and data structures. The hope is that the cookbook will provide a guide and foundataion to help you *master the art of working with cryospheric data*.
17
+
18
+
## How to use this cookbook
19
+
20
+
If you are new to NASA or NSIDC data, begin with the **Getting Started** section. Beyond that, the cookbook is divided into four major sections: **NSIDC Data**, **Working with Data**, **How-To Guides**, and **Tutorials / Workshops**. We also include some **Appendices** that include a best practices guide for Python scripting, some background details on computing, and how to contribute to the cookbook (if you are so inclined!).
21
+
22
+
The **NSIDC Data** section is an introduction to the types of data managed by NSIDC, the file formats used to store that data, coordinate reference systems and grids comon to NSIDC data, and a guide to NASA terminology for Earth science data. There is also an introduction to some “quirky” datasets that do not fit more common simple data structures or are just a little more difficult to work with.
23
+
24
+
The **Working with Data** section is a guide to the tools and applications for accomplishing common steps in scientific workflows: finding and accessing data, wrangling data, and visualizing data. Wrangling data covers subsetting, reformatting, reprojecting and resampling data in preparation for analysis.
25
+
26
+
The **How-To Guides** section is a list of recipes to acheive common tasks. It is intended a list of solutions to common scientific programming tasks.
27
+
28
+
Lastly, the **Tutorials / Workshops** section serves to highlight content that was developed for past events or specific use cases.
29
+
30
+
## About the NSIDC DAAC
31
+
32
+
NSIDC has managed the NASA National Snow and Ice Data Center Distributed Active Archive Center (NSIDC DAAC) since 1993, archiving and distributing cryospheric and related geophysical data from NASA Earth-observing satellite missions, airborne campaigns and field observations. The NSIDC DAAC provides hundreds of free and open-access NASA Earth science data sets, detailed data documentation, data tools, resources and tutorials, as well as robust data user support services. These data can be used to study topics relating to snow cover, sea ice, ice sheets, ice shelves, glaciers, frozen ground, soil moisture, climate interactions, and more!
A [NASA Earthdata Login](https://urs.earthdata.nasa.gov/) is required for all data access methods, including download and direct cloud access.
8
+
9
+
#### **NASA Earthdata Cloud**
10
+
11
+
The NASA Earthdata Cloud is NASA’s cloud-based archive of Earth observation data. It is hosted by Amazon Web Services (AWS) in region us-west-2. Downloading data from the Earthdata Cloud to your local computer or storage system is and will continue to be free for users. You don't need an AWS account to download data. Alternatively, you can work directly with NSIDC DAAC holdings stored in Amazon S3 (the object storage service used by NASA Earthdata Cloud). This direct access method allows you stream data into memory and analyze it "in place", avoiding large transfers to your local computer.
12
+
13
+
Options for accessing NASA Earthdata are outlined below:
14
+
15
+
#### **Option 1: Browser-Based Download (No Coding Required)**
16
+
17
+
If you prefer a graphical interface, use one of these web-based tools:
18
+
19
+
[NASA Earthdata Search](https://search.earthdata.nasa.gov) - Explore, filter and customize data before downloading files. Instructions for using Earthdata Search to find and access data can be found in our Earthdata Search [guide](https://nsidc.org/data/user-resources/help-center/search-order-and-customize-nsidc-daac-data-nasa-earthdata-search).
20
+
21
+
image
22
+
23
+
Data Access Tool - The NSIDC Data Access Tool can be accessed from a data set's landing page.
24
+
Example: [ATLAS/ICESat-2 L2A Global Geolocated Photon Data. (ATL03, Version 7) Landing Page](https://nsidc.org/data/atl03/versions/7). In the right-hand menu on a data set landing page, click "Data Access & Tools". You will be directed to the top of a list of tool and service "cards" with links to various data access methods. Click on the card title "Data Access Tool" to be directed to the Data Access Tool interface for that particular data set. The Data Access Tool allows users to filter files within a data set using spatial bounds, temporal ranges, and filename wildcards.
25
+
26
+
Help article on using the Data Access Tool: [https://nsidc.org/data/user-resources/help-center/filter-and-order-data-set-web-page-using-data-access-tool](https://nsidc.org/data/user-resources/help-center/filter-and-order-data-set-web-page-using-data-access-tool)
27
+
28
+
#### **Option 2: Python (earthaccess Library)**
29
+
30
+
The earthaccess Python library provides a streamlined way to search, authenticate, and download NASA Earthdata. It works both locally and in the cloud (e.g., on EC2 or JupyterHub environments).
*Have reference/link to one of our "how do I's" here?*
35
+
36
+
#### **Option 3: Command Line Tools (```wget```, ```curl```, PODAAC subscriber)**
37
+
38
+
If you're comfortable using the terminal, command-line tools allow flexible and efficient downloading.
39
+
40
+
```wget``` or ```curl``` – Download known files or batch download from an HTTPS URL list.
41
+
Learn to create .txt files of download links here: [Creating Text Files of HTTPS and S3 URLs for Earthdata Cloud Data Access](https://nsidc.org/data/user-resources/help-center/creating-text-files-https-and-s3-urls-earthdata-cloud-data-access)
42
+
43
+
[PODAAC Data Subscriber](https://github.com/podaac/data-subscriber/blob/main/README.md) – A Python-based command-line tool that supports spatial and temporal filtering. Though designed for PODAAC, it can be adapted for NSIDC and other DAACs.
44
+
45
+
All of these command line options are detailed in this help article: [Downloading Data from Earthdata Cloud to Your Local Computer Using the Command Line](https://nsidc.org/data/user-resources/help-center/downloading-data-earthdata-cloud-your-local-computer-using-command-line)
There are many common data types/ structures and terminology to go along with them. Some examples:
6
6
7
-
_We should add information for tabular data and Data Frames_
7
+
-**Tabular** – rows and columns, often stored in CSV or TSV files. Each row is an observation, and each column is a variable (e.g., time, latitude, longitude, temperature).
8
+
-**Data Frames** – tabular data structures used in programming languages like R or Python (pandas). Data frames allow for more complex indexing, metadata, and transformations than simple tabular files.
9
+
-**Swath** – along-track measurements collected as the satellite passes over an area, usually irregular in shape and resolution.
10
+
-**Raster / Grids** – data organized into regular grid cells, each cell representing a spatial unit (e.g., 25 km × 25 km grid of snow cover).
11
+
-**Resampling** – methods for transforming data between swath, raster, or other structures (e.g., nearest neighbor, bilinear interpolation).
8
12
9
-
- Swath
10
-
- Along-track
11
-
- Raster/Grids
12
-
- Resampling
13
+
14
+
How to work with file formats commonly found at NSIDC: In most cases, it’s best to avoid low-level libraries such as `netCDF4` or `h5py`. Higher-level libraries provide more intuitive access, automatically handle metadata, and streamline analysis. Some format descriptions and reccomendations are in the table below.
|**NetCDF4 / NetCDFx**| Multidimensional climate/remote sensing data (time, lat, lon, variables). |`xarray` (`xr.open_dataset`) in Python; `terra` or `ncdf4` in R. |
19
+
|**HDF5**| Hierarchical format for storing arrays, tables, and metadata; used widely in NASA products. |`xarray`, `pandas`; avoid `h5py` unless necessary. |
20
+
|**HDF-EOS**| Earth Observing System variant of HDF, often with swath, grid, or point structures. |`xarray`, `h5netcdf`, NASA `harmony-py`. |
21
+
|**Shapefile**| Vector geospatial data (points, lines, polygons) with CRS support. |`geopandas` (Python); `sf` (R). |
Copy file name to clipboardExpand all lines: reference-guides/dataset-search-and-discovery.qmd
+42-5Lines changed: 42 additions & 5 deletions
Original file line number
Diff line number
Diff line change
@@ -2,8 +2,45 @@
2
2
title: "Dataset Search and Discovery"
3
3
---
4
4
5
-
- Getting an EDL
6
-
- A simple earthaccess search and download
7
-
- EDS
8
-
- curl
9
-
- wget
5
+
### **NASA Common Metadata Repository**
6
+
7
+
There are a variety of search and discovery methods allowing users to find NASA Earthdata from a graphical user interface (GUI) or using programmatic access methods, depending on preference. Any search and discovery method leverages the [NASA Common Metadata Repository](https://www.earthdata.nasa.gov/about/esdis/eosdis/cmr) to find data of interest.
8
+
9
+
*NASA's Common Metadata Repository (CMR) is a high-performance, high-quality, continuously evolving metadata system that catalogs all data and service metadata records for NASA's Earth Observing System Data and Information System (EOSDIS) and will be the authoritative management system for all EOSDIS metadata. These metadata records are registered, modified, discovered, and accessed through programmatic interfaces leveraging standard protocols and APIs.*
10
+
11
+
*CMR is the keystone that makes NASA's Earth observation data discoverable. As a metadata repository, CMR contains Unified Metadata Model (UMM) schema records that describe individual Earth data files (UMM-Granules), collections of files (UMM-Collections), scientific details about the data files (UMM-Variables), related tools and services that act on the data files (UMM-Tools and -Services), and pertinent relationships between these concepts. Using the UMM allows CMR to host its metadata records in several supported native formats, with translation services available between formats.*
12
+
13
+
### **Search and Discovery Methods**
14
+
15
+
16
+
#### **Graphical User Interfaces**
17
+
18
+
19
+
##### **NASA Earthdata Search**
20
+
21
+
All of NASA Earthdata are available through [NASA Earthdata Search](https://search.earthdata.nasa.gov), not just data archived through the NSIDC DAAC. [Help article](https://nsidc.org/data/user-resources/help-center/search-order-and-customize-nsidc-daac-data-nasa-earthdata-search) for using Earthdata Search.
22
+
23
+
##### **NSIDC Data Access Tool**
24
+
25
+
The NSIDC Data Access Tool is accessible from landing pages on the NSIDC website, and is an easy way to filter for files of interest without leaving nsidc.org. [Help article](https://nsidc.org/data/data-access-tool) for the NSIDC Data Access Tool.
26
+
27
+
#### **Programmatic**
28
+
29
+
The majority of the shorter form "How-To Guides" and longer form "Tutorials / Workshops" will present programmatic data access methods using Python. We hope to expand to other languages, such as R, in the future.
30
+
31
+
**earthaccess Python library**
32
+
33
+
`earthaccess` is a Python library to **search for** and **download** or **stream** NASA Earth science data with just a few lines of code.
34
+
35
+
>*"earthaccess revolutionizes NASA data access by drastically reducing the complexity and code required. Since open science is a collaborative effort involving people from different technical backgrounds, our team took the approach that data analysis can and should be made more inclusive and accessible by reducing the complexities of underlying systems."*
36
+
>
37
+
>Luis López, an NSIDC software developer and earthaccess creator
38
+
39
+
Instructions for searching for data using earthaccess [here](https://earthaccess.readthedocs.io/en/stable/user_guide/search/).
NASA Earthdata are freely accessible to all users, but a NASA Earthdata Login is required for access. Users can register for an Earthdata Login at [https://urs.earthdata.nasa.gov/](https://urs.earthdata.nasa.gov/).
6
+
7
+
8
+
Some programmatic data access methods are simplified by setting up your Earthdata Login credentials in a netrc file for easy authentication.
9
+
Instructions for creating a netrc file can be found [here](https://nsidc.org/data/user-resources/help-center/creating-netrc-file-earthdata-login).
0 commit comments