Skip to content

Commit 407542e

Browse files
committed
initial commits and contents
1 parent 89aa027 commit 407542e

File tree

1 file changed

+63
-0
lines changed

1 file changed

+63
-0
lines changed

intermediate/intro-to-zarr.ipynb

Lines changed: 63 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,63 @@
1+
{
2+
"cells": [
3+
{
4+
"cell_type": "markdown",
5+
"id": "8253fe2d",
6+
"metadata": {},
7+
"source": [
8+
"# Intro to Zarr\n",
9+
"\n",
10+
"This notebook provides a brief introduction to Zarr and how to\n",
11+
"use it in cloud environments for scalable, chunked, and compressed data storage.\n",
12+
"Zarr is a file format with implementations in different languages. In this tutorial, we will look at an example of how to use the Zarr format by looking at some features of the `zarr-python` library and how Zarr files can be opened with `xarray`.\n",
13+
"\n",
14+
"## What is Zarr?\n",
15+
"\n",
16+
"The Zarr data format is an open, community-maintained format designed for efficient, scalable storage of large N-dimensional arrays. It stores data as compressed and chunked arrays in a format well-suited to parallel processing and cloud-native workflows.\n",
17+
"\n",
18+
"### Zarr Data Organization:\n",
19+
"- **Arrays**: N-dimensional arrays that can be chunked and compressed.\n",
20+
"- **Groups**: A container for organizing multiple arrays and other groups with a hierarchical structure.\n",
21+
"- **Metadata**: JSON-like metadata describing the arrays and groups, including information about dimensions, data types, and compression.\n",
22+
"- **Dimensions and Shape**: Arrays can have any number of dimensions, and their shape is defined by the number of elements in each dimension.\n",
23+
"- **Coordinates & Indexing**: Zarr supports coordinate arrays for each dimension, allowing for efficient indexing and slicing.\n",
24+
"\n",
25+
"The diagram below showing the structure of a Zarr file:\n",
26+
"![EarthData](https://learning.nceas.ucsb.edu/2025-04-arctic/images/zarr-chunks.png)\n"
27+
]
28+
},
29+
{
30+
"cell_type": "code",
31+
"execution_count": null,
32+
"id": "ae9c38ed",
33+
"metadata": {
34+
"vscode": {
35+
"languageId": "plaintext"
36+
}
37+
},
38+
"outputs": [],
39+
"source": []
40+
},
41+
{
42+
"cell_type": "code",
43+
"execution_count": null,
44+
"id": "233640b0",
45+
"metadata": {
46+
"vscode": {
47+
"languageId": "plaintext"
48+
}
49+
},
50+
"outputs": [],
51+
"source": [
52+
"The Xarray library provides a rich API for working with Zarr data, slicing, and selecting data. \n"
53+
]
54+
}
55+
],
56+
"metadata": {
57+
"language_info": {
58+
"name": "python"
59+
}
60+
},
61+
"nbformat": 4,
62+
"nbformat_minor": 5
63+
}

0 commit comments

Comments
 (0)