You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/open-datasets/samples.md
+7-7Lines changed: 7 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,14 +5,14 @@ ms.service: open-datasets
5
5
ms.topic: sample
6
6
author: fbsolo-ms1
7
7
ms.author: franksolomon
8
-
ms.date: 05/06/2020
8
+
ms.date: 07/29/2024
9
9
---
10
10
11
11
# Example Jupyter notebooks show how to enrich data with Open Datasets
12
-
The example Jupyter notebooks for Azure Open Datasets show you how to load open datasets and use them to enrich demo data. Techniques include use of Apache Spark and Pandas to process data.
12
+
The example Jupyter notebooks for Azure Open Datasets explain how to load open datasets, and use them to enrich demo data. The techniques include use of Apache Spark and Pandas to process data.
13
13
14
-
>[!IMPORTANT]
15
-
>When working in a non-Spark environment, Open Datasets allows downloading only one month of data at a time with certain classes in order to avoid MemoryError with large datasets.
14
+
>[!IMPORTANT]
15
+
>When working in a non-Spark environment, Open Datasets allows downloads of only one month of data at a time with certain classes, to avoid MemoryError problems with large datasets.
16
16
17
17
## Load NOAA Integrated Surface Database (ISD) data
18
18
|Notebook | Description |
@@ -23,14 +23,14 @@ The example Jupyter notebooks for Azure Open Datasets show you how to load open
|[Join demo data with weather data - Pandas](https://github.com/Azure/OpenDatasetsNotebooks/blob/master/tutorials/data-join/02-weather-join-in-pandas.ipynb)| Join a 1-month demo dataset of sensor locations with weather readings in a Pandas dataframe. |
26
+
|[Join demo data with weather data - Pandas](https://github.com/Azure/OpenDatasetsNotebooks/blob/master/tutorials/data-join/02-weather-join-in-pandas.ipynb)| Join a one-month demo dataset of sensor locations with weather readings in a Pandas dataframe. |
27
27
|[Join demo data with weather data – Spark](https://github.com/Azure/OpenDatasetsNotebooks/blob/master/tutorials/data-join/01-weather-join-in-spark.ipynb)| Join a demo dataset of sensor locations with weather readings in a Spark dataframe. |
|[Taxi trip data enriched with weather data - Pandas](https://github.com/Azure/OpenDatasetsNotebooks/blob/master/tutorials/data-join/04-nyc-taxi-join-weather-in-pandas.ipynb)| Load NYC green taxi data (over 1 month) and enrich it with weather data in a Pandas dataframe. This example overrides the method `get_pandas_limit` and balances data load performance with the amount of data.|
33
-
|[Taxi trip data enriched with weather data – Spark](https://github.com/Azure/OpenDatasetsNotebooks/blob/master/tutorials/data-join/03-nyc-taxi-join-weather-in-spark.ipynb)| Load NYC green taxi data and enrich it with weather data, in Spark dataframe. |
32
+
|[Taxi trip data enriched with weather data - Pandas](https://github.com/Azure/OpenDatasetsNotebooks/blob/master/tutorials/data-join/04-nyc-taxi-join-weather-in-pandas.ipynb)| Load NYC green taxi data (over one month) and enrich it with weather data in a Pandas dataframe. This example overrides the method `get_pandas_limit` and balances data load performance with the amount of data.|
33
+
|[Taxi trip data enriched with weather data – Spark](https://github.com/Azure/OpenDatasetsNotebooks/blob/master/tutorials/data-join/03-nyc-taxi-join-weather-in-spark.ipynb)| Load NYC green taxi data, and enrich it with weather data, in Spark dataframe. |
0 commit comments