Skip to content

Commit 3e6871a

Browse files
authored
Merge branch 'main' into ctrees-add-amazonia-tree-height
2 parents d355370 + 6c3f78c commit 3e6871a

10 files changed

+180
-11
lines changed

datasets/allen-sea-ad-atlas.yaml

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -30,18 +30,30 @@ Resources:
3030
Type: S3 Bucket
3131
Explore:
3232
- '[Browse Bucket](https://sea-ad-single-cell-profiling.s3.amazonaws.com/index.html)'
33+
- Description: "Update notifications for s3://sea-ad-single-cell-profiling. Users can subscribe to this SNS topic with [AWS Lambda](https://aws.amazon.com/lambda/) or [AWS Simple Queue Service](https://aws.amazon.com/sqs/)."
34+
ARN: arn:aws:sns:us-west-2:208217671510:sea-ad-single-cell-profiling-object_created
35+
Region: us-west-2
36+
Type: SNS Topic
3337
- Description: Quantitative neuropathology (full resolution images, processed images, and quantifications) in a public bucket
3438
ARN: arn:aws:s3:::sea-ad-quantitative-neuropathology
3539
Region: us-west-2
3640
Type: S3 Bucket
3741
Explore:
3842
- '[Browse Bucket](https://sea-ad-quantitative-neuropathology.s3.amazonaws.com/index.html)'
43+
- Description: "Update notifications for s3://sea-ad-quantitative-neuropathology. Users can subscribe to this SNS topic with [AWS Lambda](https://aws.amazon.com/lambda/) or [AWS Simple Queue Service](https://aws.amazon.com/sqs/)."
44+
ARN: arn:aws:sns:us-west-2:208217671510:sea-ad-quantitative-neuropathology-object_created
45+
Region: us-west-2
46+
Type: SNS Topic
3947
- Description: Spatial transcriptomics data files in a public bucket
4048
ARN: arn:aws:s3:::sea-ad-spatial-transcriptomics
4149
Region: us-west-2
4250
Type: S3 Bucket
4351
Explore:
4452
- '[Browse Bucket](https://sea-ad-spatial-transcriptomics.s3.amazonaws.com/index.html)'
53+
- Description: "Update notifications for s3://sea-ad-spatial-transcriptomics. Users can subscribe to this SNS topic with [AWS Lambda](https://aws.amazon.com/lambda/) or [AWS Simple Queue Service](https://aws.amazon.com/sqs/)."
54+
ARN: arn:aws:sns:us-west-2:208217671510:sea-ad-spatial-transcriptomics-object_created
55+
Region: us-west-2
56+
Type: SNS Topic
4557
DataAtWork:
4658
Tools & Applications:
4759
- Title: Seattle Alzheimer’s Disease Brain Cell Atlas

datasets/blue_et.yaml

Lines changed: 33 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,33 @@
1+
Name: IWMI DIWASA Blue ET for Africa
2+
Description: Blue evapotranspiration (Blue ET) is the portion of ET derived from blue water sources, including surface water (rivers, lakes, reservoirs) and groundwater used for irrigation. It is a key component of blue water fluxes in water accounting. Blue ET consists of evaporation from irrigated fields, transpiration from irrigated crops, and water lost from artificial storage. It helps assess water productivity in irrigated agriculture, quantify consumptive water use, and support sustainable water resource management, particularly in water-scarce regions.
3+
Documentation: https://iwmi.africageoportal.com/pages/continent-africa
4+
5+
ManagedBy: "[IWMI](https://www.iwmi.org/)"
6+
UpdateFrequency: None
7+
Tags:
8+
- aws-pds
9+
- surface water
10+
- irrigated cropland
11+
- ground water
12+
- evapotranspiration
13+
- water
14+
License: "Creative Commons open license"
15+
Resources:
16+
- Description: Monthly Blue ET for Africa
17+
ARN: arn:aws:s3:::iwmi-datasets/Water_accounting_plus/Africa/Incremental_ET_M/
18+
Region: af-south-1
19+
Type: S3 Bucket
20+
Explore:
21+
- '[Browse Bucket](https://iwmi-datasets.s3.af-south-1.amazonaws.com/Cropland_partition/index.html)'
22+
23+
DataAtWork:
24+
Tutorials:
25+
- Title: Analysis of IWMI’s Water Data Products through Digital Earth Africa
26+
URL: https://learn.digitalearthafrica.org/courses/course-v1:IWMI+DIWASA1+2024_10/about
27+
AuthorName: A.T. Haile, E.T. Negash, K. Mubea, M. Tadesse
28+
AuthorURL: https://github.com/iwmiwaplus
29+
Tools & Applications:
30+
- Title: Multi-Scale Water Accounting in the Volta Basin
31+
URL: https://public.tableau.com/app/profile/iwmi.wa/viz/Voltabasinvertical/Merged?publish=yes
32+
AuthorName: iwmiwaplus
33+
AuthorURL: https://public.tableau.com/app/profile/iwmi.wa

datasets/brazil-data-cubes.yaml

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,7 @@ Documentation: http://brazildatacube.org/en/home-page-2/
44
55
ManagedBy: "[INPE - Brazil Data Cube](http://brazildatacube.org/)"
66
UpdateFrequency: New EO data cubes are added as soon as there are produced by the Brazil Data Cube project.
7+
DeprecatedNotice: This dataset is deprecated and will be removed from AWS Open Data in the near future. If you have any questions or require assistance, please contact us at [[email protected]].
78
Tags:
89
- earth observation
910
- satellite imagery
@@ -71,4 +72,4 @@ DataAtWork:
7172
AuthorName: K. R. Ferreira, et al.
7273
- Title: Building Earth Observation Data Cubes on AWS
7374
URL: https://www.proquest.com/openview/070d2a753cc88d26535c98293171a5ac/1?
74-
AuthorName: Ferreira, K R; Queiroz, G R; Marujo, R F B; Costa, R W. 
75+
AuthorName: Ferreira, K R; Queiroz, G R; Marujo, R F B; Costa, R W. 

datasets/broad-references.yaml

Lines changed: 0 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -30,10 +30,6 @@ DataAtWork:
3030
- AWS Batch
3131
- Amazon FSx
3232
Tools & Applications:
33-
- Title: Genomics Workflows on AWS - Cromwell on AWS
34-
URL: https://docs.opendata.aws/genomics-workflows/orchestration/cromwell/cromwell-examples/#real-world-example-haplotypecaller
35-
AuthorName: W. Lee Pang
36-
AuthorURL: https://www.linkedin.com/in/lee-pang-a039a26/
3733
Publications:
3834
- Title: Advancing NGS quality control to enable measurement of actionable mutations in circulating tumor DNA
3935
URL: https://www.cell.com/cell-reports-methods/pdf/S2667-2375(21)00165-X.pdf

datasets/colorado-imagery.yaml

Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,30 @@
1+
Name: State of Colorado Imagery
2+
Description: The State of Colorado has gathered public historical imagery ranging from 2005 to 2021.
3+
Documentation: https://docs.google.com/document/d/1YDHignUj9lQTMw2J-SqA96MTP8KmJYtk2ZKKC2ZYuPE/edit?usp=sharing
4+
5+
ManagedBy: State of Colorado Governor's Office of Information Technology (OIT) GIS team
6+
UpdateFrequency: Periodically
7+
Tags:
8+
- aws-pds
9+
- aerial imagery
10+
- geospatial
11+
- imaging
12+
- mapping
13+
License: https://creativecommons.org/publicdomain/zero/1.0/legalcode
14+
Resources:
15+
- Description: The State of Colorado historic public aerial imagery. Currently, NAIP is available from 2005 and 2009-2021. The National Agriculture Imagery Program is a project managed by the U.S. Department of Agriculture created to collect leaf-on imagery for the United States during peak growing seasons. The files are available as GeoTIFFs. From 2005-2017 they have a one meter resolution. After that, it is a 60cm resolution.
16+
Region: us-east-1
17+
Type: S3 Bucket
18+
DataAtWork:
19+
Tutorials:
20+
- Title: Colorado AWS Open Imagery Guide
21+
URL: https://docs.google.com/document/d/15GjCSWSzst82FZMqBqdGV0rt6FKJzt03NlQYdWwsLGE/edit?usp=sharing
22+
AuthorName: State of Colorado OIT-GIS
23+
AuthorURL: https://geodata.colorado.gov/
24+
Tools & Applications:
25+
- Title: Colorado Public Imagery Dowloader
26+
URL: https://gis.colorado.gov/imagery/
27+
AuthorName: State of Colorado OIT-GIS
28+
AuthorURL: https://geodata.colorado.gov/
29+
ADXCategories:
30+
- Public Sector Data

datasets/gatk-test-data.yaml

Lines changed: 0 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -23,8 +23,4 @@ Resources:
2323
DataAtWork:
2424
Tutorials:
2525
Tools & Applications:
26-
- Title: Genomics Workflows on AWS - Cromwell on AWS
27-
URL: https://docs.opendata.aws/genomics-workflows/orchestration/cromwell/cromwell-examples/#real-world-example-haplotypecaller
28-
AuthorName: W. Lee Pang
29-
AuthorURL: https://www.linkedin.com/in/lee-pang-a039a26/
3026
Publications:

datasets/green_et.yaml

Lines changed: 32 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,32 @@
1+
Name: IWMI DIWASA Green ET for Africa
2+
Description: Green evapotranspiration (Green ET) is the portion of ET derived from green water, which includes soil moisture and rainfall used by vegetation. It represents a key component of green water fluxes in water accounting. Green ET consists of evaporation from soil moisture in non-irrigated areas, transpiration from rainfed crops and natural vegetation, and interception losses from precipitation on vegetation. It plays a crucial role in rainfed agriculture, drought monitoring, and sustainable water management by tracking how rainfall supports plant growth.
3+
Documentation: https://iwmi.africageoportal.com/pages/continent-africa
4+
5+
ManagedBy: "[IWMI](https://www.iwmi.org/)"
6+
UpdateFrequency: None
7+
Tags:
8+
- soil moisture
9+
- rainfed cropland
10+
- interception loss
11+
- evapotranspiration
12+
- water
13+
License: "Creative commons open license"
14+
Resources:
15+
- Description: Monthly Green ET for Africa
16+
ARN: arn:aws:s3:::iwmi-datasets/Water_accounting_plus/Africa/Rainfall_ET_M/
17+
Region: af-south-1
18+
Type: S3 Bucket
19+
Explore:
20+
- '[Browse Bucket](https://iwmi-datasets.s3.af-south-1.amazonaws.com/Cropland_partition/index.html)'
21+
22+
DataAtWork:
23+
Tutorials:
24+
- Title: Analysis of IWMI’s Water Data Products through Digital Earth Africa
25+
URL: https://learn.digitalearthafrica.org/courses/course-v1:IWMI+DIWASA1+2024_10/about
26+
AuthorName: A.T. Haile, E.T. Negash, K. Mubea, M. Tadesse
27+
AuthorURL: https://github.com/iwmiwaplus
28+
Tools & Applications:
29+
- Title: Multi-Scale Water Accounting in the Volta Basin
30+
URL: https://public.tableau.com/app/profile/iwmi.wa/viz/Voltabasinvertical/Merged?publish=yes
31+
AuthorName: iwmiwaplus
32+
AuthorURL: https://public.tableau.com/app/profile/iwmi.wa

datasets/open-ceda.yaml

Lines changed: 59 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,59 @@
1+
Name: Open CEDA by Watershed
2+
Description: |
3+
CEDA is a multi-regional Environmentally-Extended Input-Output (EEIO) model developed to support a wide range of environmental systems analyses—including corporate carbon accounting and sustainable spend analysis. CEDA provides unparalleled global coverage and granularity, representing 95% of the world's GDP across 148 countries and 400 sectors, enabling robust and geographically comprehensive Scope 3 greenhouse gas (GHG) measurement.
4+
Open CEDA is the publicly avaialable version of CEDA, now easy to download and available for free for all use cases. For more information please visit our website at openceda.org
5+
CEDA 2024, the latest version of CEDA, uses 2022 as its base year, ensuring that emissions factors and economic data reflect the most recent global economic landscape available. To maintain accuracy and relevance, CEDA is updated annually with the latest data releases.
6+
At its core, CEDA connects economic exchanges to GHG emissions by quantifying the life-cycle emissions of products and services. This is achieved through the integration of input-output tables, which represent the full supply-chain network of the global economy, with GHG emissions data. As a result, CEDA provides users with a powerful tool to assess the environmental impacts embedded in corporate value chains.
7+
Documentation: https://openceda.org/
8+
9+
ManagedBy: "[Watershed Technology](https://watershed.com)"
10+
UpdateFrequency: Annual
11+
Collabs:
12+
ASDI:
13+
Tags:
14+
- sustainability
15+
Tags:
16+
- aws-pds
17+
- climate
18+
- carbon
19+
- scope 3
20+
- supply chain
21+
- spend-based models
22+
- EEIO
23+
License: Creative Commons CC BY-SA
24+
Resources:
25+
- Description: An .xlsx file containing the Open CEDA dataset
26+
ARN: arn:aws:s3:::open-ceda
27+
Region: us-west-2
28+
Type: S3 Bucket
29+
Explore:
30+
- "[Open CEDA](https://open-ceda.s3.amazonaws.com/index.html)"
31+
DataAtWork:
32+
Tutorials:
33+
- Title: For a tutoral please download the CEDA Methodology Documentation on the openceda.org website.
34+
URL: https://openceda.org/
35+
AuthorName: Watershed Technology
36+
Publications:
37+
- Title: Converting University Spending to Greenhouse Gas Emissions - A Supply Chain Carbon Footprint Analysis of UC Berkeley
38+
URL: https://nature.berkeley.edu/classes/es196/projects/2012final/DoyleK_2012.pdf
39+
AuthorName: Kelley L. Doyle, 2012
40+
- Title: A Consumption-Based Greenhouse Gas Inventory of SanFrancisco Bay Area Neighborhoods, Cities and Counties
41+
URL: https://escholarship.org/content/qt2sn7m83z/qt2sn7m83z.pdf
42+
AuthorName: Christopher M. Jones & Daniel M. Kammen, 2015
43+
- Title: Are services better for climate change?
44+
URL: https://pubs.acs.org/doi/10.1021/es0609351
45+
AuthorName: Sangwon, Suh, 2006
46+
- Title: Advancing Sustainable Materials Management - 2016 Recycling Economic Information (REI) Report
47+
URL: https://www.epa.gov/sites/default/files/2016-11/documents/final_2016_rei_report.pdf
48+
AuthorName: EPA, 2016
49+
- Title: Greening Government Procurement - Turning Uncle Sam Into an Eco-Friendly Consumer
50+
URL: https://psmag.com/environment/greening-government-procurement-turning-uncle-sam-eco-friendly-consumer-78592/
51+
AuthorName: James Badham, 2014
52+
- Title: Environmental Impacts of Products - Policy Relevant Information and Data Challenges
53+
URL: https://publications.jrc.ec.europa.eu/repository/handle/JRC34107
54+
AuthorName: Eder P, Tukker A, Suh S
55+
- Title: Environmentally extended input-output tables and models for Europe
56+
URL: https://www.eurogypsum.org/wp-content/uploads/2015/05/N0533.pdf
57+
AuthorName: Arnold Tukker (TNO), Gjalt Huppes, Lauran van Oers, Reinout Heijungs (CML), 2006
58+
ADXCategories:
59+
- Environmental Data

datasets/rsna-screening-mammography-breast-cancer-detection.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@ Tags:
1717
- breast cancer
1818
- cancer
1919
- life sciences
20-
License: "You may access and use these de-identified imaging datasets and annotations (“the data”) for non-commercial purposes only, including academic research and education, as long as you agree to abide by the following provisions: Not to make any attempt to identify or contact any individual(s) who may be the subjects of the data. If you share or re-distribute the data in any form, include a citation to the “Brain CT Hemorrhage Dataset, Copyright RSNA, 2019” as follows: Flanders AF, et al. The RSNA Brain CT Hemorrhage Dataset [10.1148/ryai.2020190211]. Radiology: Artificial Intelligence 2020;2:3."
20+
License: "You may access and use these de-identified imaging datasets and annotations (“the data”) for non-commercial purposes only, including academic research and education, as long as you agree to abide by the following provisions: Not to make any attempt to identify or contact any individual(s) who may be the subjects of the data. If you share or re-distribute the data in any form, include a citation to the “Radiological Society of North America Screening Mammography Breast Cancer Detection (RSNA-SMBC) Dataset, November 2022” [https://doi.org/10.1148/dataset.smbc.2024]."
2121
Resources:
2222
- Description: Zip archive containing DCM and CSV files
2323
ARN: arn:aws:s3:::screening-mammography-breast

tags.yaml

Lines changed: 11 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -52,6 +52,7 @@
5252
- Caenorhabditis elegans
5353
- calcium imaging
5454
- cancer
55+
- carbon
5556
- cell biology
5657
- cell imaging
5758
- cell painting
@@ -100,7 +101,7 @@
100101
- cultural preservation
101102
- culture
102103
- cyber security
103-
- cyclone/typhoon/hurricane
104+
- cyclone typhoon hurricane
104105
- czi
105106
- Danio rerio
106107
- data assimilation
@@ -131,6 +132,7 @@
131132
- economics
132133
- ecosystems
133134
- education
135+
- EEIO
134136
- electricity
135137
- electron microscopy
136138
- electron tomography
@@ -146,6 +148,7 @@
146148
- ethereum
147149
- ethnicity
148150
- Eulerian
151+
- evapotranspiration
149152
- event camera
150153
- events
151154
- exploration
@@ -191,6 +194,7 @@
191194
- grand-challenge.org
192195
- graph
193196
- green aviation
197+
- ground water
194198
- group quarters
195199
- h5
196200
- hazard
@@ -222,6 +226,7 @@
222226
- information retrieval
223227
- infrastructure
224228
- internet
229+
- interception loss
225230
- intrusion detection
226231
- ion channels
227232
- irrigated cropland
@@ -363,6 +368,7 @@
363368
- satellite imagery
364369
- scholarly communication
365370
- schools
371+
- scope 3
366372
- seafloor
367373
- segmentation
368374
- seismology
@@ -377,6 +383,7 @@
377383
- single-cell transcriptomics
378384
- social media
379385
- socioeconomic
386+
- soil moisture
380387
- solar
381388
- source code
382389
- space biology
@@ -386,6 +393,7 @@
386393
- speech processing
387394
- speech recognition
388395
- speech synthesis
396+
- spend-based models
389397
- sports
390398
- sqlite
391399
- stac
@@ -395,6 +403,8 @@
395403
- structural birth defect
396404
- structural variation
397405
- subtitles
406+
- supply chain
407+
- surface water
398408
- survey
399409
- sustainability
400410
- synthetic aperture radar

0 commit comments

Comments
 (0)