
Adding Converters for mlcroissant[geo] #994

Open
HarshShinde0 wants to merge 6 commits into mlcommons:main from HarshShinde0:harsh/geocroissant


@HarshShinde0

Introduce mlcroissant Converters for GeoCroissant

Add NASA UMM-G and STAC (SpatioTemporal Asset Catalog) converters to transform geospatial metadata into Croissant format. Includes demo notebooks for both converters.
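To illustrate the kind of mapping a UMM-G converter performs, here is a minimal sketch. It is not the code in this PR: the function name `umm_g_to_croissant_sketch` is hypothetical, the output uses plain schema.org properties (`temporalCoverage`, `spatialCoverage`) rather than any GeoCroissant-specific vocabulary, and it assumes the granule record carries the usual UMM-G fields (`GranuleUR`, `TemporalExtent.RangeDateTime`, `BoundingRectangles`, `RelatedUrls`).

```python
from typing import Any


def umm_g_to_croissant_sketch(granule: dict[str, Any]) -> dict[str, Any]:
    """Hypothetical helper: map a UMM-G granule dict to a Croissant-style dict."""
    record: dict[str, Any] = {"@type": "Dataset", "name": granule["GranuleUR"]}

    # Temporal extent -> ISO 8601 interval; ".." marks an open-ended range.
    temporal = granule.get("TemporalExtent", {}).get("RangeDateTime", {})
    begin = temporal.get("BeginningDateTime")
    if begin:
        end = temporal.get("EndingDateTime") or ".."
        record["temporalCoverage"] = f"{begin}/{end}"

    # First bounding rectangle -> schema.org GeoShape box ("south west north east").
    rects = (
        granule.get("SpatialExtent", {})
        .get("HorizontalSpatialDomain", {})
        .get("Geometry", {})
        .get("BoundingRectangles", [])
    )
    if rects:
        r = rects[0]
        box = (
            f"{r['SouthBoundingCoordinate']} {r['WestBoundingCoordinate']} "
            f"{r['NorthBoundingCoordinate']} {r['EastBoundingCoordinate']}"
        )
        record["spatialCoverage"] = {
            "@type": "Place",
            "geo": {"@type": "GeoShape", "box": box},
        }

    # Data download links -> Croissant FileObject entries.
    record["distribution"] = [
        {
            "@type": "cr:FileObject",
            "name": url["URL"].rsplit("/", 1)[-1],
            "contentUrl": url["URL"],
        }
        for url in granule.get("RelatedUrls", [])
        if url.get("Type") == "GET DATA"
    ]
    return record
```

Only the first bounding rectangle is used here for brevity; a real converter would likely need to handle polygons, multiple rectangles, and single-date temporal extents as well.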

Cc: @omshinde

HarshShinde0 and others added 2 commits November 22, 2025 21:50
- Add NASA UMM converter for Earth observation datasets
- Add STAC (SpatioTemporal Asset Catalog) converter
- Include Jupyter notebook examples for both converters
- Add geo dependencies: geopandas, shapely, pyproj, rasterio, pystac, earthaccess
- Update pyproject.toml with geo extras for geospatial functionality
- fix: black lint for croissant/editor/views/record_sets.py
Copilot AI review requested due to automatic review settings January 7, 2026 07:27
@HarshShinde0 HarshShinde0 requested a review from a team as a code owner January 7, 2026 07:27
@github-actions

github-actions bot commented Jan 7, 2026

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅


Copilot AI left a comment


Pull request overview

This PR introduces GeoCroissant converters for NASA UMM-G and STAC metadata formats, enabling transformation of geospatial metadata into Croissant format. The implementation includes comprehensive conversion logic, dependency management, and demo notebooks.

  • Adds NASA UMM-G-to-GeoCroissant and STAC-to-GeoCroissant converters, with support for STAC collections, items, catalogs, and feature collections
  • Introduces mlcroissant[geo] extra with required geospatial dependencies (geopandas, shapely, pyproj, rasterio, pystac, earthaccess)
  • Includes demo notebooks demonstrating converter usage and validation
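As a rough illustration of what a STAC-to-Croissant conversion involves, the sketch below maps a STAC Collection dict onto a Croissant-style JSON-LD dict. It is an assumption-laden example, not the converter added in this PR: `stac_collection_to_croissant` is a hypothetical name, the `@context` is heavily simplified, and spatial and temporal extents are expressed with generic schema.org properties instead of GeoCroissant terms.

```python
from typing import Any


def stac_collection_to_croissant(collection: dict[str, Any]) -> dict[str, Any]:
    """Hypothetical helper: map a STAC Collection dict to a Croissant-style dict."""
    dataset: dict[str, Any] = {
        # Simplified context; the full Croissant context defines many more terms.
        "@context": {
            "@vocab": "https://schema.org/",
            "cr": "http://mlcommons.org/croissant/",
        },
        "@type": "Dataset",
        "name": collection["id"],
        "description": collection.get("description", ""),
        "license": collection.get("license", ""),
    }

    extent = collection.get("extent", {})

    # STAC 2D bbox order is [west, south, east, north]; 3D boxes are skipped here.
    bboxes = extent.get("spatial", {}).get("bbox", [])
    if bboxes and len(bboxes[0]) == 4:
        west, south, east, north = bboxes[0]
        dataset["spatialCoverage"] = {
            "@type": "Place",
            "geo": {"@type": "GeoShape", "box": f"{south} {west} {north} {east}"},
        }

    # STAC uses null for open-ended intervals; ".." marks them in ISO 8601 form.
    intervals = extent.get("temporal", {}).get("interval", [])
    if intervals:
        start, end = intervals[0]
        dataset["temporalCoverage"] = f"{start or '..'}/{end or '..'}"

    # Each STAC asset becomes one Croissant FileObject.
    dataset["distribution"] = [
        {
            "@type": "cr:FileObject",
            "name": key,
            "contentUrl": asset["href"],
            "encodingFormat": asset.get("type", "application/octet-stream"),
        }
        for key, asset in collection.get("assets", {}).items()
    ]
    return dataset
```

Items, catalogs, and feature collections would each need their own handling (for example, walking `links` for child items); this sketch covers only the collection-level metadata.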

Reviewed changes

Copilot reviewed 5 out of 6 changed files in this pull request and generated no comments.

| File | Description |
| --- | --- |
| python/mlcroissant/pyproject.toml | Adds geo extra with geospatial dependencies and updates dev dependencies |
| python/mlcroissant/mlcroissant/_src/geo/stac_converters.py | Implements STAC to GeoCroissant converter with support for multiple STAC types |
| python/mlcroissant/mlcroissant/_src/geo/nasa_umm_converter.py | Implements NASA UMM-G to GeoCroissant converter with metadata extraction |
| python/mlcroissant/mlcroissant/_src/geo/demo/stac_conversion.ipynb | Provides demo notebook for STAC converter with example conversions |
| python/mlcroissant/mlcroissant/_src/geo/__init__.py | Exports converter functions for public API |

