feat: add `stac:collections` to spec #89

gadomski · 2025-03-06T13:35:00Z

Some words to implement #88. Opening as a draft while we resolve a couple of issues:

We don't have versioning on the stac-geoparquet spec ... we probably should? Maybe just a note in the README and a tag?
If we throw a version on, what do we pick? While I wrote v1.1 in this PR, I could be talking into a pre-1 version as well.
In general, it's a little awkward to have the spec coupled to the code here ... but I'm not sure the easy path to resolve that in a non-confusing way for the community.

TomAugspurger

We don't have versioning on the stac-geoparquet spec ... we probably should? Maybe just a note in the README and a tag?

Whoops. What do you think about putting a stac-geoparquet version identifier in the parquet file metadata? Maybe discuss that separately.

what do we pick

1.1 seems fine.

In general, it's a little awkward to have the spec coupled to the code here

Happy to move stuff (code or spec) as desired.

TomAugspurger · 2025-03-06T22:50:40Z

spec/stac-geoparquet-spec.md


 - Must be valid GeoParquet, with proper metadata. Ideally the geometry types are defined and as narrow as possible.
- Strongly recommend to only have one GeoParquet per STAC 'Collection'. Not doing this will lead to an expanded GeoParquet schema (the union of all the schemas of the collection) with lots of empty data
+- Recommend to only have one GeoParquet per STAC 'Collection'. Not doing this will lead to an expanded GeoParquet schema (the union of all the schemas of the collection) with lots of empty data


Maybe rephrase this as strongly recommending that the records be somewhat uniform? That gets to the core if the issue (avoiding a bloated schema, lean on the strengths of parquet), and whether this comes from one or many collections is secondary. So maybe something like

Strongly recommend that the records be mostly uniform, either because they came from a single STAC collection or multiple STAC collections whose items have similar fields. Apache Parquet is a columnar format, and is most efficient and convenient for users when all the records have the same fields. Not doing this will lead to an expanded GeoParquet schema (the union of all the schemas of the collection) with lots of empty data.

And does it make sense to mention the fields extension here?

TomAugspurger · 2025-03-06T22:55:53Z

spec/stac-geoparquet-spec.md

+
+```json
+{
+    "stac:collections": "{\"collection-a\":{\"id\":\"collection-a\",...}}"


I've confused myself, but do we need the escaping here? I guess the text does say that the value of stac:collections is a JSON string, so I guess this does look correct.

This updates the file metadata to 1. Add a version (currently 1.0) 2. Add `stac:collections` 3. Deprecate `stac:collection` 4. Add a jsonschema file Supercedes stac-utils#89

feat: add stac:collections to spec

1d96fc5

gadomski requested review from TomAugspurger and m-mohr March 6, 2025 13:35

gadomski linked an issue Mar 6, 2025 that may be closed by this pull request

[RFC] Multiple collections in metadata #88

Closed

TomAugspurger reviewed Mar 6, 2025

View reviewed changes

gadomski mentioned this pull request Mar 10, 2025

Include stac-geoparquet version in metadata stac-utils/rustac#676

Closed

TomAugspurger mentioned this pull request Mar 15, 2025

Metadata spec should be versioned #94

Closed

TomAugspurger mentioned this pull request May 2, 2025

stac-geoparquet metadata updates #98

Merged

m-mohr removed their request for review June 22, 2025 10:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add `stac:collections` to spec #89

feat: add `stac:collections` to spec #89

Uh oh!

gadomski commented Mar 6, 2025

Uh oh!

TomAugspurger left a comment

Uh oh!

TomAugspurger Mar 6, 2025

Uh oh!

TomAugspurger Mar 6, 2025

Uh oh!

TomAugspurger Mar 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: add stac:collections to spec #89

Are you sure you want to change the base?

feat: add stac:collections to spec #89

Uh oh!

Conversation

gadomski commented Mar 6, 2025

Uh oh!

TomAugspurger left a comment

Choose a reason for hiding this comment

Uh oh!

TomAugspurger Mar 6, 2025

Choose a reason for hiding this comment

Uh oh!

TomAugspurger Mar 6, 2025

Choose a reason for hiding this comment

Uh oh!

TomAugspurger Mar 6, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: add `stac:collections` to spec #89

feat: add `stac:collections` to spec #89