Skip to content

Unclear how to set parquet compression #74

@scottyhq

Description

@scottyhq

The example notebook appears incorrect in that there is a comment that it writes 'compressed stac-geoparquet' by default
https://stac-utils.github.io/stacrs/v0.5.9-beta.0/example/

import stacrs

items = await stacrs.search(
    "https://stac.eoapi.dev",
    collections="openaerialmap",
    bbox=[-125, 25, -67, 49],  # CONUS
    sortby="-properties.datetime",
    max_items=1000,
)
await stacrs.write("items.parquet", items)  # compressed stac-geoparquet

But as far as I can tell no compression is used and I'm unsure how to change it (gpq describe items.parquet --format json)

For example geopandas.to_parquet gives the following options compression = {‘snappy’, ‘gzip’, ‘brotli’, None}, default ‘snappy’

Metadata

Metadata

Assignees

Labels

documentationImprovements or additions to documentation

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions