Skip to content

Conversation

wietzesuijker
Copy link
Collaborator

Introduces automated tools to measure tile generation latency and validate GeoZarr data quality.

Changes

  • benchmark_tile_performance.py measures actual user-facing tile latency
  • validate_geozarr.py checks metadata, chunks, CRS, compression, pyramids
  • benchmark_comparison.py compares multiple datasets with statistical tests
  • Argo workflow for automated benchmarking in CI/CD
  • Metrics include mean/p95 latency, success rate, multi-scale performance

Usage

# Benchmark item
./scripts/benchmark_tile_performance.py \
  --collection sentinel-2-l2a \
  --item-id S2A_MSIL2A_20250516_T32TQM \
  --num-tiles 20 --zoom-levels 10

# Validate GeoZarr
./scripts/validate_geozarr.py \
  s3://eopf-devseed/geozarr/item.zarr \
  --check-metadata --check-chunks

Testing

uv run pytest tests/unit/ -v
kubectl create -f workflows/run-benchmark-test.yaml

Impact

Enables data-driven optimization decisions and regression detection before deployment.

@wietzesuijker wietzesuijker force-pushed the feat/collection-registry branch from 077445d to 7b3debe Compare October 9, 2025 03:46
@wietzesuijker wietzesuijker force-pushed the feat/performance-validation branch from fcccde0 to dea64b2 Compare October 9, 2025 03:46
Add comprehensive performance measurement and validation:

- Automated validation workflow task (validate_geozarr.py)
- Performance benchmarking tools (benchmark_comparison.py, benchmark_tile_performance.py)
- Production metrics from 9 operational workflows (8.6min avg, 75% success)
- Ecosystem compatibility validation (zarr-python, xarray, stac-geoparquet)
- User guide for adding new collections (docs/ADDING_COLLECTIONS.md)
- Performance report with operational metrics (docs/PERFORMANCE_REPORT.md)

Production validation shows pipeline ready for deployment with
validated performance and ecosystem compatibility.
@wietzesuijker wietzesuijker force-pushed the feat/collection-registry branch from 7b3debe to eef0652 Compare October 9, 2025 04:15
@wietzesuijker wietzesuijker force-pushed the feat/performance-validation branch from dea64b2 to 269e0b9 Compare October 9, 2025 04:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant