Skip to content

Latest commit

 

History

History
82 lines (66 loc) · 2.87 KB

File metadata and controls

82 lines (66 loc) · 2.87 KB

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[0.1.1] - 2026-01-01

Changed

  • Made GDAL an optional dependency (install with pip install -e ".[raster]")
  • Tests requiring GDAL array support now skip gracefully when unavailable
  • Updated documentation with GDAL installation instructions

Fixed

  • Test suite now passes in virtual environments without system GDAL
  • All 50 core tests pass, 9 GDAL-dependent tests skip cleanly

0.1.0 - 2026-01-01

Added

  • Raster Processing Pipeline

    • Complete RasterProcessor class for COG creation and H3 tiling
    • Automatic H3 resolution detection from raster pixel size
    • Support for processing by h0 regions (memory-efficient global processing)
    • COG optimization for cloud rendering (titiler-compatible)
    • Parent resolution support for hierarchical aggregation
    • Configurable value columns and nodata handling
    • CLI commands for raster processing
  • Vector Processing (Existing)

    • H3 hexagonal tiling for polygon and point datasets
    • Two-pass processing to avoid OOM with large datasets
    • Chunked processing with configurable batch sizes
    • Parent resolution support
    • Repartitioning by h0 cells for efficient querying
    • ID column auto-detection and handling
  • Kubernetes Integration

    • Job generation for parallel processing
    • Indexed jobs for chunk-based workflows
    • Resource configuration (CPU, memory, parallelism)
    • Support for h0-based regional processing
  • Storage Management

    • S3 bucket configuration and CORS setup
    • Rclone integration for multi-cloud syncing
    • Credential management
  • CLI Tools

    • cng-datasets vector - Vector processing
    • cng-datasets raster - Raster processing
    • cng-datasets k8s - Kubernetes job generation
    • cng-datasets storage - Storage management
    • cng-datasets workflow - Complete dataset workflows
  • Documentation

    • Comprehensive package README
    • Dataset-specific READMEs with examples
    • API documentation in docstrings
    • Example scripts and notebooks
    • Contributing guidelines
  • Testing

    • Unit tests for vector processing
    • Unit tests for raster processing
    • Integration tests for S3 and H3 operations
    • Mock tests for external services
    • Test fixtures and utilities

Changed

  • Updated H3 edge length values to match official h3geo.org specification
  • Improved resolution detection with informative user feedback
  • Enhanced error messages and logging throughout

Fixed

  • H3 edge length accuracy (using official values)
  • Resolution override behavior with helpful messages
  • Memory efficiency for large polygon processing