ckan_geoconnex_bulk_runner_demo_medium.mp4
Status: This codebase is currently a work in progress and more documentation is planned.
The ckan_geoconnex_bulk_runner codebase is part of a multi-service infrastructure to sync water data hubs using CKAN to the Geoconnex knowledge graph.
- geoconnex_utils: Helper functions used throughout the ckan_geoconnex_bulk_runner project including JSON-LD construction and JSON schema validation.
- geoconnex_release: Compatible CKAN datasets and vector geospatial features for all connected water data hubs are uploaded to a
ckan-geoconnex-web-resources.jsonlfile in the latest GitHub release. - bulk_loader: Requests and outputs the latest JSONL file from the latest GitHub release. This is ran as a Docker container by Geoconnex on a periodic frequency to upload all water data hub web resources to the Geoconnex knowledge graph following the Geoconnex bulk contribution specification.
- ckan_geoconnex_bulk_runner_py: Python library intended for usage by the ckanext-gztr and DataPusher+ CKAN extensions.
This runner is expected to be implemented for a water data hub with the relevant fields and/or ckanext-gztr (not open-source yet) and/or DataPusher+ enabled. For questions reach out to datHere, Center for Geospatial Solutions, or add an issue/discussion.