fix(data)!: use integer for YAML file version #87

Merged: bassosimone merged 84 commits into main from refactor/ghcache_sync_integration on Dec 25, 2025

Conversation

@bassosimone
Collaborator

@bassosimone bassosimone commented Dec 22, 2025

This diff changes the YAML file version to be an integer. While there, we also publish data for 2025-07/country.
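
As a rough illustration of what an integer file version implies for readers of the file, the sketch below validates the version of an already-parsed YAML document. The key name `version` and the helper `parse_file_version` are assumptions made for this example; they are not taken from the PR.

```python
from typing import Any, Mapping


def parse_file_version(doc: Mapping[str, Any]) -> int:
    """Return the integer version of an already-parsed YAML document.

    The key name "version" is an assumption made for this sketch.
    """
    version = doc.get("version")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(version, bool) or not isinstance(version, int):
        raise ValueError(f"version must be an integer, got {version!r}")
    return version
```

With a check like this, a document that still carries a quoted version such as `"1"` fails loudly instead of being silently accepted, which is consistent with the breaking-change marker (`!`) in the PR title.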

@codecov-commenter

codecov-commenter commented Dec 22, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 99.17%. Comparing base (691321b) to head (0ededce).

Additional details and impacted files
@@           Coverage Diff           @@
##             main      #87   +/-   ##
=======================================
  Coverage   99.17%   99.17%           
=======================================
  Files          37       37           
  Lines        2295     2295           
=======================================
  Hits         2276     2276           
  Misses         19       19           


bassosimone added a commit that referenced this pull request Dec 22, 2025
The ghremote package implements `pipeline.RemoteCache` using
GitHub releases to publish files.

The `pipeline.RemoteCache` facility was introduced in
#85.

The implementation is at `ghremote.IQBGitHubRemoteCache`.

We derived the implementation from already-existing code living
inside the `./data/ghcache.py` file.

Using GitHub releases to publish datasets is meant as an interim
solution. We also plan to implement a better approach for datasets,
possibly based on GCS buckets. To get there, we first need to refactor
the existing code and create the facilities that make a GCS-based
approach possible. Hence this diff, which lets us move gradually
towards GCS-based publishing of datasets.

This diff adds the basic functionality and tests, along with minor
changes and tweaks in the rest of the codebase (Boy Scout rule).

We integrated the code added by this PR and verified it is working
as intended in #87.
bassosimone added a commit that referenced this pull request Dec 22, 2025
The `ghremote` package implements `pipeline.RemoteCache` using GitHub
releases to publish files. The `pipeline.RemoteCache` facility was
introduced in #85. The implementation
is at `ghremote.IQBGitHubRemoteCache`. We derived the implementation
from already-existing code living inside the `./data/ghcache.py` file.
We're also adding `dacite` as a dependency, to easily convert parsed
JSONs to dataclasses.

Using GitHub releases to publish datasets is meant as an interim
solution. We also plan to implement a better approach for datasets,
possibly based on GCS buckets. To get there, we first need to refactor
the existing code and create the facilities that make a GCS-based
approach possible. Hence this diff.

This diff adds the basic functionality and tests, along with minor
changes and tweaks in the rest of the codebase (Boy Scout rule). We
integrated the code added by this PR and verified it is working as
intended in #87.
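
The idea of putting publishing backends behind one interface, so that a GCS-based backend can later replace the GitHub-releases one, can be sketched as follows. This is only an illustration: `RemoteCacheSketch`, `InMemoryRemoteCache`, `ensure_local`, and the `fetch` method are hypothetical names invented for this example and do not reflect the actual `pipeline.RemoteCache` or `ghremote.IQBGitHubRemoteCache` signatures.

```python
from pathlib import Path
from typing import Dict, Protocol


class RemoteCacheSketch(Protocol):
    """Hypothetical stand-in for the role played by pipeline.RemoteCache."""

    def fetch(self, name: str, dest: Path) -> bool:
        """Copy the named file into dest; return False when unavailable."""
        ...


class InMemoryRemoteCache:
    """Toy backend; a GitHub-releases or GCS backend would do network I/O."""

    def __init__(self, files: Dict[str, bytes]) -> None:
        self._files = files

    def fetch(self, name: str, dest: Path) -> bool:
        data = self._files.get(name)
        if data is None:
            return False
        dest.parent.mkdir(parents=True, exist_ok=True)
        dest.write_bytes(data)
        return True


def ensure_local(cache: RemoteCacheSketch, name: str, dest: Path) -> bool:
    """Use the local copy when present; otherwise ask the remote cache."""
    if dest.exists():
        return True
    return cache.fetch(name, dest)
```

Because `ensure_local` depends only on the protocol, swapping one backend for another would not require changes in the calling code, which is the property the refactoring described above appears to aim for.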
@bassosimone bassosimone changed the title wip: refactor ghremote wip: pulling more data and experimenting Dec 24, 2025
@bassosimone bassosimone changed the title wip: pulling more data and experimenting fix(data)!: use integer for YAML version Dec 25, 2025
@bassosimone bassosimone changed the title fix(data)!: use integer for YAML version fix(data)!: use integer for YAML file version Dec 25, 2025
@bassosimone bassosimone marked this pull request as ready for review December 25, 2025 11:45
@bassosimone bassosimone merged commit 1d6efc4 into main Dec 25, 2025
5 checks passed
@bassosimone bassosimone deleted the refactor/ghcache_sync_integration branch December 25, 2025 11:46

2 participants