@nikhilwoodruff nikhilwoodruff commented Aug 1, 2025

The dataset upload was failing silently when files were missing, so available datasets weren't getting deployed. This changes the upload script to warn about missing files but continue with the ones that exist.
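As a rough illustration of the warn-and-continue behaviour described above, here is a minimal sketch. It is not the actual upload script: the names split_existing and files_to_upload and the strict flag are hypothetical, and the real upload call is left out. The strict option is included only because the commit list for this PR mentions switching to an error instead of a warning.

```python
import warnings
from pathlib import Path


def split_existing(paths):
    """Partition candidate paths into (existing, missing) lists."""
    existing, missing = [], []
    for path in map(Path, paths):
        (existing if path.is_file() else missing).append(path)
    return existing, missing


def files_to_upload(paths, strict=False):
    """Return the files that exist, reporting any that are missing.

    Hypothetical helper: with strict=False missing files produce a warning
    and the rest are returned; with strict=True a FileNotFoundError is raised.
    """
    existing, missing = split_existing(paths)
    if missing:
        message = "Missing dataset files: " + ", ".join(str(p) for p in missing)
        if strict:
            raise FileNotFoundError(message)
        warnings.warn(message)
    return existing
```

Either way, a missing file is reported rather than silently dropped; the only choice is whether the remaining files are still uploaded.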

The pooled CPS wasn't being published because of a bug at policyengine_us_data/datasets/cps/cps.py:19. The original test_lite = os.environ.get("TEST_LITE") assigned the string "false" when TEST_LITE=false, and any non-empty string is truthy in Python. As a result, test lite mode was always enabled whenever the variable was set, which prevented full dataset generation. Fixed in commit 909c912.
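To make the pitfall concrete, here is a small sketch of one way to parse a boolean environment variable. It is not necessarily the change made in commit 909c912; the env_flag helper, the TRUTHY set, and the environ parameter are assumptions made for illustration.

```python
import os

# Assumed set of values treated as "on"; anything else counts as off.
TRUTHY = {"1", "true", "yes", "on"}


def env_flag(name, default=False, environ=None):
    """Read an environment variable as a boolean (hypothetical helper)."""
    environ = os.environ if environ is None else environ
    raw = environ.get(name)
    if raw is None:
        return default
    return raw.strip().lower() in TRUTHY


# The pitfall: the raw string "false" is non-empty, so it is truthy.
assert bool("false") is True

# Parsing the value explicitly gives the intended result.
assert env_flag("TEST_LITE", environ={"TEST_LITE": "false"}) is False
assert env_flag("TEST_LITE", environ={"TEST_LITE": "true"}) is True
```

Passing environ explicitly keeps the helper easy to test without touching the real process environment.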

Fixes #415

Changes the upload process to continue with available files rather than failing completely when any dataset file is missing. This allows successful deployment while making missing files visible through warning messages, preventing the silent omission of datasets like the pooled 3-year CPS.

@nikhilwoodruff nikhilwoodruff self-assigned this Aug 1, 2025
@nikhilwoodruff nikhilwoodruff merged commit f8292d7 into main Aug 1, 2025
6 checks passed
@nikhilwoodruff nikhilwoodruff deleted the fix-pooled-cps-upload branch August 4, 2025 12:49
juaristi22 pushed a commit that referenced this pull request Aug 8, 2025
* Fix dataset upload to warn instead of fail on missing files


* Error instead of warning

* Fix bug causing test lite to be true

* Versioning

Successfully merging this pull request may close these issues.

Pooled 3-year CPS dataset not being published
