Preserve raw age data and document original sources#298
Open
vahid-ahmadi wants to merge 1 commit intomainfrom
Open
Preserve raw age data and document original sources#298vahid-ahmadi wants to merge 1 commit intomainfrom
vahid-ahmadi wants to merge 1 commit intomainfrom
Conversation
The constituency fill_missing_age_demographics.py was reading and overwriting age.csv in place, losing the original data. Changed to read from raw_age.csv (matching the LA script pattern). Updated both READMEs to document the raw/processed distinction and original data sources. Closes #71. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
fill_missing_age_demographics.pyto read fromraw_age.csvinstead ofage.csv, matching the existing LA script pattern. This prevents the script from overwriting the original data.Note: the raw data files (
raw_age.csv) were never committed to the repo — the currentage.csvfiles are already processed outputs. This PR ensures the pipeline is correct going forward when someone re-downloads fresh data.Closes #71.
Test plan
fill_missing_age_demographics.pyreads fromraw_age.csvin both directories🤖 Generated with Claude Code