Part of this project is getting experience with automated data ingestion. Doing so is more interesting with data that changes regularly.
The following types of datasets could be a good fit:
- Crime
- Transportation
  - Transit
  - Traffic
  - Bike share
- Energy
- Climate
  - Air quality
  - Atmosphere
  - Fire
  - National Oceanic and Atmospheric Administration (NOAA)
  - NASA Land, Atmosphere Near real-time Capability for Earth observation (LANCE)
  - Surface and ocean temperatures
  - Water quality
  - Weather
- Public facilities
  - Jails/prisons
  - Homeless shelters
- Finance
  - Stocks
  - Indexes
- Polls
- "daily" datasets, via the Socrata Discovery API
  - Transportation data:
  - NYC's recently updated datasets
  - Could do a similar search in other Socrata portals
- Datasets using git-scraping
- Various open data portals
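
As a rough illustration of the Socrata Discovery API search mentioned above, the sketch below builds a catalog query for one portal and lists the matching datasets, most recently updated first. The helper names (`build_search_url`, `summarize`) are hypothetical, and the response fields it reads (`results`, `resource`, `name`, `id`, `data_updated_at`, `updatedAt`) are assumptions about the catalog response shape, so check them against the API documentation before relying on them.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Socrata Discovery API catalog endpoint (US region).
CATALOG_URL = "https://api.us.socrata.com/api/catalog/v1"


def build_search_url(domain, query, limit=10):
    """Build a catalog search URL for datasets on a single Socrata portal."""
    params = {
        "domains": domain,   # portal to search, e.g. data.cityofnewyork.us
        "q": query,          # free-text search term
        "only": "datasets",  # skip charts, maps, and other asset types
        "limit": limit,
    }
    return f"{CATALOG_URL}?{urlencode(params)}"


def summarize(payload):
    """Return (name, id, last_updated) tuples, most recently updated first.

    Field names are assumptions about the response shape; missing
    timestamps are treated as empty strings so they sort last.
    """
    rows = []
    for result in payload.get("results", []):
        resource = result.get("resource", {})
        updated = resource.get("data_updated_at") or resource.get("updatedAt") or ""
        rows.append((resource.get("name"), resource.get("id"), updated))
    # ISO 8601 timestamps in a consistent format sort correctly as strings.
    return sorted(rows, key=lambda row: row[2], reverse=True)


if __name__ == "__main__":
    url = build_search_url("data.cityofnewyork.us", "daily")
    with urlopen(url, timeout=30) as response:
        payload = json.load(response)
    for name, dataset_id, updated in summarize(payload):
        print(updated, dataset_id, name)
```

Swapping the `domain` argument for another portal's hostname would run the same search there. Sorting happens client-side here to avoid depending on any server-side ordering parameter.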