
- spend at least one hour exploring a dataset I have never seen before
- publish the notebooks on GitHub
- write and publish a brief summary of what I learned each week

I won't make it a rule, but I'm going to try my best not to repeat categories. This means I will try to pick datasets from a variety of categories (e.g., politics, environment, biology, media). This should not only challenge me but also make it more interesting for all of us.

Although it is called "dataset of the week", I don't plan to do this every week. I have too many other side projects and family/friend/work responsibilities to make that promise. But I will try to keep it weekly-ish for at least the first month or two.
- Date: 24 October 2019
- Notebook: 241019-ecl-nursing-homes.ipynb
- Blog post: Nursing Homes in the U.S.
- Primary Data Source: Nursing Home Inspect (ProPublica)
- Category: elderly care
- Date: 21 November 2019
- Notebook: 211119-ecl-design-census.ipynb
- Blog post: Design Census: Designing a data dictionary
- Primary Data Source: Design Census 2019
- Category: design
- The States Network [commerce, networks]
- Ebola Outbreak in the DRC [public health]
- USDA's Nutrient Database [food]

If you are in need of some datasets to explore, I highly recommend these resources: