EDA - NYC Parking Violations

Exploratory data analysis of NYC parking violations using Pandas and several visualization libraries This Jupyter notebook can also be found on Jovian

Since 2012 New York City has made data publically available at NYC Open Data Of the many data sets available is Parking violatons data from 2014 through present (2021). In addition to it being on the NYC Open data site it can also be found on kaggle. The fill data set contains 4 years of tickets with 42.3 Million rows of data.

It turns out that NYC issues over 10 Million parking tickets every year!

This project analyzes the fiscal year of 2017, which runs from July 1st, 2016 through June 30th, 2017. one year alone is nearly 2GB of data, and the notebook is intended to be run with a lot of memory. I have used Google Colab.

This project used Pandas, numpy, opendatasets as well as several python visualization packages to make insights on this data. The visualization libraries are:

Matplotlib
Seaborn
Plotly
Folium

I will be back to incorperate the other years up to and including 2021. this will involve the use of Dask for handling even larger dataframes. This page will be updated accordingly.

Best

😎 Sam

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
eda-nyc-parking-tickets.ipynb		eda-nyc-parking-tickets.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

EDA - NYC Parking Violations

About

Uh oh!

Releases

Packages

Languages

srobertsphd/EDA-nyc-parking-violations

Folders and files

Latest commit

History

Repository files navigation

EDA - NYC Parking Violations

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages