This project analyzes the relationship between traffic congestion and collisions in Chicago using big data techniques. Leveraging large public datasets and distributed computing frameworks, we investigate how congestion levels relate to the frequency and severity of collisions across different areas of the city.
- Traffic Crash Data: https://data.cityofchicago.org/Transportation/Traffic-Crashes-Crashes/85ca-t3if/about_data
- Traffic Congestion Data: https://data.cityofchicago.org/Transportation/Chicago-Traffic-Tracker-Historical-Congestion-Esti/kf7e-cur8/about_data
- Data Processing: We use PySpark, a distributed computing framework for big data processing, to handle and transform the large-scale traffic and collision datasets. This includes data cleaning, joining the datasets, and performing relevant aggregations and transformations.
- Exploratory Data Analysis: We conduct exploratory data analysis (EDA) to gain insights into the datasets, identify patterns, and visualize key variables related to traffic congestion and collisions.
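A typical first EDA cut is the distribution of crashes by hour of day. The sketch below uses pandas on a toy sample; the column names (`crash_hour`, `injuries`) are illustrative assumptions, not the exact dataset schema.

```python
import pandas as pd

# Toy crash records standing in for the full dataset.
crashes = pd.DataFrame({
    "crash_hour": [8, 8, 17, 17, 17, 23],
    "injuries":   [0, 1, 2, 0, 1, 0],
})

# Crash counts and total injuries per hour of day: a quick way to spot
# rush-hour peaks before any joins against congestion data.
by_hour = crashes.groupby("crash_hour").agg(
    crash_count=("injuries", "size"),
    total_injuries=("injuries", "sum"),
)
print(by_hour)
```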
- Spatial Analysis: By leveraging geospatial libraries such as Geopandas, we analyze the spatial distribution of traffic congestion and collisions across different neighborhoods and regions of Chicago.
- PySpark: Distributed computing framework for big data processing.
- Azure Virtual Machine: Cloud computing platform for running PySpark jobs and managing data.
- MongoDB: NoSQL database for storing and querying data.
- Jupyter Notebook: Interactive environment for data analysis and visualization.
- Geopandas: Python library for working with geospatial data.
- John Olusetire
- Timothy Obuadey
- Anand Seshadri