Skip to content

This project consists of two major operations i.e. cleaning the raw data and using it for exploratory data analysis. The project was made in SQL and guided by Alex- The analyst and made by me i.e. Yog Gupta. A few modifications were made to in the beginning steps for data project to take up less space.

Notifications You must be signed in to change notification settings

CoderB10/Data-cleaning-and-more

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

Layout of the project

Snippet of data taken for cleaning and analysis snippet

The project has two stages:

  1. DATA CLEANING
  2. EXPLORATORY DATA ANALYSIS

Data cleaning tasks:

  1. Delete duplicates.
  2. Remove extra spaces from columns that might have taken place while doing data entry.
  3. Treat null values and blanks.
  4. Remove entries with null values in the focus columns. e.g. here the database was about the count and percentage of laid off employees by different companies(their country, industry, funding stage mentioned as well) at particular dates.

Exploratory Data Analysis (EDA):

  1. Finding the companies with highest lay offs by count as well as percentage.
  2. Finding the year with highest lay offs by count.
  3. Finding the stage at which the companies laid off its employess from max to min.
  4. Rolling total of laid off employees on each month and year from the beginning of the records
  5. Ranking the companies with highest to lowest layoffs amount
  6. Finding Top 5 companies ranked in order of most lay offs made over the course of database

About

This project consists of two major operations i.e. cleaning the raw data and using it for exploratory data analysis. The project was made in SQL and guided by Alex- The analyst and made by me i.e. Yog Gupta. A few modifications were made to in the beginning steps for data project to take up less space.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published