Skip to content

cloud9international/employee-data-audit-retention-insights

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

119 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

📊 Employee Data Audit & Retention Insights

(Auditoría de Datos de Empleados y Análisis de Retención)

Start Here (Abrir Primero)

  1. Final presentation (PDF): docs/employee-retention-audit-presentation.pdf
  2. Key notebook (EDA + ETL + visuals): ETL/visualizaciones.ipynb
  3. Clean dataset used for analysis: files/raw_data_limpio.csv

Project Overview (Resumen del Proyecto)

This project is a structured data audit and exploratory analysis of employee and survey data for a fictional company (ABC Corporation), aimed at identifying key drivers of employee satisfaction and retention.

The work focuses on:

  • Intensive Exploratory Data Analysis (EDA)
  • Data cleaning and transformation (ETL)
  • Insight-driven visualizations in Python (Jupyter)

The goal was not only to uncover workforce trends, but also to evaluate potential bias and data quality issues that could impact strategic HR decision-making.

This project was developed as an academic team project within the Adalab Data Analytics & AI Bootcamp.


Objectives (Objetivos)

  • Audit and clean raw HR and survey data
  • Standardize and transform datasets for reliable analysis
  • Identify key factors linked to employee attrition
  • Visualize trends to support retention strategies
  • Apply agile teamwork and version control practices

Methodology (Metodología)

  • Exploratory Data Analysis (EDA) across multiple iterations
  • Data transformation and validation in Python
  • Custom helper functions for dataset inspection
  • Visual analysis using Matplotlib and Seaborn
  • Documentation of technical decisions and assumptions

Repository Structure (Estructura del Repositorio)

  • docs/ — Technical documentation and final project presentation (PDF)
  • ETL/ — Jupyter notebooks for EDA, data transformation, and visual analysis
  • files/ — Raw and cleaned datasets (CSV)
  • img/ — Key visualizations used in analysis
  • src/ — Custom Python helper functions
  • README.md — Project overview and insights

Key Insights (Hallazgos Clave)

  • Attrition is significantly higher in specific departments, particularly Sales and Human Resources.
  • Certain job roles show disproportionately high turnover compared to others.
  • Lower income levels are frequently associated with higher attrition rates.
  • Retention challenges are not evenly distributed across the organization, highlighting targeted areas for intervention.

Key Visualizations (Visualizaciones Clave)

Attrition Frequency by Department

(Frecuencia de rotación por departamento)

Attrition by Department


Attrition by Job Role

(Rotación por rol de trabajo)

![Attrition by Job Role](img/Rotación Vs JobRole.png)


Technologies Used (Tecnologías Utilizadas)

  • Python (Pandas, NumPy, Matplotlib, Seaborn)
  • Jupyter Notebooks
  • CSV data processing
  • Git & GitHub

Team & Credits (Equipo y Créditos)

This project was originally developed as a collaborative course project by:

Original team repository:
https://github.com/Maykaduran/employee-data-audit-retention-insights

This repository is a curated portfolio version maintained by Claudia Cervantes.


Final Notes (Notas Finales)

  • The analysis prioritizes data integrity and bias awareness
  • Transformations were designed to preserve original data meaning
  • Visual insights focus on actionable workforce trends
  • The project simulates a real-world HR data audit workflow

About

Employee dataset audit uncovering attrition patterns through data cleaning, transformation, and visualization.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors