
A professional-grade Python tool for automated CSV data cleaning and normalization. Built with Pandas to handle empty rows, whitespace trimming, and data type conversion for cleaner datasets.


emanuele-tarchi-it/python-csv-data-cleaner


Python CSV Data Cleaner


An automated utility designed to transform "messy" CSV files into structured, clean datasets. Ideal for data analysts and businesses that need to prep data for databases or BI tools.

✨ Features

  • Empty Row Removal: Automatically detects and drops rows that contain no data.
  • String Normalization: Trims leading and trailing whitespace from text values.
  • Smart Type Conversion: Converts columns to numeric types where possible for better analysis.
  • Safe Export: Generates a new cleaned file without modifying the original source.
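
The first three features can be sketched with Pandas as follows. This is a minimal illustration, not the actual contents of csv_cleaner.py: the function name `clean_dataframe` is hypothetical, and treating cells that are blank after trimming as missing values is an assumption made here so that whitespace-only rows count as empty.

```python
import pandas as pd


def clean_dataframe(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: return a cleaned copy and leave `df` untouched."""
    cleaned = df.copy()

    # String normalization: trim leading/trailing whitespace on text cells.
    # Assumption: a cell that is blank after trimming is treated as missing.
    for col in cleaned.columns:
        cleaned[col] = cleaned[col].map(
            lambda v: (v.strip() or None) if isinstance(v, str) else v
        )

    # Empty row removal: drop rows in which every column is missing.
    cleaned = cleaned.dropna(how="all")

    # Type conversion: convert a column only if all of its values parse as numbers.
    for col in cleaned.columns:
        try:
            cleaned[col] = pd.to_numeric(cleaned[col])
        except (ValueError, TypeError):
            pass  # keep the column as text

    return cleaned.reset_index(drop=True)


sample = pd.DataFrame(
    {"name": ["  Alice ", None, " Bob"], "age": [" 30", None, "41 "]}
)
result = clean_dataframe(sample)
```

In this sketch a column is converted only when every value parses as a number, so a mixed column such as `name` stays as text rather than being partially coerced.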

🛠 Prerequisites

This tool uses the Pandas library. You can install it via terminal:

pip install pandas

🚀 How to Use

  1. Clone the repository.
  2. Place your "dirty" CSV in the project folder.
  3. Open csv_cleaner.py and update the input_file name in the main block.
  4. Run: python csv_cleaner.py
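
For orientation, a main block of the kind these steps describe might look like the sketch below. The real csv_cleaner.py may be organized differently: only `input_file` comes from the steps above, while `cleaned_path`, `run`, the `_cleaned` suffix, and the file name `dirty_data.csv` are assumptions for illustration. Writing to a separate path is what keeps the original file unmodified.

```python
from pathlib import Path

import pandas as pd


def cleaned_path(input_file: str) -> Path:
    """Hypothetical naming rule: data.csv -> data_cleaned.csv, same folder."""
    src = Path(input_file)
    return src.with_name(f"{src.stem}_cleaned{src.suffix}")


def run(input_file: str) -> Path:
    """Read the CSV, drop fully empty rows, and write the result to a new file."""
    df = pd.read_csv(input_file)
    df = df.dropna(how="all")
    out = cleaned_path(input_file)
    df.to_csv(out, index=False)  # the source file is only read, never written
    return out


if __name__ == "__main__":
    input_file = "dirty_data.csv"  # hypothetical name; replace with your own file
    if Path(input_file).exists():
        print(f"Cleaned file written to {run(input_file)}")
    else:
        print(f"{input_file} not found in the project folder")
```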

📄 License

This project is licensed under the MIT License. See the LICENSE file for details.


Developed by Emanuele Tarchi | Automation Specialist Portfolio
