This project automates Exploratory Data Analysis (EDA) by combining Pandas, Seaborn, and Matplotlib with the Google Gemini LLM for AI-driven insights.
It generates:
- Summary statistics
- Missing values report
- Visualizations (histograms, heatmaps, etc.)
- AI-powered insights from Gemini
- Downloadable PDF report
- Upload a CSV file and instantly get an EDA Report
- Structured summary + missing values in tabular format
- Automated visualizations for numeric features
- AI insights powered by Gemini
- Export results as a PDF report
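The summary and missing-values tables listed above could be produced along the following lines. This is a minimal sketch, not the code in `app.py`: the function name `summarize` and the column labels `missing_count` and `missing_pct` are assumptions made for illustration.

```python
import pandas as pd


def summarize(df: pd.DataFrame) -> tuple[pd.DataFrame, pd.DataFrame]:
    """Return (summary statistics, missing-values report) for a DataFrame."""
    # describe() covers numeric columns by default; transpose so that each
    # dataset column becomes one row of the summary table.
    summary = df.describe().T

    # Count missing cells per column and express them as a percentage of rows.
    missing_count = df.isna().sum()
    missing = pd.DataFrame({
        "missing_count": missing_count,
        "missing_pct": (missing_count / len(df) * 100).round(2),
    })
    return summary, missing


if __name__ == "__main__":
    # Tiny made-up dataset, only to show the shape of the two tables.
    demo = pd.DataFrame({
        "age": [25, 32, None, 41],
        "income": [50000, 64000, 58000, None],
        "city": ["Pune", None, "Delhi", "Mumbai"],
    })
    summary, missing = summarize(demo)
    print(summary)
    print(missing)
```

Both return values are ordinary DataFrames, so they can be shown as tables in the UI or written into a report.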
LLM-Powered-EDA/
├── app.py # Main Gradio app
├── requirements.txt # Dependencies
└── README.md # Project description
- Clone the repository:
    git clone https://github.com/your-username/LLM-Powered-EDA.git
    cd LLM-Powered-EDA
- Install dependencies:
    pip install -r requirements.txt
- Set up your Gemini API key:
    export GOOGLE_API_KEY="your_api_key_here"   # Mac/Linux
    set GOOGLE_API_KEY=your_api_key_here        # Windows Command Prompt
    $env:GOOGLE_API_KEY="your_api_key_here"     # Windows PowerShell
- Run the app:
    python app.py
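Because the app depends on the `GOOGLE_API_KEY` environment variable, it can help to fail early with a readable message when the variable is missing. The helper below is a hypothetical sketch of such a check; `get_api_key` is an illustrative name and is not taken from `app.py`.

```python
import os
from collections.abc import Mapping


def get_api_key(env: Mapping[str, str] = os.environ) -> str:
    """Return the Gemini API key, or raise a readable error if it is unset."""
    # Strip whitespace so a value such as " abc " pasted into the shell works.
    key = env.get("GOOGLE_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "GOOGLE_API_KEY is not set. Export it in your shell before "
            "running `python app.py`."
        )
    return key
```

Passing the environment in as a parameter keeps the check easy to exercise without touching the real shell environment.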
- Get your free API key from Google AI Studio
- Students usually get $300 in free credits (valid for 3 months)
- Each request costs only a few tokens, so the free allowance is more than enough for EDA projects
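One way to keep each request small is to send Gemini a compact text summary of the dataset rather than the raw CSV. The sketch below assumes that approach and uses only the standard library; `build_prompt`, its `max_columns` parameter, and the prompt wording are hypothetical, and the actual call to Gemini is left out.

```python
import csv
import io
import statistics


def build_prompt(csv_text: str, max_columns: int = 20) -> str:
    """Condense a CSV into a short text prompt (hypothetical helper)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = list(reader)
    columns = list(reader.fieldnames or [])
    lines = [f"Dataset with {len(rows)} rows and {len(columns)} columns."]

    for name in columns[:max_columns]:
        values = [row[name] for row in rows]
        present = [v for v in values if v not in ("", None)]
        missing = len(values) - len(present)
        try:
            numbers = [float(v) for v in present]
        except ValueError:
            numbers = []  # at least one value is not a number: treat as text
        if numbers:
            lines.append(
                f"- {name}: numeric, mean={statistics.fmean(numbers):.2f}, "
                f"min={min(numbers):g}, max={max(numbers):g}, missing={missing}"
            )
        else:
            lines.append(
                f"- {name}: text, unique={len(set(present))}, missing={missing}"
            )

    lines.append("Describe notable patterns, outliers, and data-quality issues.")
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_prompt("age,city\n25,Pune\n,Delhi\n35,Pune\n"))
```

The prompt grows with the number of columns, not the number of rows, so large files do not translate into large requests.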
- Upload sample.csv
- View summary + missing values + AI insights
- Explore histograms + correlation heatmap
- Download full PDF report
MIT License. Use freely for education and research.
gradio==4.36.1
pandas==2.2.2
numpy==1.26.4
matplotlib==3.9.2
seaborn==0.13.2
reportlab==4.2.2
google-generativeai==0.5.4