This project showcases end-to-end data extraction, cleaning, and interactive visualization using Selenium, Pandas, and Power BI. Data was scraped from Croma across various product categories such as Mobiles, Air Conditioners, Refrigerators, and Televisions.
The objective is to scrape product listings from the Croma eCommerce website, clean the data, and build a Power BI dashboard for price-based insights and product exploration.
- Python 3
- Selenium – for automating web scraping
- Pandas – for data processing
- Power BI – for creating interactive dashboards
- ChromeDriver
- Automated scraping from Croma.com using Selenium.
- Extracted fields: Category, Title, Price, and Product Link.
- Cleaned prices (₹ symbol removed, converted to integer).
- Final dataset saved as croma_cleaned_products.csv.
- Created Power BI dashboard:
  - 📦 Category-wise product price distribution
  - 💰 Top 10 expensive products
  - 📈 Price range histogram
  - 🔗 Table with clickable product links
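
The price-cleaning step listed above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the helper names `clean_price` and `clean_products`, the sample rows, and the `example.com` links are all made up for the example, and it assumes scraped prices look like `₹1,29,990.00`. Only the column names and the output filename come from this README.

```python
import re

import pandas as pd


def clean_price(raw):
    """Convert a scraped price string (e.g. '₹1,29,990.00') to an int, or None."""
    # Find the first run of digits, allowing commas and an optional decimal part.
    match = re.search(r"\d[\d,]*(?:\.\d+)?", str(raw))
    if match is None:
        return None
    # Remove thousands separators, then truncate any decimal part.
    return int(float(match.group().replace(",", "")))


def clean_products(df):
    """Return a copy of df with an integer Price column; unparseable rows are dropped."""
    out = df.copy()
    out["Price"] = out["Price"].map(clean_price)
    out = out.dropna(subset=["Price"]).reset_index(drop=True)
    out["Price"] = out["Price"].astype(int)
    return out


# Hypothetical sample rows standing in for the scraped data.
sample = pd.DataFrame(
    {
        "Category": ["Mobiles", "Televisions", "Refrigerators"],
        "Title": ["Phone A", "TV B", "Fridge C"],
        "Price": ["₹1,29,990.00", "₹45,990", None],
        "Product Link": [
            "https://example.com/a",
            "https://example.com/b",
            "https://example.com/c",
        ],
    }
)

cleaned = clean_products(sample)
```

Calling `cleaned.to_csv("croma_cleaned_products.csv", index=False)` afterwards would write the result in the form Power BI can import. Dropping rows without a parseable price is one possible choice; keeping them with a missing value would also work if the dashboard should list unpriced products.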
croma-web-scraping/
├── project_notebook.ipynb # All scraping and cleaning code
├── croma_cleaned_products.csv # Final cleaned dataset
├── dashboard.png # Screenshot of Power BI dashboard
└── README.md # Project overview