📚 BookMiner

BookMiner is a data pipeline project that scrapes book data from web pages, stores the raw HTML, processes and combines the data, and performs exploratory data analysis (EDA) to derive insights.

🚀 Project Workflow

Web Scraping
Scrapes book listings from online pages and stores the HTML files.
Data Extraction & Storage
Parses and combines data from multiple HTML pages into a single structured CSV file.
Exploratory Data Analysis (EDA)
Performs visual and statistical analysis to uncover patterns in book pricing, ratings, value scores, and more.

📁 Project Structure

├── 1_scraping.ipynb        # Scrapes book data and stores HTML files
├── 2_EDA.ipynb             # Performs EDA on the combined CSV data
├── DATA.csv                # Cleaned and structured dataset
├── README.md               # Project overview and instructions
├── HTMLs                   # All scraped pages from website

📊 Sample Insights

Price distribution of books
Correlation between rating and value score
Most common price ranges for high-rated books

🛠️ Tools & Libraries

Python (BeautifulSoup, Requests, Pandas)
Jupyter Notebook
Matplotlib, Seaborn for visualization

📌 Getting Started

Clone the repo:

git clone https://github.com/your-username/BookMiner.git
cd BookMiner

Run the notebooks in order:
- 1_scraping.ipynb
- 2_EDA.ipynb

📃 License

This project is for educational and non-commercial use.

Made with ❤️ for data and books.

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
htmls		htmls
.gitignore		.gitignore
1_scraping.ipynb		1_scraping.ipynb
2_EDA.ipynb		2_EDA.ipynb
DATA.csv		DATA.csv
LICENSE		LICENSE
README.md		README.md
page3.html		page3.html
page30.html		page30.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

📚 BookMiner

🚀 Project Workflow

📁 Project Structure

📊 Sample Insights

🛠️ Tools & Libraries

📌 Getting Started

📃 License

About

Uh oh!

Releases

Packages

Languages

License

SaurabhSSB/BookMiner

Folders and files

Latest commit

History

Repository files navigation

📚 BookMiner

🚀 Project Workflow

📁 Project Structure

📊 Sample Insights

🛠️ Tools & Libraries

📌 Getting Started

📃 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages