This project is a complete Python-based solution that demonstrates web scraping, data storage, and web application development.
The application scrapes quotes from https://quotes.toscrape.com, stores the data in both CSV and SQLite, and displays the scraped data using a Django web application.
This project was developed as part of a technical assignment to showcase backend and Python skills.
- Python
- Requests
- BeautifulSoup4
- SQLite
- Django
- HTML & CSS
-
Scrapes data from minimum 10 pages (pagination handled)
-
Extracts:
- Quote text
- Author name
- Tags
- Author profile URL
-
Stores data in:
- CSV file
- SQLite database
-
Prevents duplicate entries in the database
-
Displays scraped data using Django web application
-
Clean and readable user interface
git clone <repository-url>
cd project-rootpython -m venv venv
source venv/bin/activate # Windows: venv\Scripts\activatepip install requests beautifulsoup4 djangopython scraper.pyThis will:
- Scrape quotes from the website
- Create
quotes.csv - Create
quotes.dbwith all scraped data
cd quotes_webpython manage.py runserverhttp://127.0.0.1:8000/
You will see all scraped quotes displayed with author names, tags, and profile links.
- SQLite database used
- Single table:
quotes - Duplicate prevention using UNIQUE constraint on quote text
- Django models configured with
managed = Falseto use existing database
csv
quote,author,tags,author_profile
- UI is kept simple for clarity and readability
- Project focuses on correctness, structure, and functionality
- Can be extended with search, filters, or advanced styling
Harsh Galathiya Python Developer