GitHub - JosepRDA/IMDb-Scraper

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
src		src
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Repository files navigation

IMDbScraper

This Project aims to facilitate gathering data on IMDb.

Features:

Movie Title;
Release Year;
Age Rating;
Genres;
Duration;
Rating;
MetaScore;
Customizable Output

Requirements:

beautifulsoup4 – for parsing HTML
lxml - the html parser
requests – for making HTTP requests to retrieve webpage content.
pandas – for storing and saving the data to a CSV or Excel file.
openpyxl – required for saving to Excel format.

How to use:

Provide your urls: provide a list of movie urls to scrape.
Run the script.
Save the data: you can choose between csv and excel.

Notes:

Rate Limiting: The algorithm has a builtin cooldown to avoid sending too many requests in a short period of time ann getting your ip blocked by IMDb.
Always Changing Website: Due to IMDb's constant updating of their website, the elements and classes may change, requiring your attention before executing the program.

About

No description, website, or topics provided.

Report repository

Releases

No releases published

Packages

No packages published

Languages

Python 100.0%