Skip to content

JosepRDA/IMDb-Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 

Repository files navigation

IMDbScraper

This Project aims to facilitate gathering data on IMDb.

Features:

  • Movie Title;
  • Release Year;
  • Age Rating;
  • Genres;
  • Duration;
  • Rating;
  • MetaScore;
  • Customizable Output

Requirements:

  • beautifulsoup4 – for parsing HTML
  • lxml - the html parser
  • requests – for making HTTP requests to retrieve webpage content.
  • pandas – for storing and saving the data to a CSV or Excel file.
  • openpyxl – required for saving to Excel format.

How to use:

  1. Provide your urls: provide a list of movie urls to scrape.
  2. Run the script.
  3. Save the data: you can choose between csv and excel.

Notes:

  • Rate Limiting: The algorithm has a builtin cooldown to avoid sending too many requests in a short period of time ann getting your ip blocked by IMDb.
  • Always Changing Website: Due to IMDb's constant updating of their website, the elements and classes may change, requiring your attention before executing the program.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages