YouTubeViewScraper

Description

This repository is an out-of-the-box demonstration of collecting publicly accessible data from sources containing dynamically loaded content, such as YouTube. It aims to display that Selenium provides viable solutions for large-scale web scraping on JavaScript-heavy websites.

For use with non-norwegian setups, adequate adjusting must be made:

The formatting of the str_to_n_views() function is written for browsers displaying view count in Norwegian (e.g., sett 3,5 mill. ganger) and will hence require re-writing if this is not accommodated for.

To run this demo, follow the installation guide and run the main.py file.

Installation

Clone this repo (!git clone https://github.com/davidharket/YouTubeViewScraper)
Install required packages (!pip install -r requirements.txt)
Change the value of the profile_path variable on line 14 (often C:/Users/User/AppData/Local/Google/Chrome/User Data/Default).
Install a Chrome Web Driver if not already acquired (download is available here: https://chromedriver.storage.googleapis.com/index.html?path=114.0.5735.90/).
Run main.py

Usage

This is just a demonstration, but in running the script, a database containing the date of data collection, title of the video investigated, view count associated with the corresponding video, and duration time (in seconds) associated with the corresponding video for all elements scraped will be produced.

Here is a screenshot of the collected data displayed in the DB Browser (SQLite) desktop application (can be downloaded at:https://sqlitebrowser.org/dl/):

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt
screenshot.png		screenshot.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

YouTubeViewScraper

Description

Installation

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Languages

davidharket/YouTubeViewScraper

Folders and files

Latest commit

History

Repository files navigation

YouTubeViewScraper

Description

Installation

Usage

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages