Shree2604/ScrapNSearch

🕵️‍♂️ Scrape & Guess - Web Scraping Challenge Game

An interactive educational game designed to teach web scraping fundamentals through hands-on challenges using BeautifulSoup4.

🎯 Overview

Scrape & Guess is an educational web scraping game that helps students and developers learn data extraction techniques in a fun, competitive environment. Players scrape HTML files or live websites to answer progressively challenging questions.

Learning Objectives

  • Master HTML parsing with BeautifulSoup4
  • Understand CSS selectors and DOM navigation
  • Practice data extraction and transformation
  • Learn web scraping best practices
  • Develop problem-solving skills
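
As a minimal sketch of the first three objectives, the snippet below parses a small inline HTML string with BeautifulSoup4, selects elements with a CSS selector, and converts the extracted values. The markup, class names, and the `extract_movies` helper are made up for illustration and are not taken from the challenge files.

```python
from bs4 import BeautifulSoup

# Hypothetical markup; the real challenge files may be structured differently.
SAMPLE_HTML = """
<ul id="movies">
  <li class="movie" data-year="1999"><span class="title">The Matrix</span></li>
  <li class="movie" data-year="2010"><span class="title"> Inception </span></li>
</ul>
"""


def extract_movies(html):
    """Return a list of (title, year) tuples from the sample markup."""
    soup = BeautifulSoup(html, "html.parser")
    movies = []
    # CSS selector: every <li class="movie"> inside the element with id="movies".
    for item in soup.select("#movies li.movie"):
        # get_text(strip=True) trims the surrounding whitespace.
        title = item.select_one("span.title").get_text(strip=True)
        # Attribute extraction plus a type conversion from str to int.
        year = int(item["data-year"])
        movies.append((title, year))
    return movies


movies = extract_movies(SAMPLE_HTML)
print(movies)
```

`select` returns every match and `select_one` returns only the first, so the outer call walks the list items while the inner call picks the single title inside each one.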

✨ Features

  • Offline Mode: Practice with static HTML files (no internet required)
  • Progressive Difficulty: Easy → Medium → Hard → Expert challenges
  • Auto-Validation: Automated answer checking system
  • Real-World Scenarios: Movie databases, news sites, e-commerce layouts
  • Educational: Includes detailed solutions and explanations
  • Extensible: Easy to add custom challenges
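
The repository's own validation code is not shown here, so the following is only a rough sketch of how automated answer checking could work, assuming answers are compared as normalized strings. The names `normalize` and `check_answer` are hypothetical.

```python
def normalize(answer):
    """Lowercase the answer and collapse whitespace so formatting differences are ignored."""
    return " ".join(str(answer).split()).lower()


def check_answer(submitted, expected):
    """Return True when the submitted answer matches the expected one after normalization."""
    return normalize(submitted) == normalize(expected)


# Hypothetical usage: stray spaces, letter case, and int-vs-str do not matter.
print(check_answer("  The   Matrix ", "the matrix"))
print(check_answer(1999, "1999"))
print(check_answer("2010", "1999"))
```

Normalizing before comparing keeps the checker forgiving about whitespace and capitalization while still rejecting a wrong value.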

🚀 Installation

Prerequisites

  • Python 3.8 or higher
  • pip package manager
  • Text editor or IDE

Setup Instructions

# Clone the repository
git clone https://github.com/Shree2604/ScrapNSearch.git
cd ScrapNSearch

# Create virtual environment (recommended)
python -m venv venv

# Activate virtual environment
# On Windows:
venv\Scripts\activate
# On macOS/Linux:
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt

🎮 Quick Start

  1. Choose a challenge level from the challenges/ directory
  2. Read the HTML file in the data/ directory to understand the structure
  3. Write your scraping script to extract data and answer questions
  4. Record your answer and update the corresponding .py file in your fork
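
The steps above can be sketched as a short script: load a local HTML file, parse it, and compute an answer. The example writes its own temporary file so that it runs anywhere; in practice the path would point at a file in the data/ directory. The helper names, the sample markup, and the question ("how many movie rows are there?") are assumptions for illustration.

```python
import tempfile
from pathlib import Path

from bs4 import BeautifulSoup


def load_soup(path):
    """Parse a local HTML file into a BeautifulSoup tree."""
    html = Path(path).read_text(encoding="utf-8")
    return BeautifulSoup(html, "html.parser")


def count_matches(soup, selector):
    """Count the elements that match a CSS selector."""
    return len(soup.select(selector))


# Stand-in for a challenge file; the markup here is hypothetical.
SAMPLE = (
    "<table>"
    "<tr class='movie'><td>A</td></tr>"
    "<tr class='movie'><td>B</td></tr>"
    "</table>"
)

with tempfile.TemporaryDirectory() as tmp:
    sample_path = Path(tmp) / "sample.html"
    sample_path.write_text(SAMPLE, encoding="utf-8")
    soup = load_soup(sample_path)
    answer = count_matches(soup, "tr.movie")

print(answer)
```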

📊 Challenge Levels

| Level | File | Difficulty | Time | Skills Required |
|-------|------|------------|------|-----------------|
| 1 | movies.html | ⭐ Easy | 15-20 min | Basic tag finding, attribute extraction |
| 2 | news.html | ⭐⭐ Medium | 20-30 min | Text processing, data aggregation |
| 3 | ecommerce.html | ⭐⭐⭐ Hard | 30-45 min | Complex selectors, nested data |
| 4 | social_media.html | ⭐⭐⭐⭐ Expert | 45-60 min | Dynamic content, edge cases |

Detailed challenge instructions: See individual files in the challenges/ directory.
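
To give a feel for the skills named at the higher levels (text processing, data aggregation, nested data, edge cases), here is a small sketch that counts articles per category and handles an article with no category. The markup and the `count_by_category` helper are invented for this example and do not reflect the actual contents of news.html.

```python
from collections import Counter

from bs4 import BeautifulSoup

# Hypothetical markup; note the inconsistent casing and the missing category.
SAMPLE_HTML = """
<div class="article"><h2>Rates rise</h2><span class="category"> Business </span></div>
<div class="article"><h2>Cup final</h2><span class="category">sport</span></div>
<div class="article"><h2>Markets dip</h2><span class="category">business</span></div>
<div class="article"><h2>Untitled</h2></div>
"""


def count_by_category(html):
    """Return a Counter mapping each normalized category label to its article count."""
    soup = BeautifulSoup(html, "html.parser")
    counts = Counter()
    for article in soup.select("div.article"):
        tag = article.select_one("span.category")
        # Edge case: select_one returns None when the article has no category.
        label = tag.get_text(strip=True).lower() if tag else "uncategorized"
        counts[label] += 1
    return counts


counts = count_by_category(SAMPLE_HTML)
print(counts.most_common(1))
```

Checking for `None` before calling `get_text` avoids an `AttributeError` on incomplete records, which is the kind of edge case real pages tend to contain.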

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.
