renatoroquejani/connecta-web-scrape

Connecta Web Scraper API

A FastAPI application that scrapes product data from Connecta Venda catalogs and returns it as TEXT.

Installation

  1. Clone the repository:
git clone https://github.com/renatoroquejani/connecta-web-scrape.git
cd connecta-web-scrape
  2. Create a virtual environment:
python -m venv venv
  3. Activate the virtual environment:
  • On Windows:
venv\Scripts\activate
  • On macOS/Linux:
source venv/bin/activate
  4. Install dependencies:
pip install -r requirements.txt

Usage

Local Development

  1. Start the FastAPI server:
python app.py
  2. The API will be available at http://localhost:8000

  3. Access the API documentation at http://localhost:8000/docs

Using Docker Compose

  1. Build and start the container:
docker-compose up
  2. The API will be available at http://localhost:8000

  3. Access the API documentation at http://localhost:8000/docs
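
Whether the server is started with python app.py or with Docker Compose, it can take a moment before it accepts requests. The following is a minimal sketch, not part of this repository, of a readiness check that polls the documentation URL listed above. The function names docs_reachable and wait_until_ready, the timeouts, and the polling interval are all assumptions made for illustration.

```python
import time
from urllib.request import urlopen

# Documentation URL from the steps above.
DOCS_URL = "http://localhost:8000/docs"


def docs_reachable(url: str = DOCS_URL) -> bool:
    """Return True if a GET request to the docs page succeeds with HTTP 200."""
    try:
        with urlopen(url, timeout=2.0) as response:
            return response.status == 200
    except OSError:
        # Covers refused connections, timeouts, and HTTP error statuses
        # (urllib's URLError and HTTPError are subclasses of OSError).
        return False


def wait_until_ready(probe=docs_reachable, timeout: float = 30.0,
                     interval: float = 0.5) -> bool:
    """Call probe() repeatedly until it returns True or the timeout elapses."""
    deadline = time.monotonic() + timeout
    while True:
        if probe():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)


# Hypothetical usage, once the server has been started:
#     if wait_until_ready():
#         print("API is up")
```

Passing the probe as a parameter keeps the polling loop independent of the network, so it can be exercised without a running server.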

API Endpoints

Scrape Catalog

  • GET /scrape
  • Parameters:
    • url: The Connecta catalog URL to scrape
  • Example:
http://localhost:8000/scrape?url=https://app.conectavenda.com.br/c25adbbf63a83befa6c04e686c3c090f
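
The endpoint can also be called from a script. The sketch below uses only the Python standard library and assumes the server is running locally on port 8000 as described under Usage. The helper names build_scrape_url and scrape_catalog are hypothetical, and the response is simply decoded and returned as a string, since its exact format is not documented here. The catalog URL is percent-encoded so that it travels as a single url parameter.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the API is running locally, as described under Usage.
API_BASE = "http://localhost:8000"


def build_scrape_url(catalog_url: str, api_base: str = API_BASE) -> str:
    """Build the GET /scrape request URL with the catalog URL percent-encoded."""
    return f"{api_base}/scrape?{urlencode({'url': catalog_url})}"


def scrape_catalog(catalog_url: str, api_base: str = API_BASE,
                   timeout: float = 60.0) -> str:
    """Call GET /scrape and return the response body decoded as text."""
    with urlopen(build_scrape_url(catalog_url, api_base), timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset)


# Hypothetical usage with the example catalog from above:
#     text = scrape_catalog(
#         "https://app.conectavenda.com.br/c25adbbf63a83befa6c04e686c3c090f"
#     )
#     print(text)
```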

Requirements

  • Python 3.8+
  • Docker (optional)
  • See requirements.txt for complete dependency list

License

MIT License - See LICENSE for details

About

API that scrapes Connecta Venda catalogs and extracts product information, including images
