KleinanzeigenScraper

A tool for scraping laptop listings from Kleinanzeigen.de and analyzing them with ChatGPT.

Features

Scrapes laptop listings from Kleinanzeigen.de
Analyzes listings using OpenAI's GPT models to extract key information
Web interface to view and manage listings
Systemd service for continuous operation

Prerequisites

Python 3.6 or newer
Node.js and npm
Python venv module (on Debian/Ubuntu: sudo apt install python3-venv)

Installation

Automatic Installation

Clone the repository:

git clone https://github.com/yourusername/KleinanzeigenScraper.git
cd KleinanzeigenScraper

Make the installer executable:
```
chmod +x install.sh
```
Run the installer:
```
./install.sh
```
Follow the on-screen instructions to complete the installation.
Edit the config.py file to add your OpenAI API key and customize settings.

Install the systemd service (optional):

sudo cp kleinanzeigen-scraper.service.tmp /etc/systemd/system/kleinanzeigen-scraper.service
sudo systemctl daemon-reload
sudo systemctl enable kleinanzeigen-scraper.service
sudo systemctl start kleinanzeigen-scraper.service

Manual Installation

If you prefer to install manually:

Create a Python virtual environment:

python3 -m venv kleinanzeigenScraper
source kleinanzeigenScraper/bin/activate

Install Python dependencies:

pip install --upgrade pip
pip install -r requirements.txt

Install Node.js dependencies:
```
npm install
```
Create a configuration file:
```
cp config_template.py config.py
```
Edit config.py to add your OpenAI API key and customize settings.

Usage

Running the Web Interface

source kleinanzeigenScraper/bin/activate
node server.js

The web interface will be available at http://localhost:3030

Running the Scraper Directly

source kleinanzeigenScraper/bin/activate
python main.py --mode both

Command line options:

--mode: Choose between scrape, process, or both (default: both)
--urls: Specify URLs to scrape (optional)
--max-listings: Maximum number of listings to scrape per URL (optional)

Architecture

The system consists of two main components:

Node.js Server (server.js): Provides a web interface for viewing and managing scraped listings
Python Scraper (main.py): Handles the actual scraping and processing of listings

The Node.js server can trigger the Python scraper through the child_process.spawn() method, allowing users to initiate scraping jobs through the web interface.

Troubleshooting

Virtual Environment Creation Fails

If you see an error like:

The virtual environment was not created successfully because ensurepip is not available.

Install the Python venv package:

# For Debian/Ubuntu
sudo apt install python3-venv

# For Fedora
sudo dnf install python3-venv

# For Arch Linux
sudo pacman -S python-virtualenv

License

MIT License

Accessing the Application

Once the service is running, open your web browser and navigate to:

http://localhost:3030

If accessing from another device on your network, replace "localhost" with your server's IP address:

http://YOUR_SERVER_IP:3030

Managing the Service

View logs:

sudo journalctl -u kleinanzeigen-scraper.service -f

Restart the service:

sudo systemctl restart kleinanzeigen-scraper.service

Stop the service:

sudo systemctl stop kleinanzeigen-scraper.service

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
public		public
.gitignore		.gitignore
README.md		README.md
config_template.py		config_template.py
diagnose.sh		diagnose.sh
install.sh		install.sh
kleinanzeigen-scraper.service		kleinanzeigen-scraper.service
kleinanzeigen-scraper.service.tmp		kleinanzeigen-scraper.service.tmp
main.py		main.py
package.json		package.json
process_listings.py		process_listings.py
prompts.py		prompts.py
requirements.txt		requirements.txt
run_scraper.js		run_scraper.js
scraper.py		scraper.py
server.js		server.js
start-service.sh		start-service.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KleinanzeigenScraper

Features

Prerequisites

Installation

Automatic Installation

Manual Installation

Usage

Running the Web Interface

Running the Scraper Directly

Architecture

Troubleshooting

Virtual Environment Creation Fails

License

Accessing the Application

Managing the Service

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Infraviored/KleinanzeigenScraper

Folders and files

Latest commit

History

Repository files navigation

KleinanzeigenScraper

Features

Prerequisites

Installation

Automatic Installation

Manual Installation

Usage

Running the Web Interface

Running the Scraper Directly

Architecture

Troubleshooting

Virtual Environment Creation Fails

License

Accessing the Application

Managing the Service

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages