Job Extraction Service

A service for extracting and processing job-related information from various sources.

Description

The service extracts, processes, and manages job-related data: it parses job listings, stores the extracted job information, and manages the associated metadata.

Features

  • Job data extraction
  • Data processing and normalization
  • Job information storage
  • API endpoints for job data access

Installation

  1. Clone the repository:
git clone https://github.com/colorpulse6/job-extraction-service.git
cd job-extraction-service
  2. Create and activate a virtual environment:
python -m venv venv
source venv/bin/activate  # On Windows use: venv\Scripts\activate
  3. Install dependencies:
pip install -r requirements.txt

Usage

Starting the Service

python app.py

The service will start on port 5002 by default.

API Endpoints

Extract Job Data

POST /extract

Request body (source is optional and accepts linkedin, indeed, or generic):

{
  "url": "https://example.com/job-posting",
  "source": "linkedin"
}
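
As a rough illustration, a client could build and send this request with only the Python standard library. This is a minimal sketch, not part of the service: the helper names build_extract_request and extract_job are hypothetical, the base URL assumes the service runs locally on the default port 5002, and the JSON Content-Type header is an assumption about what the endpoint expects.

```python
import json
import urllib.request

# Assumption: the service is running locally on its default port.
BASE_URL = "http://localhost:5002"


def build_extract_request(job_url, source=None, base_url=BASE_URL):
    """Build a POST /extract request (hypothetical helper).

    The optional `source` key is left out of the body when not given.
    """
    payload = {"url": job_url}
    if source is not None:
        payload["source"] = source
    return urllib.request.Request(
        f"{base_url}/extract",
        data=json.dumps(payload).encode("utf-8"),
        # Assumption: the endpoint accepts a JSON-encoded body.
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract_job(job_url, source=None, base_url=BASE_URL, timeout=30):
    """Send the request and decode the JSON response body."""
    request = build_extract_request(job_url, source, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    job = extract_job("https://example.com/job-posting", source="linkedin")
    print(job["title"], "at", job["company"])
```

Keeping request construction separate from sending makes the payload easy to inspect without a running service.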

Response:

{
  "title": "Software Engineer",
  "company": "Example Corp",
  "location": "San Francisco, CA",
  "description": "Job description...",
  "salary": "$100,000 - $150,000",
  "requirements": ["5+ years of experience", "Python knowledge"],
  "benefits": ["Health insurance", "401(k)"],
  "postedDate": "2023-01-01",
  "applicationDeadline": "2023-02-01",
  "jobType": "Full-time",
  "remoteStatus": "Remote",
  "originalHtml": "<html>...</html>",
  "extractionDate": "2023-01-15T12:00:00Z",
  "source": "linkedin"
}
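
On the client side, a response like the one above could be mapped onto a typed structure. The sketch below is illustrative only: the JobPosting class and parse_job function are hypothetical names, only a subset of the fields is modeled, and treating title and company as required while the rest are optional is an assumption, since the response schema does not say which fields may be missing.

```python
from dataclasses import dataclass, field
from datetime import date, datetime
from typing import List, Optional


@dataclass
class JobPosting:
    """Hypothetical client-side view of a subset of the /extract response."""
    title: str
    company: str
    location: Optional[str] = None
    salary: Optional[str] = None
    requirements: List[str] = field(default_factory=list)
    benefits: List[str] = field(default_factory=list)
    posted_date: Optional[date] = None
    extraction_date: Optional[datetime] = None
    source: Optional[str] = None


def parse_job(payload: dict) -> JobPosting:
    """Convert the decoded JSON response into a JobPosting."""
    posted = payload.get("postedDate")
    extracted = payload.get("extractionDate")
    return JobPosting(
        title=payload["title"],      # assumed to be always present
        company=payload["company"],  # assumed to be always present
        location=payload.get("location"),
        salary=payload.get("salary"),
        requirements=list(payload.get("requirements") or []),
        benefits=list(payload.get("benefits") or []),
        posted_date=date.fromisoformat(posted) if posted else None,
        # fromisoformat() rejects a trailing "Z" before Python 3.11,
        # so rewrite it as an explicit UTC offset first.
        extraction_date=(
            datetime.fromisoformat(extracted.replace("Z", "+00:00"))
            if extracted else None
        ),
        source=payload.get("source"),
    )
```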

Health Check

GET /health

Response:

{
  "status": "healthy"
}
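
A script that waits for the service to come up could poll this endpoint. The following is a minimal sketch under stated assumptions: parse_health and is_healthy are hypothetical helpers, the default base URL assumes a local instance on port 5002, and any network failure or unexpected body is simply treated as "not healthy".

```python
import json
import urllib.request


def parse_health(body):
    """Return True only if the body is JSON with "status": "healthy"."""
    try:
        return json.loads(body).get("status") == "healthy"
    except (ValueError, AttributeError):
        # Not valid JSON, or valid JSON that is not an object.
        return False


def is_healthy(base_url="http://localhost:5002", timeout=5):
    """Call GET /health; connection and HTTP errors count as unhealthy."""
    try:
        with urllib.request.urlopen(f"{base_url}/health", timeout=timeout) as response:
            return parse_health(response.read())
    except OSError:
        # URLError and HTTPError are both subclasses of OSError.
        return False


if __name__ == "__main__":
    print("healthy" if is_healthy() else "unavailable")
```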

Configuration

[Add configuration details here]

Contributing

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add some amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.
