A service for extracting and processing job-related information from various sources.
This service is designed to extract, process, and manage job-related data. It provides functionality for parsing job listings, storing job information, and managing job-related metadata.
- Job data extraction
- Data processing and normalization
- Job information storage
- API endpoints for job data access
- Clone the repository:
git clone https://github.com/colorpulse6/job-extraction-service.git
cd job-extraction-service- Create and activate a virtual environment:
python -m venv venv
source venv/bin/activate # On Windows use: venv\Scripts\activate- Install dependencies:
pip install -r requirements.txtpython app.py
The service will start on port 5002 by default.
POST /extract
Request body:
{
"url": "https://example.com/job-posting",
"source": "linkedin" // Optional: linkedin, indeed, or generic
}Response:
{
"title": "Software Engineer",
"company": "Example Corp",
"location": "San Francisco, CA",
"description": "Job description...",
"salary": "$100,000 - $150,000",
"requirements": ["5+ years of experience", "Python knowledge"],
"benefits": ["Health insurance", "401(k)"],
"postedDate": "2023-01-01",
"applicationDeadline": "2023-02-01",
"jobType": "Full-time",
"remoteStatus": "Remote",
"originalHtml": "<html>...</html>",
"extractionDate": "2023-01-15T12:00:00Z",
"source": "linkedin"
}GET /health
Response:
{
"status": "healthy"
}[Add configuration details here]
- Fork the repository
- Create your feature branch (
git checkout -b feature/amazing-feature) - Commit your changes (
git commit -m 'Add some amazing feature') - Push to the branch (
git push origin feature/amazing-feature) - Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.