# DSCI551 | Group 48 Job Posting Platform

**Group 48:** David Tovmasyan, Jinyang Du, Wenjing Huang

A comprehensive Django-based database management system for job postings. It streamlines the job search and recruitment process by providing a user-friendly, efficient, and interactive job posting platform built on a distributed database architecture.
## Table of Contents

- Overview
- Features
- Architecture
- Prerequisites
- Installation
- Configuration
- Usage
- Project Structure
- API Endpoints
- Contributing
- License
## Overview

This project implements a distributed job posting management system using Django with MySQL database partitioning. The system features:
- Distributed Database Architecture: Uses hash-based partitioning across 3 MySQL databases
- Web Scraping: Automated LinkedIn job data collection using Selenium and BeautifulSoup
- User Authentication: Complete user registration, login, and profile management
- Job Management: CRUD operations for job postings with advanced search capabilities
- Responsive UI: Modern Bootstrap-based interface with interactive features
## Features

- **Advanced Job Search**: Search by title, location, company, and filters
- **Database Partitioning**: Hash-based distribution across multiple databases
- **Automated Data Collection**: LinkedIn scraping with Selenium WebDriver
- **User Management**: Registration, authentication, and profile management
- **Data Import/Export**: CSV-based data management with custom commands
- **Responsive Design**: Mobile-friendly Bootstrap interface
- **Favorites System**: Save and manage favorite job postings
- **Date Range Operations**: Bulk delete jobs within specified time periods
## Architecture

The system uses a distributed database architecture with:
- Primary Database: SQLite for user management and session data
- Partitioned Databases: 3 MySQL databases for job data distribution
- Hash Function: Custom partitioning algorithm based on job location state codes
- Django ORM: Database-agnostic data access layer
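The partitioning described above might look like the following sketch. The function name, the state-code hashing scheme, and the router class are illustrative assumptions, not the project's exact code:

```python
# Sketch: hash-based partitioning by job location state code.
# The hashing scheme and names here are assumptions, not the
# project's actual implementation.

PARTITIONS = ['first', 'second', 'third']  # Django database aliases

def partition_for_state(state_code: str) -> str:
    """Map a two-letter state code to one of the three MySQL databases."""
    index = sum(ord(c) for c in state_code.upper()) % len(PARTITIONS)
    return PARTITIONS[index]

class JobRouter:
    """Hypothetical Django database router sending Job rows to
    their partition; other models fall back to 'default'."""

    def db_for_read(self, model, **hints):
        instance = hints.get('instance')
        if model.__name__ == 'Job' and instance is not None:
            return partition_for_state(instance.state)
        return None  # None lets Django use the 'default' database

    db_for_write = db_for_read
```

A router like this would be registered in `settings.py` via `DATABASE_ROUTERS`; because the hash depends only on the state code, every query for a given state is consistently routed to the same partition.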
## Prerequisites

Before running this project, ensure you have:
- Python 3.8 or higher
- MySQL 8.0 or higher
- pip (Python package installer)
- Git
## Installation

Clone the repository:

```bash
git clone https://github.com/Jinyangd/DSCI551_Group48_Project.git
cd DSCI551_Group48_Project
```

Install the Python dependencies:

```bash
pip install django
pip install mysqlclient
pip install crispy-forms
pip install crispy-bootstrap4
pip install selenium
pip install beautifulsoup4
pip install requests
```

Create three MySQL databases for job data partitioning:

```sql
CREATE DATABASE db_one;
CREATE DATABASE db_two;
CREATE DATABASE db_three;
```

## Configuration

Edit `django_project/django_project/settings.py` and update the database configurations:
```python
DATABASES = {
    'default': {
        'ENGINE': 'django.db.backends.sqlite3',
        'NAME': 'jobs',
    },
    'first': {
        'ENGINE': 'django.db.backends.mysql',
        'NAME': 'db_one',
        'USER': 'your_username',
        'PASSWORD': 'your_password',
        'HOST': 'localhost',
        'PORT': '3306',
    },
    'second': {
        'ENGINE': 'django.db.backends.mysql',
        'NAME': 'db_two',
        'USER': 'your_username',
        'PASSWORD': 'your_password',
        'HOST': 'localhost',
        'PORT': '3306',
    },
    'third': {
        'ENGINE': 'django.db.backends.mysql',
        'NAME': 'db_three',
        'USER': 'your_username',
        'PASSWORD': 'your_password',
        'HOST': 'localhost',
        'PORT': '3306',
    },
}
```

Apply migrations to the default database and to each partition:

```bash
python manage.py makemigrations
python manage.py migrate
python manage.py migrate --database=first
python manage.py migrate --database=second
python manage.py migrate --database=third
```

Create a `.env` file in the project root (optional):
```
SECRET_KEY=your-secret-key-here
DEBUG=True
ALLOWED_HOSTS=localhost,127.0.0.1
```

Collect static files for production:
```bash
python manage.py collectstatic
```

## Usage

Download the `example.csv` file with pre-processed job data, or run the scraper to collect fresh data:

```bash
python linkedin_scrape.py
```

**Note:** Scraping may take 2+ hours depending on the number of jobs and network conditions.
Import the job data:

```bash
python manage.py import_jobs "path/to/your/jobs.csv"
```

Start the development server:

```bash
python manage.py runserver
```

Access the application at http://127.0.0.1:8000/.
Bulk-delete jobs within a specified date range:

```bash
python manage.py remove_jobs "all" "2024-01-01" "2024-12-31"
```

Create an admin account:

```bash
python manage.py createsuperuser
```

## Project Structure

```
dsci551_group48/
├── django_project/           # Main Django project
│   ├── django_project/       # Project settings and configuration
│   ├── blog/                 # Job posting application
│   │   ├── management/       # Custom Django commands
│   │   ├── templates/        # HTML templates
│   │   ├── static/           # CSS, JS, and static files
│   │   └── models.py         # Job model and database logic
│   ├── users/                # User authentication app
│   └── manage.py             # Django management script
├── linkedin_scrape.py        # LinkedIn data scraper
├── jobs                      # SQLite database file
└── media/                    # User-uploaded files
```
Key components:

- **Hash Function Implementation**: Database partitioning logic
- **LinkedIn Scraper**: Automated data collection using Selenium
- **Job Model**: Database schema and business logic
- **Django Settings**: Project configuration
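As an illustration of the scraper's parsing step only: the project uses Selenium with BeautifulSoup, but the sketch below swaps in Python's stdlib `HTMLParser` so it runs standalone, and the `job-title` class name is an assumption about the page markup:

```python
from html.parser import HTMLParser

# Illustrative sketch of extracting job titles from scraped HTML.
# The real scraper uses Selenium + BeautifulSoup; this uses the
# stdlib parser, and the 'job-title' class name is assumed.
class JobCardParser(HTMLParser):
    """Collect the text of elements whose class contains 'job-title'."""

    def __init__(self):
        super().__init__()
        self.titles = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        classes = dict(attrs).get('class') or ''
        if 'job-title' in classes:
            self._in_title = True

    def handle_endtag(self, tag):
        self._in_title = False

    def handle_data(self, data):
        if self._in_title and data.strip():
            self.titles.append(data.strip())
```

In the actual scraper, Selenium renders the JavaScript-heavy LinkedIn page first and the rendered HTML is then handed to the parsing layer.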
## API Endpoints

### Job Management

- `GET /` - Home page with job listings
- `GET /job/<int:pk>/` - Job detail view
- `GET /job/new/` - Create new job posting
- `POST /job/<int:pk>/update/` - Update job posting
- `POST /job/<int:pk>/delete/` - Delete job posting

### User Management

- `GET /register/` - User registration
- `GET /login/` - User login
- `GET /profile/` - User profile
- `GET /logout/` - User logout

### Search and Favorites

- `GET /search/` - Job search interface
- `GET /favorites/` - User's favorite jobs
- `GET /user/<str:username>/` - User's job postings
## Contributing

1. Fork the repository
2. Create a feature branch (`git checkout -b feature/amazing-feature`)
3. Commit your changes (`git commit -m 'Add amazing feature'`)
4. Push to the branch (`git push origin feature/amazing-feature`)
5. Open a Pull Request
## License

This project is part of the DSCI551 course at USC. All rights reserved.
## Team

- David Tovmasyan - Backend Development & Database Architecture
- Jinyang Du - Frontend Development & UI/UX Design
- Wenjing Huang - Data Scraping & System Integration
For questions or support, please contact the development team or create an issue in the repository.
Built with ❤️ for DSCI551 Database Systems