Skip to content

"A Machine Learning project to detect fake job postings using NLP (TF-IDF) and Random Forest Classifier, achieving ~93% accuracy. Helps identify fraudulent job listings through automated text classification."

Notifications You must be signed in to change notification settings

pulkittaneja09/Fake-Job-Detector-Training

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

2 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ•΅οΈ Fake Job Posting Detection πŸ“Œ Overview

This project uses Natural Language Processing (NLP) and Machine Learning to detect fake job postings. By analyzing job descriptions with TF-IDF vectorization and training a Random Forest Classifier, the model can classify postings as real or fake with ~93% accuracy.

πŸ“‚ Dataset

Dataset: Fake Job Postings Dataset (Kaggle)

Contains job descriptions labeled as real (0) or fake (1).

βš™οΈ Tech Stack

Python 3

pandas, numpy, matplotlib, seaborn β†’ Data Analysis & Visualization

scikit-learn β†’ ML model (Random Forest, TF-IDF Vectorizer)

NLTK β†’ Text Preprocessing (stopwords, stemming)

joblib β†’ Saving & Loading Model

πŸš€ Steps in the Project

Data Cleaning & Preprocessing

Removed stopwords, punctuation, and applied stemming

Converted text into numerical features using TF-IDF

Model Training

Trained Random Forest Classifier

Achieved 93% accuracy

Model Saving

Saved trained model (model.pkl)

Saved vectorizer (vectorizer.pkl)

Deployment (Optional)

Can be deployed using Streamlit or Flask + Vercel/Render

πŸ“Š Results

Accuracy: ~93%

Algorithm Used: Random Forest

Feature Extraction: TF-IDF

About

"A Machine Learning project to detect fake job postings using NLP (TF-IDF) and Random Forest Classifier, achieving ~93% accuracy. Helps identify fraudulent job listings through automated text classification."

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published