This repository contains a Jupyter Notebook demonstrating a case study on fake news detection using Natural Language Processing (NLP) and machine learning.
The notebook covers:
- Data cleaning and preprocessing
- Tokenization and lemmatization
- Feature extraction using Bag-of-Words
- Model training and evaluation using:
- Logistic Regression
- Support Vector Machine (SVM)
- Exploratory data analysis with visualizations (Seaborn)
This project is part of a learning exercise to understand:
- Text preprocessing for NLP
- Transforming text into machine learning-ready features
- Comparing performance of different ML classifiers
- Publicly available dataset
- Contains news articles labeled as
FakeorReal
- Clone the repository
- Open the notebook in Jupyter
- Run each cell sequentially to reproduce the analysis
This repository is for educational purposes and showcases my learning in NLP and text classification.