MBTI-Personality-Prediction

This project predicts individuals' personality types from their writing using Natural Language Processing (NLP) techniques and Machine Learning (ML) algorithms. The dataset used for training and testing is the Myers-Briggs Type Indicator (MBTI) dataset, which contains posts from members of the PersonalityCafe forum along with their personality types under the MBTI framework.

The project report can be read here

Data Preprocessing

The dataset was preprocessed by performing the following steps:

Converting all text to lowercase
Removing URLs, mentions, special characters, and stop words
Stemming and lemmatization
Vectorizing the text using the Term Frequency-Inverse Document Frequency (TF-IDF) technique
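
The steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the `clean_post` helper, the regular expressions, and the two sample posts are assumptions made for the example, and scikit-learn's built-in English stop-word list stands in for whichever list the project used. Stemming and lemmatization are left out to keep the sketch free of extra downloads.

```python
import re

from sklearn.feature_extraction.text import ENGLISH_STOP_WORDS, TfidfVectorizer

# Hypothetical patterns; the project's own cleaning rules may differ.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+")
NON_ALPHA_RE = re.compile(r"[^a-z\s]")


def clean_post(text: str) -> str:
    """Lowercase, strip URLs, mentions and special characters, drop stop words."""
    text = text.lower()
    text = URL_RE.sub(" ", text)
    text = MENTION_RE.sub(" ", text)
    text = NON_ALPHA_RE.sub(" ", text)
    tokens = [tok for tok in text.split() if tok not in ENGLISH_STOP_WORDS]
    return " ".join(tokens)


# Made-up sample posts, used only to exercise the pipeline.
posts = [
    "Check this out https://example.com @friend I LOVE quiet evenings!!!",
    "Parties are great, and I enjoy meeting new people.",
]
cleaned = [clean_post(p) for p in posts]

# TF-IDF turns each cleaned post into a sparse row of term weights.
vectorizer = TfidfVectorizer()
features = vectorizer.fit_transform(cleaned)

print(cleaned)
print(features.shape)  # (number of posts, vocabulary size)
```

Cleaning before vectorizing keeps URLs and usernames out of the vocabulary, so the TF-IDF weights reflect the words people actually wrote.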

Handling Imbalanced Data

The MBTI dataset was imbalanced, with some personality types having significantly fewer samples than others. To handle this, undersampling, oversampling, and SMOTE (Synthetic Minority Over-sampling Technique) were used to balance the data.
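
As a rough illustration of the simplest of these techniques, the sketch below performs random oversampling with the standard library only. The `random_oversample` helper and the toy labels are assumptions for the example, not the project's code. SMOTE differs in that it creates synthetic minority samples by interpolating between neighbours in feature space rather than duplicating existing ones; it is not shown here.

```python
import random
from collections import Counter, defaultdict


def random_oversample(texts, labels, seed=0):
    """Duplicate minority-class samples at random until all classes are the same size."""
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for text, label in zip(texts, labels):
        by_label[label].append(text)

    # Every class is grown to the size of the largest class.
    target = max(len(items) for items in by_label.values())

    out_texts, out_labels = [], []
    for label, items in by_label.items():
        extra = [rng.choice(items) for _ in range(target - len(items))]
        for text in items + extra:
            out_texts.append(text)
            out_labels.append(label)
    return out_texts, out_labels


# Toy data: four posts of one type, one post of another.
texts = ["a", "b", "c", "d", "e"]
labels = ["INFP", "INFP", "INFP", "INFP", "ESTJ"]

bal_texts, bal_labels = random_oversample(texts, labels)
print(Counter(bal_labels))
```

Resampling should normally be applied to the training split only, so that duplicated or synthetic samples do not leak into the evaluation data.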

Model Training

Six different models were trained on the preprocessed dataset:

Linear SVC
SVC
KNN
Random Forest
Multinomial Naive Bayes
Logistic Regression
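
One way to train and compare the classifiers listed above is sketched below with scikit-learn. The toy posts, labels, and hyperparameters are assumptions chosen so the example runs quickly; they are not the project's data or settings.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import MultinomialNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC, LinearSVC

# Made-up training posts with hypothetical type labels.
texts = [
    "i enjoy reading alone at home",
    "quiet evenings with a book are perfect",
    "i prefer thinking before speaking",
    "long walks alone help me recharge",
    "i love organizing big parties",
    "meeting new people gives me energy",
    "i like leading the team meeting",
    "crowded events are exciting and fun",
]
labels = ["INFP", "INFP", "INFP", "INFP", "ESTJ", "ESTJ", "ESTJ", "ESTJ"]

models = {
    "Linear SVC": LinearSVC(),
    "SVC": SVC(),
    "KNN": KNeighborsClassifier(n_neighbors=3),
    "Random Forest": RandomForestClassifier(n_estimators=50, random_state=0),
    "Multinomial Naive Bayes": MultinomialNB(),
    "Logistic Regression": LogisticRegression(max_iter=1000),
}

predictions = {}
for name, model in models.items():
    # Each pipeline vectorizes the text with TF-IDF, then fits the classifier.
    pipeline = make_pipeline(TfidfVectorizer(), model)
    pipeline.fit(texts, labels)
    predictions[name] = pipeline.predict(["reading a book alone"])[0]
    print(f"{name}: {predictions[name]}")
```

Wrapping the vectorizer and classifier in one pipeline ensures the same TF-IDF vocabulary is used at training and prediction time. On real data, each model would be scored on a held-out test split rather than on a single sample.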

GUI

A simple web-based graphical user interface (GUI) was built using Flask, which allows users to input a text sample and receive a predicted personality type based on the trained models.

To run the GUI, first install the dependencies:

pip install -r requirements.txt

Then run the following command:

python app.py
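
For readers unfamiliar with Flask, a prediction endpoint of the kind described above might look roughly like this. It is a sketch under stated assumptions, not the contents of the repository's `app.py`: the `/predict` route, the `text` form field, and the `predict_type` stub (which returns a fixed label instead of calling a trained model) are all hypothetical.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)


def predict_type(text: str) -> str:
    """Hypothetical stand-in for the trained model; always returns a fixed label."""
    return "INFP"


@app.route("/predict", methods=["POST"])
def predict():
    # Read the submitted text from the form; reject empty submissions.
    text = request.form.get("text", "").strip()
    if not text:
        return jsonify(error="No text provided"), 400
    return jsonify(prediction=predict_type(text))
```

In a real application, `predict_type` would load the fitted vectorizer and classifier once at startup and apply them to the submitted text, and the module would start the development server (for example by calling `app.run()` under a main guard).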

Kaggle Notebook

A Kaggle notebook was created to provide a step-by-step guide for the project. It includes the code, visualizations, and explanations of the various techniques used.

Credits

This project was created by:

The dataset used in this project was obtained from Kaggle and can be found here.

If you have any questions or feedback, feel free to open an issue or contact me at:

Email: mohdazeemkhan64@gmail.com

Thank you for checking out this project!

License

This project is licensed under the MIT License - see the LICENSE file for details.
