Mailguard

Here’s a README.md file template that explains your project, the steps involved in the setup, and the process of email extraction and analysis:

Mailguard

Mailguard is a Python-based tool that extracts emails from a specified Gmail account and analyzes them to classify emails as phishing or legitimate using a pre-trained machine learning model.

Features

Email Extraction: Connects to a Gmail account using IMAP to fetch emails based on specific search criteria.
Email Analysis: Analyzes extracted emails for phishing content using a machine learning model.
Frontend Interface: A Flask-based web application that allows users to input credentials, extract emails, and view results in a structured table.

Project Structure

Mailguard/
├── classified_emails.csv    # Output file containing classified emails
├── credentials.yml          # Stores email credentials (auto-generated)
├── email_analyzer.py        # Script for analyzing emails
├── email_extract.py         # Script for extracting emails from Gmail
├── emails.csv               # Intermediate file containing extracted emails
├── phishing_model.pkl       # Pre-trained model for phishing detection (not uploaded)
├── vectorizer.pkl           # Saved vectorizer for preprocessing text (not uploaded)
├── app.py                   # Flask application for user interaction
├── requirements.txt         # Python dependencies
└── README.md                # Project documentation

Important : We could not upload our phising_model.pkl due to its large size.

Here is the link: https://drive.google.com/file/d/1Y_epwWX7fbufkHq9ooOO4dIpAlXcH_Hf/view?usp=sharing

Requirements

Python 3.8 or higher
Gmail account with App Password
Required libraries (see requirements.txt):
- Flask
- Pandas
- scikit-learn
- PyYAML
- IMAPClient
- joblib

Setup Instructions

Clone the Repository:

git clone https://github.com/your-username/Mailguard.git
cd Mailguard

Install Dependencies: Use the requirements.txt file to install all necessary dependencies.
```
pip install -r requirements.txt
```
Place the Pre-trained Model: Download the phishing_model.pkl and vectorizer.pkl files and place them in the project directory.
Run the Flask Application:
```
python app.py
```
Access the Web App: Open your browser and go to http://127.0.0.1:5000.

Usage Workflow

Step 1: Email Extraction

Log in using your Gmail email and app password via the web app.
The email_extract.py script:
1. Connects to Gmail using the IMAP protocol.
2. Searches for emails based on predefined criteria (e.g., sender email address).
3. Extracts the plain text content of each email and saves it in emails.csv.

Step 2: Email Analysis

After emails are extracted, the email_analyzer.py script:
1. Preprocesses the extracted email content by removing HTML tags, URLs, special characters, and converting to lowercase.
2. Uses the pre-trained model and vectorizer to predict whether each email is phishing (1) or legitimate (0).
3. Saves the predictions in classified_emails.csv.

Step 3: Viewing Results

The results are displayed in a table on the "Results" page of the web app.

Example Workflow

Login: Enter your Gmail email and app password on the home page of the app.
Email Extraction: Emails are fetched from your inbox based on the specified search criteria (e.g., sender address).
Email Analysis: Extracted emails are analyzed, and a phishing classification report is generated.
Results: Visit the "Results" page to view the phishing classifications for each email.

Notes

Security: Ensure your Gmail account has App Passwords enabled. Avoid using your primary account password.
Custom Criteria: Update email_extract.py to change the email search criteria (e.g., different sender, subject).

Example Output

Classified Emails (`classified_emails.csv`)

Email Text	Prediction
"Update your account details immediately"	1
"Welcome to our platform!"	0

1: Phishing
0: Legitimate

Troubleshooting

Email Extraction Issues:
- Ensure your Gmail account has IMAP enabled (Gmail IMAP Settings).
- Verify the sender email address specified in email_extract.py.
Model/Vectorizer Missing:
- Ensure the phishing_model.pkl and vectorizer.pkl files are present in the project directory.
App Not Running:
- Ensure Flask is installed and all dependencies are satisfied (pip install -r requirements.txt).

License

This project is licensed under the MIT License.

You can copy this README.md template, customize the repository URL, and adjust details to fit your specific project setup. Let me know if you need further clarification!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Mailguard

Features

Project Structure

Important : We could not upload our phising_model.pkl due to its large size.

Requirements

Setup Instructions

Usage Workflow

Step 1: Email Extraction

Step 2: Email Analysis

Step 3: Viewing Results

Example Workflow

Notes

Example Output

Classified Emails (`classified_emails.csv`)

Troubleshooting

License

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
training_model		training_model
website		website
README.md		README.md
classified_emails.csv		classified_emails.csv
credentials.yml		credentials.yml
email_analyzer.py		email_analyzer.py
email_extract.py		email_extract.py
emails.csv		emails.csv

source-rashi/SpamMail-Detection

Folders and files

Latest commit

History

Repository files navigation

Mailguard

Features

Project Structure

Important : We could not upload our phising_model.pkl due to its large size.

Requirements

Setup Instructions

Usage Workflow

Step 1: Email Extraction

Step 2: Email Analysis

Step 3: Viewing Results

Example Workflow

Notes

Example Output

Classified Emails (classified_emails.csv)

Troubleshooting

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Classified Emails (`classified_emails.csv`)

Packages