Threat-Scope: PHP Vulnerability Scanner

A comprehensive web-based PHP vulnerability scanner that uses machine learning and static analysis to detect SQL injection vulnerabilities in PHP source code.

Features

🔍 Vulnerability Detection

Machine Learning: Uses Logistic Regression with TF-IDF vectorization (95.6% recall, 39.8% precision)
Static Analysis: Implements taint analysis to track data flow from user inputs to SQL queries
Real-time Scanning: Upload PHP files and get instant vulnerability analysis
Drag & Drop: Modern file upload with drag-and-drop functionality
Interactive Navigation: Click on vulnerable lines to see detailed taint analysis reports
Comment Navigation: Click on comment lines to jump to related code errors

🔧 Code Mitigation

Automatic Fix Generation: AI-powered secure code suggestions for detected vulnerabilities
Multiple Vulnerability Types: Handles SQL injection, XSS, command injection, and file inclusion
Detailed Explanations: Comprehensive explanations of vulnerabilities and fixes
Copy-to-Clipboard: Easy copying of mitigation reports for implementation

🗄️ Database Management

MySQL Integration: Full MySQL support with phpMyAdmin interface
SQLite Fallback: Automatic fallback to SQLite for development
Advanced Analytics: Comprehensive statistics and trend analysis with 4 interactive charts
Model Performance Metrics: Real-time display of precision, recall, F1-score, and threshold
Security Assessment: Threat level analysis and risk distribution visualization
Data Export: CSV export functionality for scan results

🌐 Web Interface

Modern UI: Clean, responsive interface built with Tailwind CSS
Real-time Results: Interactive vulnerability display with detailed reports
Scan History: Complete history of all scans with statistics and mitigation buttons
Database Dashboard: Enterprise-grade analytics with 12+ metrics and 4 charts
Professional Visualizations: Risk distribution, vulnerability trends, and activity monitoring

Quick Start

Using Docker (Recommended)

Clone and navigate to the project:

git clone <repository-url>
cd threat-scope

Start the services:
```
docker-compose up -d
```
Access the applications:
- Main Scanner: http://127.0.0.1:5000 (or http://localhost:5000)
- phpMyAdmin: http://localhost:8080
- MySQL: localhost:3306

Manual Setup

Install dependencies:
```
cd web
pip install -r requirements.txt
```

Set up MySQL (optional):

# Create database
mysql -u root -p
CREATE DATABASE threat_scope;
CREATE USER 'threat_user'@'localhost' IDENTIFIED BY 'threat_password';
GRANT ALL PRIVILEGES ON threat_scope.* TO 'threat_user'@'localhost';

Set environment variables:

export DATABASE_URL="mysql+pymysql://threat_user:threat_password@localhost:3306/threat_scope"

Run the application:
```
python app.py
```

Database Migration

If you have existing SQLite data, migrate it to MySQL:

cd web
python migrate_to_mysql.py

Usage

1. Upload and Scan

Navigate to http://127.0.0.1:5000 (or http://localhost:5000)
Upload a PHP file using the drag-and-drop interface
View real-time vulnerability analysis results

2. View Scan History

Click on "Past Scans" tab to see all previous scans
View detailed statistics for each scan
Click on any scan to view its results

3. Database Management

Click on "Database" tab to access the management interface
View comprehensive statistics with 12+ metrics and 4 interactive charts
Monitor model performance (precision, recall, F1-score)
Assess security posture with threat level and risk distribution
Export data as CSV
Delete individual scans

4. Code Mitigation

Upload vulnerable PHP files for scanning
Click "🔧 Generate Fixes" button on scans with vulnerabilities
View detailed mitigation suggestions in interactive modal
Copy comprehensive fix reports to clipboard
Implement suggested secure coding patterns

5. phpMyAdmin Access

Navigate to http://localhost:8080
Login with MySQL credentials
Direct database access and management

API Endpoints

Scan Management

GET /api/scans - Get paginated scan data
GET /api/stats - Get comprehensive system statistics and metrics
DELETE /api/scan/<id>/delete - Delete a specific scan

Code Mitigation

GET /api/scan/<id>/mitigate - Generate mitigation suggestions for a specific scan
POST /api/mitigate - Generate mitigation suggestions for uploaded file

Data Export

GET /api/export/csv - Export all scan data as CSV

Project Structure

threat-scope/
├── web/                    # Flask web application
│   ├── app.py             # Main Flask application
│   ├── code_mitigator.py  # Code mitigation engine
│   ├── templates/         # HTML templates
│   │   ├── index.html     # Main scanner interface
│   │   └── database.html  # Database management interface
│   ├── scripts/           # Detection scripts
│   └── requirements.txt   # Python dependencies
├── data/                  # Training data
│   ├── train/safe/        # Safe PHP files
│   └── train/unsafe/      # Unsafe PHP files
├── models/                # Trained ML models
├── scripts/               # Training and analysis scripts
├── docker-compose.yml     # Docker configuration
└── README.md             # This file

Machine Learning Model

The scanner uses a Logistic Regression classifier trained on:

Features: TF-IDF vectorization with character n-grams (3-5 characters)
Training Data: 8,640 safe and 912 unsafe PHP files
Analysis: Line-level vulnerability detection with taint analysis
Performance: 95.6% recall, 39.8% precision, 56.2% F1-score
Threshold: 0.719 (optimized for maximum vulnerability detection)

Database Schema

Scan Table

id: Primary key
filename: Name of scanned file
timestamp: When the scan was performed
results: JSON containing detailed scan results
total_lines: Total number of lines in the file
unsafe_lines: Number of unsafe lines detected
safe_lines: Number of safe lines
file_size: File size in bytes

Development

Training the Model

cd scripts
python prepare_data.py
python train_model.py --input preprocessed/train_linelevel.csv --model_out ../models/logreg_model.pkl --vectorizer_out ../models/tfidf_vectorizer.pkl

Testing Detection

python scripts/detect_lines.py --file path/to/file.php --model models/logreg_model.pkl --vectorizer models/tfidf_vectorizer.pkl --threshold 0.719

Model Evaluation

# Run comprehensive evaluation
python scripts/comprehensive_evaluation.py

# Evaluate thresholds
python scripts/eval_thresholds.py

# Test mitigation system
python test_mitigation.py

Configuration

Environment Variables

DATABASE_URL: Database connection string
FLASK_ENV: Flask environment (development/production)

Docker Environment

MySQL root password: rootpassword
Database: threat_scope
User: threat_user
Password: threat_password

Security Features

Input Validation: Secure file upload handling
SQL Injection Prevention: Parameterized queries
XSS Protection: Proper output escaping
CSRF Protection: Flask-WTF integration

Troubleshooting

Common Issues

Database Connection Failed
- Check MySQL service is running
- Verify connection credentials
- Ensure database exists
Model Loading Error
- Ensure model files exist in models/ directory
- Check file permissions
Docker Issues
- Run docker-compose down and docker-compose up -d
- Check logs with docker-compose logs

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Key Features Summary

🎯 Enterprise-Grade Security Analytics

12+ Comprehensive Metrics: Model performance, security assessment, operational intelligence
4 Interactive Charts: Vulnerability trends, risk distribution, file analysis, activity monitoring
Professional Dashboard: Real-time threat level assessment and coverage analysis

🔧 AI-Powered Code Mitigation

Automatic Fix Generation: Secure code suggestions for SQL injection, XSS, command injection
Interactive Modal Interface: Beautiful popup with detailed explanations and copy functionality
Multiple Vulnerability Types: Comprehensive coverage of common web vulnerabilities

📊 Model Performance Validation

95.6% Recall: Catches 95.6% of all vulnerabilities (excellent for security applications)
Optimized Threshold: 0.719 threshold provides maximum detection capability
Real-time Metrics: Live display of precision, recall, F1-score in dashboard

Acknowledgments

Training data based on CWE-89 (SQL Injection) vulnerability patterns
Machine learning implementation using scikit-learn
Web interface built with Flask and Tailwind CSS
Interactive charts powered by Chart.js
Code mitigation engine with AI-powered suggestions

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
data/train		data/train
manifests		manifests
models		models
preprocessed		preprocessed
results		results
scripts		scripts
test_samples		test_samples
web		web
.gitattributes		.gitattributes
.gitignore		.gitignore
CWE_COVERAGE.md		CWE_COVERAGE.md
IMPLEMENTATION_SUMMARY.md		IMPLEMENTATION_SUMMARY.md
MITIGATION_TESTING_GUIDE.md		MITIGATION_TESTING_GUIDE.md
README.md		README.md
TESTING_GUIDE.md		TESTING_GUIDE.md
VERIFICATION_CHECKLIST.md		VERIFICATION_CHECKLIST.md
docker-compose.yml		docker-compose.yml
migrate_db.py		migrate_db.py
start.bat		start.bat
start.sh		start.sh
test_mitigation.py		test_mitigation.py
test_setup.py		test_setup.py

proknapz/threat-scope

Folders and files

Latest commit

History

Repository files navigation