SentinelVNC 🛡️

AI-Driven Defense and Monitoring Platform for VNC Data Exfiltration

SentinelVNC detects and contains data exfiltration attacks in VNC sessions through hybrid rule-based and ML detection, with blockchain-anchored forensic evidence.

🎯 Overview

SentinelVNC monitors VNC (Virtual Network Computing) sessions for:

Clipboard Abuse: Large clipboard operations indicating data exfiltration
Screenshot Scraping: Rapid screenshot capture patterns
File Exfiltration: Unusual file transfer activities

The system uses a hybrid approach combining:

Rule-based detection (3 core rules with low false-positive rates)
ML-based anomaly detection (RandomForest with SHAP explainability)
Blockchain anchoring (Merkle tree-based forensic evidence)

🚀 Quick Start

Prerequisites

Python 3.10 or later (3.11 preferred)
Linux/macOS (tested on macOS, should work on Linux)
2GB+ RAM
Internet connection (for initial package installation)

Installation

# Clone or navigate to the repository
cd /path/to/SentinelVNC

# Create virtual environment
python3 -m venv venv  # or python3.11 if available

# Activate virtual environment
source venv/bin/activate  # On macOS/Linux
# On Windows, run this instead:
# venv\Scripts\activate

# Install dependencies
pip install --upgrade pip
pip install -r requirements.txt

# Create necessary directories
mkdir -p data/synthetic models logs forensic anchors

Run Complete Demo

# Make script executable
chmod +x run_demo.sh

# Run the demo (trains model, simulates attacks, detects, anchors, launches dashboard)
./run_demo.sh

The script will:

Train the ML model (if not already trained)
Clear old simulation data
Generate synthetic attack events
Run the detector to identify threats
Create blockchain anchors from forensic evidence
Launch the Streamlit dashboard

📁 Project Structure

SentinelVNC/
├── attack_simulator.py      # Generates synthetic VNC attack events
├── detector.py               # Hybrid rule-based + ML detection engine
├── train_model.py            # ML model training with SHAP
├── streamlit_app.py          # Real-time monitoring dashboard
├── merkle_anchor.py          # Blockchain anchoring (Merkle tree)
├── run_demo.sh               # End-to-end demo orchestration
├── requirements.txt           # Python dependencies
├── README.md                 # This file
├── DEMO_SCRIPT.md            # Demo presentation script
├── SLIDES.md                 # 6-slide presentation outline
├── FAQ.md                    # FAQ for judges
├── DEVELOPMENT_PLAN.md       # Development plan
├── data/
│   └── synthetic/            # Generated attack events
├── models/                   # Trained ML models
├── logs/                     # Detection alerts
├── forensic/                 # Forensic JSON records
└── anchors/                  # Blockchain anchor files

🔧 Component Details

1. Attack Simulator (`attack_simulator.py`)

Generates synthetic VNC events to simulate attacks:

Scenarios:

normal: Normal user activity
clipboard_abuse: Large clipboard operations
screenshot_scraping: Rapid screenshot capture
file_exfiltration: Large file transfers
mixed: Combination of all attacks

Usage:

python attack_simulator.py

Output: data/synthetic/vnc_events.jsonl (JSONL format, one event per line)
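
Because the output is JSONL, each line can be parsed on its own. The following is a minimal sketch of reading the file back, assuming only that every non-empty line is one JSON object. The helper name `read_events` is hypothetical and not part of the repository.

```python
import json
from pathlib import Path
from typing import Iterator


def read_events(path: Path) -> Iterator[dict]:
    """Yield one parsed event per non-empty line of a JSONL file."""
    with path.open("r", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # tolerate blank lines, e.g. a trailing newline
                yield json.loads(line)


if __name__ == "__main__":
    events_file = Path("data/synthetic/vnc_events.jsonl")
    if events_file.exists():
        print(sum(1 for _ in read_events(events_file)), "events loaded")
```

Reading lazily with a generator keeps memory use flat even when the simulator produces a large file.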

2. Detector (`detector.py`)

Hybrid detection engine with 3 core rules:

Rule 1: Clipboard Size Threshold

Alerts if clipboard operation > 200KB
Reason: Large clipboard operations indicate bulk data exfiltration
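
As an illustration only, this rule reduces to a single comparison. The sketch below assumes sizes are measured in bytes and that "200KB" means 200 × 1024 bytes; the names `CLIPBOARD_LIMIT_BYTES` and `clipboard_rule` are hypothetical and may not match `detector.py`.

```python
# Assumption: "200KB" is read as 200 KiB; the project may use 200,000 bytes instead.
CLIPBOARD_LIMIT_BYTES = 200 * 1024


def clipboard_rule(size_bytes: int, limit: int = CLIPBOARD_LIMIT_BYTES) -> bool:
    """Return True when a clipboard operation is strictly larger than the limit."""
    return size_bytes > limit
```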

Rule 2: Screenshot Burst

Alerts if 5+ screenshots within 10 seconds
Reason: Rapid screenshot capture suggests scraping
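
One common way to implement a burst rule like this is a sliding window over recent timestamps. The sketch below is a hedged illustration, not the code in `detector.py`: the class name `ScreenshotBurstRule` is hypothetical, timestamps are assumed to be seconds in non-decreasing order, and two screenshots exactly 10 seconds apart are treated as inside the same window.

```python
from collections import deque


class ScreenshotBurstRule:
    """Sliding-window counter that fires on `max_count` screenshots within `window_s` seconds."""

    def __init__(self, max_count: int = 5, window_s: float = 10.0) -> None:
        self.max_count = max_count
        self.window_s = window_s
        self._timestamps: deque[float] = deque()

    def observe(self, timestamp: float) -> bool:
        """Record one screenshot time (in seconds) and report whether the rule fires."""
        self._timestamps.append(timestamp)
        # Drop screenshots that fell out of the window ending at `timestamp`.
        while timestamp - self._timestamps[0] > self.window_s:
            self._timestamps.popleft()
        return len(self._timestamps) >= self.max_count
```

Keeping only the timestamps inside the window means each event costs amortized constant time, which suits per-event detection.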

Rule 3: File Transfer

Alerts if file > 50MB OR 2+ large files within 30 seconds
Reason: Unusual file transfer patterns
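
The two conditions of this rule can be sketched as a size check combined with a sliding window. This is an illustration under stated assumptions: the README does not say what counts as a "large" file, so the `large_bytes` default of 10 MiB below is a placeholder, "MB" is read as MiB, and the class name `FileTransferRule` is hypothetical.

```python
from collections import deque

MB = 1024 * 1024  # assumption: "MB" is read as MiB


class FileTransferRule:
    """Fires on one file above `single_limit`, or `max_large` large files within `window_s`."""

    def __init__(
        self,
        single_limit: int = 50 * MB,
        large_bytes: int = 10 * MB,  # placeholder: the README does not define "large"
        max_large: int = 2,
        window_s: float = 30.0,
    ) -> None:
        self.single_limit = single_limit
        self.large_bytes = large_bytes
        self.max_large = max_large
        self.window_s = window_s
        self._large_times: deque[float] = deque()

    def observe(self, timestamp: float, size_bytes: int) -> bool:
        """Record one transfer (time in seconds, size in bytes) and report whether the rule fires."""
        if size_bytes >= self.large_bytes:
            self._large_times.append(timestamp)
        # Forget large transfers older than the window ending at `timestamp`.
        while self._large_times and timestamp - self._large_times[0] > self.window_s:
            self._large_times.popleft()
        too_big = size_bytes > self.single_limit
        too_many = len(self._large_times) >= self.max_large
        return too_big or too_many
```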

ML Detection:

Uses trained RandomForest model
Anomaly score threshold: 0.5
Features: event type, sizes, temporal patterns, history
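
To make the threshold concrete, here is a minimal sketch of turning a classifier's probability output into an alert decision. It assumes a binary model whose classes are ordered [normal, attack], so that column 1 of `predict_proba` is the anomaly score, and that a score equal to the threshold counts as an alert. The function name `ml_flags_event` is hypothetical.

```python
from typing import Protocol, Sequence


class ProbabilisticModel(Protocol):
    """Anything with a scikit-learn style predict_proba, e.g. RandomForestClassifier."""

    def predict_proba(self, X: Sequence[Sequence[float]]) -> Sequence[Sequence[float]]: ...


def ml_flags_event(
    model: ProbabilisticModel,
    features: Sequence[float],
    threshold: float = 0.5,
) -> tuple[float, bool]:
    """Return (anomaly score, alert flag) for a single feature vector."""
    # Assumption: classes are ordered [normal, attack], so index 1 is the attack probability.
    score = float(model.predict_proba([list(features)])[0][1])
    return score, score >= threshold
```

Returning the score alongside the flag lets the caller log how close an event was to the threshold.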

Usage:

python detector.py

Output:

logs/alerts.jsonl: All detected alerts
forensic/*.json: Forensic records for each alert

3. ML Training (`train_model.py`)

Trains a lightweight RandomForest classifier:

Features:

Event type encoding (clipboard/screenshot/file_transfer)
Size features (normalized)
Temporal features (time of day)
History features (recent activity counts)
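
The four feature groups above could be assembled into one vector roughly as follows. This is a sketch under assumptions: the event keys `type`, `size_bytes`, and `timestamp` (UNIX seconds), the normalisation constant `SIZE_SCALE_BYTES`, and the function name `extract_features` are all hypothetical, and the real `train_model.py` may encode features differently.

```python
from datetime import datetime, timezone

EVENT_TYPES = ("clipboard", "screenshot", "file_transfer")
SIZE_SCALE_BYTES = 50 * 1024 * 1024  # hypothetical normalisation constant


def extract_features(event: dict, recent_count: int) -> list[float]:
    """Build [type one-hot (3), normalized size, time of day, recent activity count]."""
    # One-hot encoding of the event type.
    one_hot = [1.0 if event["type"] == name else 0.0 for name in EVENT_TYPES]
    # Size scaled by a fixed constant and clipped to at most 1.0.
    size = min(event.get("size_bytes", 0) / SIZE_SCALE_BYTES, 1.0)
    # Time of day as a fraction of 24 hours, from a UNIX timestamp read as UTC.
    moment = datetime.fromtimestamp(event["timestamp"], tz=timezone.utc)
    seconds = moment.hour * 3600 + moment.minute * 60 + moment.second
    time_of_day = seconds / 86400
    return one_hot + [size, time_of_day, float(recent_count)]
```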

Explainability:

SHAP values for feature importance
Feature importance rankings
Saved to models/shap_data.json

Usage:

python train_model.py

Output:

models/detection_model.pkl: Trained model
models/model_metadata.json: Model metadata
models/shap_data.json: SHAP explainability data

4. Streamlit Dashboard (`streamlit_app.py`)

Real-time monitoring dashboard with:

Live alerts feed
Detection analysis (charts and statistics)
Forensic timeline
Blockchain anchors viewer
Containment button (simulated)

Usage:

streamlit run streamlit_app.py

Access: Dashboard opens at http://localhost:8501

5. Blockchain Anchoring (`merkle_anchor.py`)

Creates Merkle tree from forensic events:

Process:

Collects all forensic JSON files
Computes SHA-256 hash of each file
Builds Merkle tree
Generates root hash
Signs anchor with signature hash
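
The hashing and tree-building steps can be sketched with the standard library alone. This is an illustration, not the code in `merkle_anchor.py`: it assumes files are processed in sorted order, that an odd node is paired with a copy of itself, and that parent hashes are SHA-256 over the concatenated raw child digests. The function names are hypothetical, and the signing step is not covered because the README does not describe its scheme.

```python
import hashlib
from pathlib import Path


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of `data` as a hex string."""
    return hashlib.sha256(data).hexdigest()


def merkle_root(leaf_hashes: list[str]) -> str:
    """Fold hex-encoded leaf hashes pairwise until a single root remains."""
    if not leaf_hashes:
        raise ValueError("at least one leaf is required")
    level = list(leaf_hashes)
    while len(level) > 1:
        if len(level) % 2 == 1:
            level.append(level[-1])  # assumption: duplicate the last node on odd levels
        level = [
            sha256_hex(bytes.fromhex(level[i]) + bytes.fromhex(level[i + 1]))
            for i in range(0, len(level), 2)
        ]
    return level[0]


def root_for_directory(directory: Path) -> str:
    """Hash every *.json file in `directory` and return the Merkle root."""
    files = sorted(directory.glob("*.json"))  # sorted so the root is deterministic
    return merkle_root([sha256_hex(f.read_bytes()) for f in files])
```

Changing any byte in any forensic file changes its leaf hash and therefore the root, which is what makes later verification possible.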

Usage:

python merkle_anchor.py

Output: anchors/*.json (anchor metadata with Merkle root)

Verification:

from pathlib import Path

from merkle_anchor import ForensicAnchoring

anchorer = ForensicAnchoring()
anchorer.verify_anchor(Path("anchors/ANCHOR_123.json"))

🧪 Testing Individual Components

Test Attack Simulator

python attack_simulator.py
# Check: data/synthetic/vnc_events.jsonl

Test Detector

# First generate events
python attack_simulator.py

# Then run detector
python detector.py
# Check: logs/alerts.jsonl, forensic/*.json

Test ML Training

python train_model.py
# Check: models/detection_model.pkl

Test Anchoring

# First generate alerts (creates forensic files)
python attack_simulator.py
python detector.py

# Then create anchor
python merkle_anchor.py
# Check: anchors/*.json

📊 Demo Flow

1. Setup (30 seconds)
   - Show project structure
   - Explain hybrid detection approach
2. Attack Simulation (20 seconds)
   - Run attack_simulator.py with mixed scenario
   - Show generated events
3. Detection (30 seconds)
   - Run detector.py
   - Show alerts with explainable reasons
   - Highlight rule-based + ML detection
4. Forensic Anchoring (20 seconds)
   - Run merkle_anchor.py
   - Show Merkle root and verification
5. Dashboard (30 seconds)
   - Launch Streamlit dashboard
   - Show live alerts, analysis, anchors
   - Demonstrate containment button

Total: ~2 minutes

🔒 Security & Privacy

Simulated attacks only: All attack patterns are synthetic and benign
No real VNC data: System works with simulated events
Air-gapped compatible: No cloud dependencies, runs entirely locally
Forensic integrity: Merkle tree anchoring makes tampering with evidence detectable

🛠️ Troubleshooting

Issue: Model not found

Solution: Run `python train_model.py` first

Issue: No alerts detected

Solution: Ensure events are generated: `python attack_simulator.py`

Issue: Dashboard shows no data

Solution: Run the full demo: `./run_demo.sh`

Issue: Import errors

Solution: Ensure the virtual environment is activated and the requirements are installed

Issue: Permission denied on run_demo.sh

Solution: `chmod +x run_demo.sh`

📈 Performance

Model training: ~10-30 seconds (2000 samples)
Detection latency: <100ms per event
Dashboard refresh: 1-5 seconds (configurable)
Memory usage: ~200-500MB
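
These figures depend on hardware and event volume. A minimal sketch for checking the per-event latency on your own machine is shown below, assuming you can wrap the detector in a callable that accepts one event dictionary. The name `mean_latency_ms` and the `detect` parameter are hypothetical and not part of the repository.

```python
import time
from statistics import mean
from typing import Callable, Iterable


def mean_latency_ms(detect: Callable[[dict], object], events: Iterable[dict]) -> float:
    """Return the mean wall-clock time per call to `detect`, in milliseconds."""
    durations = []
    for event in events:
        start = time.perf_counter()
        detect(event)
        durations.append((time.perf_counter() - start) * 1000.0)
    if not durations:
        raise ValueError("no events to time")
    return mean(durations)
```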

See also:

DEMO_SCRIPT.md: Step-by-step demo script
FAQ.md: Answers to common questions

👤 Author

Priyanshu Mishra

🙏 Acknowledgments

scikit-learn for ML capabilities
Streamlit for dashboard framework
SHAP for explainability
