🛡️ ScamShield - AI-Powered Real-Time Fraud Detection

1️⃣ Problem Statement

PS 11: Real-Time Audio Fraud Detection for Scam Prevention

With the rapid rise of voice-based scams, fraudsters increasingly exploit phone calls to deceive users. Particularly vulnerable groups such as elderly individuals, digitally unaware users, and first-time internet adopters. These scams often involve impersonation, emotional manipulation, urgency tactics, and psychological pressure, making them difficult to detect in real time.

Traditional fraud detection systems primarily focus on post-transaction analysis or text-based signals, offering little to no protection during live phone conversations, where most financial and emotional damage occurs.

There is a critical need for an AI-powered, real-time audio intelligence system that can detect scam patterns as a call is happening and proactively protect users before fraud occurs.

Objective

Develop an innovative AI-driven solution that leverages real-time audio analysis and fraud detection to:

Identify scam or fraudulent phone calls as they occur
Protect users, especially elderly and vulnerable populations from financial and emotional harm
Provide timely alerts, guidance, or interventions during suspicious calls

2️⃣ Project Name

ScamShield - AI-Powered Real-Time Fraud Detection

3️⃣ Team Name

Team Syndicate Members

4️⃣ Deployed Link

🌐 Live Application: https://ai-fraud-detection-msv.netlify.app/

5️⃣ 2-Minute Demonstration Video Link

🎥 Demo Video: https://drive.google.com/drive/folders/1kiyIS_JOgh2pPR_Ot4XRnKb7wZ2DBTZE?usp=sharing

6️⃣ PPT Link

📊 Presentation: https://in.docworkspace.com/d/sICjeusrJArK26MoG?sa=601.1037

🚀 Overview

ScamShield is an advanced AI-powered system that provides real-time protection against fraud calls. Using sophisticated speech recognition, natural language processing, and machine learning, it helps protect vulnerable users (especially elderly) from financial scams across 8 Indian languages.

✨ Key Features

🎤 Real-time Speech Recognition - Browser-based speech-to-text conversion
🤖 AI-Powered ML Analysis - Advanced ensemble model with 75%+ accuracy
🌍 8-Language Support - English, Hindi, Telugu, Tamil, Kannada, Malayalam, Marathi, Bengali
📱 Elderly-Friendly UI - Large text, clear colors, simple messaging
⚡ Instant Alerts - Immediate warnings for high-risk calls
📊 Comprehensive Training - 487 real-world fraud patterns
🕒 Real-time Analysis - Live fraud probability scoring
🔒 Privacy First - No personal data storage, local processing

🏗️ Architecture

Audio Input → Speech-to-Text → ML Processing → Risk Analysis → Alert Generation

Technology Stack

Frontend:

React.js with modern JavaScript
Web Speech API for real-time transcription
Responsive design with CSS3 animations
Multi-language UI support

Backend:

Node.js with Express.js
Python ML integration
RESTful API architecture
Real-time fraud detection pipeline

Machine Learning:

Advanced ensemble model (LogisticRegression + SVM + NaiveBayes + GradientBoosting)
TF-IDF vectorization with 5000 features
4-gram analysis for pattern detection
487 comprehensive training samples

🤖 ML Model Performance

Metric	Value
Overall Accuracy	75.4%
Training Samples	487
Fraud Samples	225
Legitimate Samples	262
Features	5000 TF-IDF
N-gram Range	(1, 4)

Language-Specific Performance

Malayalam: 91.7% fraud detection
Marathi: 93.9% fraud detection
Bengali: 90.8% fraud detection
Telugu: 91.0% fraud detection
All Languages: 90%+ critical fraud detection

🌍 Multi-Language Support

Language	Script	Code	Status
English	Latin	en-US	✅ Active
Hindi	Devanagari	hi-IN	✅ Active
Telugu	Telugu	te-IN	✅ Active
Tamil	Tamil	ta-IN	✅ Active
Kannada	Kannada	kn-IN	✅ Active
Malayalam	Malayalam	ml-IN	✅ Active
Marathi	Devanagari	mr-IN	✅ Active
Bengali	Bengali	bn-IN	✅ Active

🚀 Quick Start

Prerequisites

Node.js (v14 or higher)
Python 3.7+ with pip
Modern web browser with microphone access
Internet connection for API calls

Installation

Clone the repository

git clone https://github.com/ByteQuest-2025/GFGBQ-Team-syndicate-members.git
cd GFGBQ-Team-syndicate-members/fraud-audio-detection

Backend Setup

cd backend
npm install
pip install -r requirements.txt
python ml_fraud_detector.py train  # Train ML model
npm start

Server runs on http://localhost:5000

Frontend Setup
```
cd frontend
npm install
npm start
```
App runs on http://localhost:3000

🎯 Usage

Start Protection - Click "Start Protection" to begin monitoring
Grant Permissions - Allow microphone access when prompted
Select Language - Choose your preferred language from dropdown
Real-time Analysis - System analyzes speech in real-time
Instant Alerts - Receive immediate warnings for suspicious calls
Stay Safe - Follow the system's recommendations

🔍 API Documentation

Analyze Text

POST /api/analyze-text
Content-Type: application/json

{
  "transcript": "Your bank account has been blocked share OTP immediately"
}

Response:

{
  "riskLevel": "Critical",
  "scamPercentage": 91,
  "confidence": 0.91,
  "message": "🚨 CRITICAL SCAM ALERT: Extremely high fraud probability detected!",
  "mlPrediction": {
    "isFraud": true,
    "fraudProbability": 0.91,
    "riskLevel": "Critical"
  },
  "detectedLanguage": "en",
  "languageName": "English",
  "analysisMethod": "ML-Powered Detection"
}

Train Model

POST /api/train-model

Get Supported Languages

GET /api/languages

Emergency Alert

POST /api/emergency-alert
Content-Type: application/json

{
  "phoneNumber": "+91-9876543210",
  "transcript": "Scam call transcript",
  "userLocation": "Mumbai, India"
}

🛡️ Security Features

Advanced ML Detection - Ensemble model with multiple algorithms
Multi-language Analysis - Unicode-aware text processing
Real-world Patterns - 487 actual fraud techniques
Privacy First - No personal data storage
Local Processing - Speech recognition in browser
Emergency Alerts - Automatic threat logging

📊 Fraud Detection Categories

Category	Examples	Risk Level
Banking Scams	"Account blocked", "Share OTP"	Critical
Government Threats	"Police case", "Arrest warrant"	Critical
Prize Scams	"Lottery winner", "Processing fee"	High
Tech Support	"Computer virus", "Remote access"	High
Delivery Scams	"Parcel held", "Customs fee"	Medium

🎨 UI/UX Highlights

Modern Design - Gradient backgrounds and smooth animations
Accessibility - Large fonts and high contrast for elderly users
Visual Feedback - Color-coded risk levels (Red/Yellow/Green)
Responsive - Works on desktop and mobile devices
Multi-language UI - Native language support
Intuitive - Simple interface with clear instructions

📁 Project Structure

fraud-audio-detection/
├── frontend/
│   ├── src/
│   │   ├── App.js          # Main React component
│   │   └── index.js        # Entry point
│   ├── package.json        # Frontend dependencies
│   └── package-lock.json
├── backend/
│   ├── server.js           # Node.js server
│   ├── ml_fraud_detector.py # Python ML model
│   ├── requirements.txt    # Python dependencies
│   ├── package.json        # Backend dependencies
│   ├── setup.bat          # Windows setup script
│   └── ML_TEST_RESULTS.md # ML performance results
├── .gitignore             # Git ignore rules
└── README.md              # This file

🔮 Future Roadmap

Mobile app development (Android/iOS)
Voice pattern analysis integration
Government database integration
Community reporting features
Smart home integration
Advanced ML models (BERT, Transformers)
Real-time collaboration features
Blockchain-based fraud reporting

🤝 Contributing

We welcome contributions! Please follow these steps:

Fork the repository
Create a feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

📸 Live Application Screenshots

🏠 Main Dashboard

Clean, elderly-friendly interface with intuitive controls

🌐 Multi-Language Support

8 Indian languages with native script support

🎤 Real-Time Speech Processing

Live speech-to-text with instant fraud analysis

🚨 Fraud Detection System

Critical Risk Alert	High Risk Warning
Medium Risk Caution	Live Test Results

🤖 ML Model Training & Performance

ML model training with 487 samples achieving 75.4% accuracy

Comprehensive performance metrics and validation results

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🏆 Team Syndicate Members

ByteQuest 2025 - GeeksforGeeks Hackathon

Advanced ML implementation with ensemble models
Multi-language fraud detection system
Real-time speech processing
Production-ready deployment

🙏 Acknowledgments

Web Speech API for real-time transcription
scikit-learn for machine learning capabilities
React.js community for excellent documentation
Fraud research organizations for pattern data
Beta testers and elderly user feedback
GeeksforGeeks for hosting ByteQuest 2025

📞 Support

For support, create an issue in this repository or contact the development team.

🏆 ByteQuest 2025 - GeeksforGeeks Hackathon

🛡️ Protecting millions from fraud, one call at a time

Built with ❤️ by Team Syndicate Members

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
backend		backend
frontend		frontend
screenshot pics		screenshot pics
.gitignore		.gitignore
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

🛡️ ScamShield - AI-Powered Real-Time Fraud Detection

1️⃣ Problem Statement

Objective

2️⃣ Project Name

3️⃣ Team Name

4️⃣ Deployed Link

5️⃣ 2-Minute Demonstration Video Link

6️⃣ PPT Link

🚀 Overview

✨ Key Features

🏗️ Architecture

Technology Stack

🤖 ML Model Performance

Language-Specific Performance

🌍 Multi-Language Support

🚀 Quick Start

Prerequisites

Installation

🎯 Usage

🔍 API Documentation

Analyze Text

Train Model

Get Supported Languages

Emergency Alert

🛡️ Security Features

📊 Fraud Detection Categories

🎨 UI/UX Highlights

📁 Project Structure

🔮 Future Roadmap

🤝 Contributing

📸 Live Application Screenshots

🏠 Main Dashboard

🌐 Multi-Language Support

🎤 Real-Time Speech Processing

🚨 Fraud Detection System

🤖 ML Model Training & Performance

📄 License

🏆 Team Syndicate Members

🙏 Acknowledgments

📞 Support

🏆 ByteQuest 2025 - GeeksforGeeks Hackathon

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages