PS 11: Real-Time Audio Fraud Detection for Scam Prevention
With the rapid rise of voice-based scams, fraudsters increasingly exploit phone calls to deceive users. Particularly vulnerable groups such as elderly individuals, digitally unaware users, and first-time internet adopters. These scams often involve impersonation, emotional manipulation, urgency tactics, and psychological pressure, making them difficult to detect in real time.
Traditional fraud detection systems primarily focus on post-transaction analysis or text-based signals, offering little to no protection during live phone conversations, where most financial and emotional damage occurs.
There is a critical need for an AI-powered, real-time audio intelligence system that can detect scam patterns as a call is happening and proactively protect users before fraud occurs.
Develop an innovative AI-driven solution that leverages real-time audio analysis and fraud detection to:
- Identify scam or fraudulent phone calls as they occur
- Protect users, especially elderly and vulnerable populations from financial and emotional harm
- Provide timely alerts, guidance, or interventions during suspicious calls
ScamShield - AI-Powered Real-Time Fraud Detection
Team Syndicate Members
๐ Live Application: https://ai-fraud-detection-msv.netlify.app/
๐ฅ Demo Video: https://drive.google.com/drive/folders/1kiyIS_JOgh2pPR_Ot4XRnKb7wZ2DBTZE?usp=sharing
๐ Presentation: https://in.docworkspace.com/d/sICjeusrJArK26MoG?sa=601.1037
ScamShield is an advanced AI-powered system that provides real-time protection against fraud calls. Using sophisticated speech recognition, natural language processing, and machine learning, it helps protect vulnerable users (especially elderly) from financial scams across 8 Indian languages.
- ๐ค Real-time Speech Recognition - Browser-based speech-to-text conversion
- ๐ค AI-Powered ML Analysis - Advanced ensemble model with 75%+ accuracy
- ๐ 8-Language Support - English, Hindi, Telugu, Tamil, Kannada, Malayalam, Marathi, Bengali
- ๐ฑ Elderly-Friendly UI - Large text, clear colors, simple messaging
- โก Instant Alerts - Immediate warnings for high-risk calls
- ๐ Comprehensive Training - 487 real-world fraud patterns
- ๐ Real-time Analysis - Live fraud probability scoring
- ๐ Privacy First - No personal data storage, local processing
Audio Input โ Speech-to-Text โ ML Processing โ Risk Analysis โ Alert Generation
Frontend:
- React.js with modern JavaScript
- Web Speech API for real-time transcription
- Responsive design with CSS3 animations
- Multi-language UI support
Backend:
- Node.js with Express.js
- Python ML integration
- RESTful API architecture
- Real-time fraud detection pipeline
Machine Learning:
- Advanced ensemble model (LogisticRegression + SVM + NaiveBayes + GradientBoosting)
- TF-IDF vectorization with 5000 features
- 4-gram analysis for pattern detection
- 487 comprehensive training samples
| Metric | Value |
|---|---|
| Overall Accuracy | 75.4% |
| Training Samples | 487 |
| Fraud Samples | 225 |
| Legitimate Samples | 262 |
| Features | 5000 TF-IDF |
| N-gram Range | (1, 4) |
- Malayalam: 91.7% fraud detection
- Marathi: 93.9% fraud detection
- Bengali: 90.8% fraud detection
- Telugu: 91.0% fraud detection
- All Languages: 90%+ critical fraud detection
| Language | Script | Code | Status |
|---|---|---|---|
| English | Latin | en-US | โ Active |
| Hindi | Devanagari | hi-IN | โ Active |
| Telugu | Telugu | te-IN | โ Active |
| Tamil | Tamil | ta-IN | โ Active |
| Kannada | Kannada | kn-IN | โ Active |
| Malayalam | Malayalam | ml-IN | โ Active |
| Marathi | Devanagari | mr-IN | โ Active |
| Bengali | Bengali | bn-IN | โ Active |
- Node.js (v14 or higher)
- Python 3.7+ with pip
- Modern web browser with microphone access
- Internet connection for API calls
-
Clone the repository
git clone https://github.com/ByteQuest-2025/GFGBQ-Team-syndicate-members.git cd GFGBQ-Team-syndicate-members/fraud-audio-detection -
Backend Setup
cd backend npm install pip install -r requirements.txt python ml_fraud_detector.py train # Train ML model npm start
Server runs on
http://localhost:5000 -
Frontend Setup
cd frontend npm install npm startApp runs on
http://localhost:3000
- Start Protection - Click "Start Protection" to begin monitoring
- Grant Permissions - Allow microphone access when prompted
- Select Language - Choose your preferred language from dropdown
- Real-time Analysis - System analyzes speech in real-time
- Instant Alerts - Receive immediate warnings for suspicious calls
- Stay Safe - Follow the system's recommendations
POST /api/analyze-text
Content-Type: application/json
{
"transcript": "Your bank account has been blocked share OTP immediately"
}Response:
{
"riskLevel": "Critical",
"scamPercentage": 91,
"confidence": 0.91,
"message": "๐จ CRITICAL SCAM ALERT: Extremely high fraud probability detected!",
"mlPrediction": {
"isFraud": true,
"fraudProbability": 0.91,
"riskLevel": "Critical"
},
"detectedLanguage": "en",
"languageName": "English",
"analysisMethod": "ML-Powered Detection"
}POST /api/train-modelGET /api/languagesPOST /api/emergency-alert
Content-Type: application/json
{
"phoneNumber": "+91-9876543210",
"transcript": "Scam call transcript",
"userLocation": "Mumbai, India"
}- Advanced ML Detection - Ensemble model with multiple algorithms
- Multi-language Analysis - Unicode-aware text processing
- Real-world Patterns - 487 actual fraud techniques
- Privacy First - No personal data storage
- Local Processing - Speech recognition in browser
- Emergency Alerts - Automatic threat logging
| Category | Examples | Risk Level |
|---|---|---|
| Banking Scams | "Account blocked", "Share OTP" | Critical |
| Government Threats | "Police case", "Arrest warrant" | Critical |
| Prize Scams | "Lottery winner", "Processing fee" | High |
| Tech Support | "Computer virus", "Remote access" | High |
| Delivery Scams | "Parcel held", "Customs fee" | Medium |
- Modern Design - Gradient backgrounds and smooth animations
- Accessibility - Large fonts and high contrast for elderly users
- Visual Feedback - Color-coded risk levels (Red/Yellow/Green)
- Responsive - Works on desktop and mobile devices
- Multi-language UI - Native language support
- Intuitive - Simple interface with clear instructions
fraud-audio-detection/
โโโ frontend/
โ โโโ src/
โ โ โโโ App.js # Main React component
โ โ โโโ index.js # Entry point
โ โโโ package.json # Frontend dependencies
โ โโโ package-lock.json
โโโ backend/
โ โโโ server.js # Node.js server
โ โโโ ml_fraud_detector.py # Python ML model
โ โโโ requirements.txt # Python dependencies
โ โโโ package.json # Backend dependencies
โ โโโ setup.bat # Windows setup script
โ โโโ ML_TEST_RESULTS.md # ML performance results
โโโ .gitignore # Git ignore rules
โโโ README.md # This file
- Mobile app development (Android/iOS)
- Voice pattern analysis integration
- Government database integration
- Community reporting features
- Smart home integration
- Advanced ML models (BERT, Transformers)
- Real-time collaboration features
- Blockchain-based fraud reporting
We welcome contributions! Please follow these steps:
- Fork the repository
- Create a feature branch (
git checkout -b feature/AmazingFeature) - Commit your changes (
git commit -m 'Add some AmazingFeature') - Push to the branch (
git push origin feature/AmazingFeature) - Open a Pull Request
Critical Risk Alert |
High Risk Warning |
Medium Risk Caution |
Live Test Results |
This project is licensed under the MIT License - see the LICENSE file for details.
ByteQuest 2025 - GeeksforGeeks Hackathon
- Advanced ML implementation with ensemble models
- Multi-language fraud detection system
- Real-time speech processing
- Production-ready deployment
- Web Speech API for real-time transcription
- scikit-learn for machine learning capabilities
- React.js community for excellent documentation
- Fraud research organizations for pattern data
- Beta testers and elderly user feedback
- GeeksforGeeks for hosting ByteQuest 2025
For support, create an issue in this repository or contact the development team.








