🤖 Database Assistant - Database Chatbot System

Advanced LLM-powered multi-agent system for querying SQLite databases using natural language - a professional web application. This project integrates grounding techniques, secure prompt engineering, and database connectivity to provide a safe and professional database interface. The system supports multilingual queries and can be tested in Turkish & English languages.

📋 Project Purpose and Features

This system allows users to query SQLite databases using natural language without requiring any SQL knowledge. It delivers accurate, reliable, and secure responses through multi-agent architecture and Google Gemini API.

🎯 Core Features

🗣️ Natural Language Processing: Convert Turkish and English queries to SQL
🔒 Security-Focused: Multi-layered security measures and SQL injection protection
📊 Automatic CSV Export: Download query results in CSV format
🌐 Modern Web Interface: React-based responsive user interface
⚡ Real-time Chat: Instant messaging experience
🔄 Multi-Agent System: Reliable results with specialized agents
📱 Mobile Compatible: Responsive design that works on all devices

🏗️ System Architecture

🔧 Backend Architecture

The project uses a 3-tier hybrid architecture:

1. Node.js Express Backend (Port 3001)

backend/
├── app.js              # Main Express application
├── routes/
│   └── api/
│       ├── chat.js     # Chat API endpoints
│       └── auth.js     # Authentication endpoints
├── services/
│   └── pythonBridge.js # Bridge to Python service
└── bin/www            # Server launcher

Responsibilities:

RESTful API endpoints
CORS and security middleware
Proxy function to Python service
Request validation and error handling

2. Python Flask Microservice (Port 5001)

python-service/
├── app.py                # Flask API server
├── chatbot_service.py    # Main chatbot logic
└── calculate_token.py    # Token calculation and rate limiting

Responsibilities:

LLM API integration (Google Gemini)
Multi-agent orchestration
SQLite database operations
CSV file creation and management

3. React Frontend (Port 3000)

frontend/src/
├── App.js        # Main React component
├── App.css       # Styling
└── index.js      # Entry point

Responsibilities:

User interface
Real-time chat experience
CSV download operations
Responsive design

🤖 Multi-Agent Chatbot System

The system uses 3 specialized agents:

Agent Roles

🔍 SQL Agent: Converts natural language questions into safe and valid SQL queries (Structured Output)
📝 Natural Language Agent: Converts JSON database results into user-friendly natural language responses
🎯 Orchestrator Agent: Coordinates agents, manages context, and enforces security policies

🧠 Grounding Techniques and Reliability

The system employs multiple grounding strategies to ensure accurate and secure outputs:

1. 🔄 Multi-Agent System

Agents with specialized roles (SQL generation, Natural Language processing, Orchestration)
Separation of concerns → reduces hallucination risk and improves control

2. 📋 Structured Output (Prompt Engineering)

SQL Agent responses are constrained to predefined JSON schema:

{
  "sql_query": "SELECT SupplierName FROM Suppliers WHERE SupplierID = (SELECT SupplierID FROM Products ORDER BY Price DESC LIMIT 1);",
  "explanation": "Finds the supplier of the highest-priced product."
}

Configuration:

sql_generation_config = {
  "temperature": 0.1,   
  "top_p": 0.95,
  "top_k": 64,
  "max_output_tokens": 8192,
  "response_mime_type": "application/json",
  "response_schema": {
    "type": "object",
    "properties": {
      "sql_query": {"type": "string", "description": "Valid SQLite SELECT query"},
      "explanation": {"type": "string", "description": "Brief explanation of what the query does"}
    },
    "required": ["sql_query"]
  }
}

3. 🎯 Context Injection (Prompt Engineering)

Agents are supported with:

Explicit database schema embedded into prompts
Clear rules for SQL generation and response formatting

database_schema = """
Northwind database schema:
- Categories: CategoryID, CategoryName, Description
- Customers: CustomerID, CustomerName, ContactName, Address, City, PostalCode, Country
- Products: ProductID, ProductName, SupplierID, CategoryID, Unit, Price
# ... other tables
"""

This approach prevents the model from inventing non-existent table or column names.

4. 💾 Real Database Connection

Real-time grounding via actual SQLite database execution
SQL results are fetched directly from the database and converted to JSON
NL Agent uses real query results → no hallucination

🛡️ Security Techniques

Layered Security Approach

🔍 Input Sanitization
- Detects and blocks SQL injection and prompt injection patterns
- Malicious content filtering
✅ SQL Query Validation
- Only accepts safe SELECT statements
- Blocks dangerous commands like DROP, INSERT, UPDATE
🛡️ Multi-Layer Prompt Injection Protection
- Guard lists prevent malicious attempts to manipulate the model
- Context enforcement prevents agents from revealing hidden instructions
🔒 Safe Database Execution
- Queries are validated before execution
- Errors are handled gracefully with user-friendly messages

⚡ Rate Limit Management & Retry Logic

To manage API rate limits and ensure high availability, the system includes automatic retry and token usage monitoring:

api_request_with_retry function catches HTTP 429 (rate limit) errors and retries with exponential backoff
Token tracking:
- count_tokens and get_token_usage monitor prompt and response tokens
- Global thresholds (MAX_TOKENS, CONTEXT_WINDOW, WARNING_THRESHOLD) trigger warnings when limits are approached

Rate Limit Error Handling Example

🌐 Web Interface and Technologies

React-Based Modern Interface

💬 Real-time Chat UI: Instant messaging experience
📊 Smart CSV Export: Automatic download of query results
🔍 Example Queries: Ready-to-use examples
📱 Mobile-Compatible Design: Responsive UI
⚡ Loading States: Loading indicators for user experience
🎨 Modern CSS: Gradients and animations

Why Flask Was Chosen?

Reasons for using Flask in the Python microservice:

🚀 Lightweight and Fast: Minimal overhead, fast API responses
🔧 Flexibility: Easy customization for LLM integration
📚 Rich Ecosystem: Google Generative AI, SQLite, Pandas integration
🐍 Python Advantages: Natural compatibility with AI/ML libraries
⚙️ Microservice Compatibility: Easy integration with Node.js backend
🔄 RESTful API: Clean architecture with standard HTTP endpoints

🔄 System Workflow

graph TD
    A[👤 User Query] --> B[🎯 Orchestrator Agent]
    B --> C[🔍 SQL Agent]
    C --> D[📋 Structured JSON Output]
    D --> E[💾 SQLite Database]
    E --> F[📊 Query Results]
    F --> G[📝 Natural Language Agent]
    G --> H[✅ Secure Response]
    
    B --> I[🛡️ Security Check]
    I --> J[❌ Malicious Content?]
    J -->|Yes| K[🚫 Block]
    J -->|No| C
    
    E --> L[📈 CSV Export]
    L --> M[💾 Automatic Save]

Detailed Workflow:

User Query → Sent from React frontend
Node.js Backend → Routes request to Python microservice
Orchestrator Agent → Security check and routing
SQL Agent → Converts natural language to SQL in JSON format
SQLite Database → Real query execution
NL Agent → Converts JSON results to natural language
CSV Export → Results automatically saved to CSV

🎯 Core Features Detail

🗣️ Natural Language → SQL Translation: Turkish and English support
📋 Structured Output: Safe and parseable queries
🎯 Context-Aware Querying: Context awareness through prompt engineering
📊 Automatic CSV Export: Instant download of results
🔒 Secure & Reliable Responses: Protected with multi-layer security

🚀 Installation and Usage

System Requirements

Backend (Node.js):

npm install express cors axios dotenv morgan express-validator jsonwebtoken uuid

Python Microservice:

pip install flask flask-cors google-generativeai python-dotenv pandas sqlite3

Frontend (React):

npm install react react-dom axios

Environment Setup

Create .env file:

# Google Gemini API
GEMINIAPI=your_gemini_api_key

# Database
DB_PATH=./Northwind.db

# Service URLs
FRONTEND_URL=http://localhost:3000
PYTHON_SERVICE_URL=http://localhost:5001

Running the Application

Start Python Microservice:

cd python-service
python app.py
# Runs on port 5001

Start Node.js Backend:

cd backend
npm start
# Runs on port 3001

Start React Frontend:

cd frontend
npm start
# Runs on port 3000

Usage Examples

🇹🇷 Turkish:
"En pahalı ürünün tedarikçisi kim?" (Who is the supplier of the most expensive product?)
→ En yüksek fiyatlı ürün Côte de Blaye ve tedarikçisi Aux joyeux ecclésiastiques.

"Beverages kategorisindeki tüm ürünleri göster" (Show all products in Beverages category)
→ Beverages kategorisindeki ürünler: Chai, Chang, Guaraná Fantástica...

🇺🇸 English:
"Show all customers from Germany"
→ Here are all customers from Germany: Alfreds Futterkiste, Blauer See Delikatessen...

🔧 Technical Details

🤖 LLM: Google Gemini 2.5 Pro (Structured Output + Context Injection)
💾 Database: SQLite with schema-level validation
🛡️ Security: Multi-layer protection (sanitization, validation, filtering)
🎯 Grounding: Real database connection prevents hallucination
🌐 Frontend: React 19.1.1 + Modern CSS
⚡ Backend: Node.js Express + Flask microservice
📊 Export: CSV generation with Pandas

📁 Project Structure

Database-Assistant/
├── 📁 backend/                    # Node.js Express API
│   ├── app.js                     # Main Express application
│   ├── routes/api/
│   │   ├── chat.js                # Chat endpoints
│   │   └── auth.js                # Auth endpoints
│   ├── services/
│   │   └── pythonBridge.js        # Python service bridge
│   └── package.json
│
├── 📁 frontend/                   # React Web Interface
│   ├── src/
│   │   ├── App.js                 # Main React component
│   │   ├── App.css                # Styling
│   │   └── index.js               # Entry point
│   └── package.json
│
├── 📁 python-service/             # Flask Microservice
│   ├── app.py                     # Flask API server
│   ├── chatbot_service.py         # Main chatbot logic
│   ├── calculate_token.py         # Token management
│   └── query_results/             # CSV outputs
│
├── 📄 Northwind.db               # SQLite database
├── 📄 .env                       # Environment variables
└── 📄 README.md                  # This documentation

🌟 What Makes This Project Unique

🔗 Hybrid Architecture: Multi-Agent LLM Architecture + real database grounding
🛡️ Advanced Security: State-of-the-art security techniques
📋 Structured Output: Reliable responses with Context Injection
📊 Smart Analytics: Automatic CSV export and data analysis
🌐 Modern Web Stack: React + Node.js + Flask microservice architecture
🗣️ Multilingual Support: Turkish and English natural language processing
⚡ Real-time Experience: WebSocket-like fast response times

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
backend		backend
frontend		frontend
python-service		python-service
.gitignore		.gitignore
API_REQUEST_ERROR.jpg		API_REQUEST_ERROR.jpg
Northwind.db		Northwind.db
README.md		README.md
chat_bot test prompt.txt		chat_bot test prompt.txt
demo_1.jpg		demo_1.jpg
demo_2.jpg		demo_2.jpg
env.example		env.example

yavuzssvr19/Database-Assistant

Folders and files

Latest commit

History

Repository files navigation

🤖 Database Assistant - Database Chatbot System

📋 Project Purpose and Features

🎯 Core Features

🏗️ System Architecture

🔧 Backend Architecture

1. Node.js Express Backend (Port 3001)

2. Python Flask Microservice (Port 5001)

3. React Frontend (Port 3000)

🤖 Multi-Agent Chatbot System

Agent Roles

🧠 Grounding Techniques and Reliability

1. 🔄 Multi-Agent System

2. 📋 Structured Output (Prompt Engineering)

3. 🎯 Context Injection (Prompt Engineering)

4. 💾 Real Database Connection

🛡️ Security Techniques

Layered Security Approach

⚡ Rate Limit Management & Retry Logic

Rate Limit Error Handling Example

🌐 Web Interface and Technologies

React-Based Modern Interface

Why Flask Was Chosen?

🔄 System Workflow

🎯 Core Features Detail

🚀 Installation and Usage

System Requirements

Backend (Node.js):

Python Microservice:

Frontend (React):

Environment Setup

Running the Application

Usage Examples

🔧 Technical Details

📁 Project Structure

🌟 What Makes This Project Unique

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages