voice-agent

🎙️ Voice Agent – RAG + Voice Chatbot

This application is a voice-enabled AI assistant that lets you talk to your documents.
Upload your PDFs or text files, and simply speak your question – the bot will transcribe your voice, retrieve the most relevant answer from your knowledge base using RAG (Retrieval-Augmented Generation), and reply back in a natural, human-like voice.

✨ Features

🎤 Voice Input: Speak instead of typing – powered by OpenAI Whisper.
📚 Document Knowledge Base: Upload PDFs or TXT files to build a searchable knowledge base.
🔍 Retrieval-Augmented Generation: Finds the most relevant info from your documents before answering.
🗣️ Text-to-Speech: Natural-sounding audio replies using Kokoro TTS.
⚡ Real-time Interaction: Smooth and quick responses in a friendly chat interface.

🛠️ Tech Stack

Frontend/UI: Streamlit
Speech-to-Text (ASR): Whisper
Text-to-Speech (TTS): Kokoro
Document Processing & RAG: LangChain + Vector Stores
Backend Language: Python3.10

🚀 How It Works

Upload Documents – PDF or TXT files via the sidebar.
Process Knowledge Base – Files are chunked, embedded, and stored in a vector database.
Ask via Voice – Speak your query into the mic.
RAG Retrieval – Finds and ranks relevant chunks from your uploaded content.
Answer Generation – Summarizes and formats the best answer.
Voice Response – Converts the answer into natural speech and plays it.

📦 Installation

git clone https://github.com/yourusername/voice-agent.git
cd voice-agent
pip install -r requirements.txt

▶️ Run the App

streamlit run main.py

📌 Notes

Make sure you have FFmpeg installed for Whisper.
Supports multiple files and multiple queries in a session.
Best used with clear audio for optimal transcription accuracy.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
src		src
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

voice-agent

🎙️ Voice Agent – RAG + Voice Chatbot

✨ Features

🛠️ Tech Stack

🚀 How It Works

📦 Installation

▶️ Run the App

📌 Notes

About

Uh oh!

Releases

Packages

Languages

Deepan-mn/voice-agent

Folders and files

Latest commit

History

Repository files navigation

voice-agent

🎙️ Voice Agent – RAG + Voice Chatbot

✨ Features

🛠️ Tech Stack

🚀 How It Works

📦 Installation

▶️ Run the App

📌 Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages