Duvent is a real-time voice language learning application designed for "Home Lab" deployment with a path to cloud scalability. It uses a Hybrid Microservices architecture to separate business logic from heavy AI computation.
2. High-Level Architecture Diagram
+-----------------+       +-----------------------+
|                 |       |                       |
|  User / Browser |<----->|  Frontend (Next.js)   |
|                 |       |                       |
+-----------------+       +-----------+-----------+
        ^                             |
        | WS (Audio Stream)           | REST (Auth/State)
        v                             v
+-------------------------------------------------------+
|                   Backend (Golang)                    |
|                  "The Orchestrator"                   |
+-------------------------------------------------------+
        |                   |                     |
   gRPC | (Bidirectional)   | HTTP (JSON)         | TCP (SQL)
        |                   v                     v
        |          +-----------------+     +--------------+
        |          |  Ollama (LLM)   |     |  PostgreSQL  |
        |          |   (Localhost)   |     |              |
        |          +-----------------+     +--------------+
        v
+-----------------------+
|  AI Service (Python)  |
|     "The Worker"      |
|  (Whisper + Kokoro)   |
+-----------------------+
3. Component Details
3.1. Frontend (Next.js)
Responsibilities:
Voice Activity Detection (VAD): Detects speech in the browser so that only voiced audio is streamed, which saves bandwidth.
WebSocket Client: Streams audio chunks to the Go Backend.
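
The two frontend responsibilities can be illustrated with a minimal sketch. It assumes a simple energy-threshold VAD, which is a stand-in chosen for brevity and not necessarily the detector Duvent uses. The names `AudioSink`, `rms`, `isSpeech`, and `streamVoicedChunks`, as well as the `0.02` threshold, are hypothetical and do not come from the project.

```typescript
// Illustrative sketch only: every name here is hypothetical, and the
// energy-threshold VAD is an assumed technique, not Duvent's actual one.

/** The minimal part of the WebSocket interface this sketch relies on. */
interface AudioSink {
  send(data: ArrayBufferView): void;
}

/** Root-mean-square energy of a PCM frame whose samples lie in [-1, 1]. */
function rms(frame: Float32Array): number {
  if (frame.length === 0) return 0;
  let sumSquares = 0;
  for (let i = 0; i < frame.length; i++) {
    sumSquares += frame[i] * frame[i];
  }
  return Math.sqrt(sumSquares / frame.length);
}

/** Treats a frame as speech when its energy reaches the threshold.
 *  The 0.02 default is an assumed, untuned value. */
function isSpeech(frame: Float32Array, threshold: number = 0.02): boolean {
  return rms(frame) >= threshold;
}

/** Forwards only voiced frames to the sink and returns how many were sent. */
function streamVoicedChunks(
  frames: Float32Array[],
  sink: AudioSink,
  threshold: number = 0.02,
): number {
  let sent = 0;
  for (const frame of frames) {
    if (!isSpeech(frame, threshold)) continue; // skip silence to save bandwidth
    sink.send(frame);
    sent++;
  }
  return sent;
}
```

In a browser, the frames would typically come from the Web Audio API and the sink would be a standard `WebSocket` connected to the Go Backend, since `WebSocket.send` accepts typed arrays. The endpoint URL and the audio frame size are not specified here and would need to match what the backend expects.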