Realtime Language Tutoring System

A comprehensive, multi-language learning application that combines OpenAI's Realtime API with intelligent spaced repetition system (SRS) memory modeling to create personalized, adaptive language tutoring conversations. The system uses Neo4j graph database to track vocabulary progress and morphological understanding, informing real-time conversation agents about optimal learning content.

About This Application

This application demonstrates advanced patterns for voice-based language learning agents, featuring:

Intelligent Conversation Agents: Using OpenAI Realtime API for natural, low-latency voice interactions
Spaced Repetition System: Neo4j-powered vocabulary tracking with morphological analysis
Adaptive Learning: Real-time conversation adjustment based on user progress and SRS data
Multi-Language Support: Configurable morphological analysis for multiple languages (Russian, Spanish, extensible)
Dual-Agent Architecture: Chat agents for conversation + supervisor agents for learning analysis

The system tracks vocabulary usage, grammatical form accuracy, and learning patterns to create personalized tutoring experiences that adapt to each user's proficiency level and learning needs.

Architecture Overview

Core Components

Language Tutor Agent (languageTutorSupervisor.ts): Handles real-time conversation with SRS-informed vocabulary selection
Learning Analysis Agent (learningSupervisor.ts): Analyzes conversation turns for vocabulary learning insights
Neo4j SRS System (lib/neo4j/srs.ts): Tracks vocabulary progress with embedded form statistics
Language Configuration System (lib/languages/): Configurable morphological analysis for multiple languages

Database Schema

The system uses a simplified Neo4j schema with embedded form statistics:

User -[:HAS_PROGRESS]-> LearningProgress -[:ABOUT]-> Lexeme

LearningProgress nodes contain:

srsLevel: Spaced repetition level (1-5)
overallSuccessRate: Overall word mastery percentage
formStats: JSON object tracking individual conjugated/declined forms
weakestForms: Array of forms needing more practice
nextReview: When the word is due for review

Setup

Prerequisites

Node.js 18+
Neo4j instance (local or cloud)
OpenAI API key

Installation

Clone the repository:

git clone <repository-url>
cd realtime-agents-language-tutor

Install dependencies:

npm install

Configure environment variables:

cp .env.example .env

Edit .env with your credentials:

OPENAI_API_KEY=your_openai_api_key_here
NEO4J_URI=bolt://localhost:7687
NEO4J_USERNAME=neo4j
NEO4J_PASSWORD=your_password

Initialize Neo4j database:

npm run dev
# Navigate to http://localhost:3000/api/neo4j/init

Start the development server:

npm run dev

Open http://localhost:3000 and select "Language Tutor" from the scenario dropdown.

How It Works

Conversation Flow

sequenceDiagram
    participant User
    participant ChatAgent as Language Tutor<br/>(gpt-4o-realtime-mini)
    participant Supervisor as Learning Supervisor<br/>(gpt-4.1-mini)
    participant Neo4j as SRS Database
    participant Analysis as Learning Analysis<br/>(gpt-4.1-mini)

    User->>ChatAgent: Target language phrase/sentence
    ChatAgent->>Supervisor: Get intelligent tutoring response
    Supervisor->>Neo4j: Query known words & review due
    Neo4j->>Supervisor: User progress data
    Supervisor->>ChatAgent: Personalized response with SRS vocab
    ChatAgent->>User: Adaptive tutoring response
    
    Note over Analysis: Background learning analysis
    ChatAgent->>Analysis: Trigger learning analysis
    Analysis->>Analysis: Extract vocabulary & assess performance
    Analysis->>Neo4j: Update progress with morphological features
    Neo4j->>Neo4j: Update SRS levels & form statistics

SRS Memory Modeling

The system tracks vocabulary at both lexeme and form levels:

Lexeme Level: Root word progress (e.g., "читать" - to read)
Form Level: Conjugated/declined forms (e.g., "читаю", "читаешь", "читает")
Morphological Features: Person, number, case, gender, tense, aspect
Error Patterns: Common mistakes tracked per form

Learning Analysis

The system analyzes conversation turns to:

Extract meaningful vocabulary usage
Assess grammatical accuracy
Identify error patterns
Update SRS scheduling
Track morphological understanding

Key Features

Intelligent Vocabulary Selection

Prioritizes words due for review in conversations
Introduces new vocabulary at appropriate difficulty levels
Adapts complexity based on demonstrated proficiency

Morphological Analysis

Configurable language-specific grammar pattern recognition
Tracks verb conjugations, noun declensions, adjective agreements
Supports multiple languages through modular configuration
Records form-specific error patterns per language

Spaced Repetition

Leitner box system with 5 SRS levels
Dynamic scheduling based on performance
Form-specific success rate tracking
Weakest forms identification for focused practice

Conversation Memory

Maintains conversation history for context
Uses sliding window for vocabulary analysis
Incorporates error patterns into tutoring decisions

Configuration

Agent Configs

The system uses two main agent configurations:

Language Tutor Agent (src/app/agentConfigs/languageTutor/index.ts):
- Real-time conversation handling
- SRS-informed vocabulary selection
- Adaptive difficulty adjustment
Learning Supervisor (src/app/agentConfigs/learningSupervisor.ts):
- Background learning analysis
- Morphological feature extraction
- SRS database updates

Customization

Supported Languages

Currently supported languages:

Russian (ru): Full morphological analysis for Cyrillic script
Spanish (es): Comprehensive verb conjugation and noun-adjective agreement

Adding New Languages

To add support for a new language:

Create a language configuration file in src/lib/languages/[language].ts
Define morphological patterns for verbs, nouns, adjectives
Add teaching strategies for different proficiency levels
Include grammar examples (correct and incorrect usage)
Register the language in src/lib/languages/index.ts

Example language configuration structure:

export const newLanguageConfig: LanguageConfig = {
  code: 'fr',
  name: 'French',
  nativeName: 'Français',
  morphology: {
    verbs: [/* conjugation patterns */],
    nouns: [/* declension patterns */],
    adjectives: [/* agreement patterns */]
  },
  teachingStrategies: {/* proficiency-based strategies */},
  grammarExamples: {/* correct/incorrect examples */}
};

API Endpoints

POST /api/learning/process-event - Process learning events
GET /api/learning/progress - Get user progress summary
GET /api/learning/review-due - Get words due for review
GET /api/learning/known-words - Get user's known vocabulary
POST /api/neo4j/init - Initialize database schema

Development Tools

Testing

test-simplified-system.js - Test SRS system with multi-language sample data
cleanup-old-nodes.js - Clean up old database nodes

Database Management

Built-in schema initialization
Automated cleanup scripts
Progress tracking utilities

Performance Considerations

Real-time Response: Chat agent provides immediate feedback
Background Analysis: Learning analysis runs asynchronously
SRS Optimization: Embedded form statistics reduce query complexity
Conversation Memory: Sliding window prevents context overflow

Limitations & Future Enhancements

Current Limitations

Rule-based morphological analysis (not ML-based)
Limited to Russian and Spanish (easily extensible)
Minimal frontend design
No user authentication system

Potential Enhancements

Advanced morphological analyzers (ML-based: pymystem3, natasha, spaCy)
Additional language support (French, German, Mandarin, etc.)
User interface improvements and language selection
Audio pronunciation analysis
Writing practice integration
Progress visualization and analytics

Contributing

The system is designed to be extensible. Key areas for contribution:

Language Support: Add morphological analysis for other languages
UI/UX: Improve frontend design and user experience
Analytics: Enhanced learning progress visualization
SRS Algorithms: Alternative spaced repetition implementations
Assessment: More sophisticated proficiency evaluation

Technical Stack

Frontend: Next.js 14, TypeScript, React
Backend: Next.js API routes
Database: Neo4j graph database
AI: OpenAI Realtime API, GPT-4.1-mini
Voice: WebRTC, OpenAI Realtime API
Language Processing: Configurable morphological analysis system

License

This project demonstrates advanced patterns for voice-based language learning applications. Feel free to use as a foundation for your own language tutoring systems.

Note: This application focuses on backend architecture and learning algorithms. Frontend design is minimal but functional. The system provides a complete foundation for building sophisticated language learning applications.

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
public		public
src		src
.env.example		.env.example
.env.sample		.env.sample
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
IMPLEMENTATION.md		IMPLEMENTATION.md
LICENSE		LICENSE
README.md		README.md
cleanup-old-nodes.js		cleanup-old-nodes.js
eslint.config.mjs		eslint.config.mjs
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tailwind.config.ts		tailwind.config.ts
test-simplified-system.js		test-simplified-system.js
tsconfig.json		tsconfig.json

License

coding-crying/realtime-agents-language-tutor

Folders and files

Latest commit

History

Repository files navigation

Realtime Language Tutoring System

About This Application

Architecture Overview

Core Components

Database Schema

Setup

Prerequisites

Installation

How It Works

Conversation Flow

SRS Memory Modeling

Learning Analysis

Key Features

Intelligent Vocabulary Selection

Morphological Analysis

Spaced Repetition

Conversation Memory

Configuration

Agent Configs

Customization

Supported Languages

Adding New Languages

API Endpoints

Development Tools

Testing

Database Management

Performance Considerations

Limitations & Future Enhancements

Current Limitations

Potential Enhancements

Contributing

Technical Stack

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages