RAG Translation Backend Server

A FastAPI-based RAG (Retrieval-Augmented Generation) server designed for translation tasks, featuring vector-based similarity search and stammering detection.

Features

Retrieval-Augmented Generation: Uses ChromaDB to retrieve the four most similar translation pairs, which are supplied as additional context for the translation.
Stammering Detection: Detects character elongation and phrase repetition in translations.
Dockerized: Fully containerized for consistent deployment and testing.
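
The two stammering signals named above can be illustrated with a small standalone sketch. This is not the code in services.py: the function names, the regular expression, and the thresholds (min_run, max_phrase_len, min_repeats) are assumptions chosen for illustration only.

```python
import re


def has_char_elongation(text: str, min_run: int = 4) -> bool:
    """Return True if any letter repeats min_run or more times in a row.

    Only letters are checked, so digit runs ("1000000") and punctuation
    runs ("....") are ignored. min_run is a hypothetical threshold.
    """
    # ([^\W\d_]) captures one letter; \1{n,} requires n or more further copies.
    pattern = re.compile(r"([^\W\d_])\1{" + str(min_run - 1) + r",}")
    return pattern.search(text) is not None


def has_phrase_repetition(
    text: str, max_phrase_len: int = 4, min_repeats: int = 3
) -> bool:
    """Return True if a phrase of 1..max_phrase_len words occurs
    min_repeats or more times back to back (case-insensitive)."""
    words = re.findall(r"\w+", text.lower())
    for n in range(1, max_phrase_len + 1):
        # Last start index that still leaves room for min_repeats copies.
        for start in range(len(words) - n * min_repeats + 1):
            phrase = words[start:start + n]
            if all(
                words[start + k * n:start + (k + 1) * n] == phrase
                for k in range(1, min_repeats)
            ):
                return True
    return False


def is_stammering(text: str) -> bool:
    """Flag a translation if either signal fires."""
    return has_char_elongation(text) or has_phrase_repetition(text)
```

With these default thresholds, a doubled word such as "very very good" is not flagged, since natural emphasis often repeats a word twice. A real detector would likely need thresholds tuned against stammering_tests.jsonl.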

Quick Start (Local)

This project uses uv for fast dependency management.

Setup: uv sync
Run Server: uv run uvicorn main:app --reload
Seed Data: uv run seed_db.py (populates the DB with the provided JSONL pairs)
API Docs: Open http://127.0.0.1:8000/docs
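
For readers unfamiliar with JSONL, the seeding step reads one JSON object per line. The sketch below shows one way such a file could be parsed before the records are written to the database. It is not the contents of seed_db.py, and the field names "source" and "target" used in the comments are hypothetical, since the actual schema of translation_pairs.jsonl is not documented here.

```python
import json
from typing import Iterable


def parse_jsonl(lines: Iterable[str]) -> list[dict]:
    """Parse an iterable of JSONL lines into a list of dicts.

    Blank lines are skipped. A malformed line raises ValueError that
    names the offending line number.
    """
    records = []
    for line_no, line in enumerate(lines, start=1):
        line = line.strip()
        if not line:
            continue
        try:
            records.append(json.loads(line))
        except json.JSONDecodeError as exc:
            raise ValueError(f"line {line_no}: invalid JSON") from exc
    return records


# Example usage (assumes the file sits in the working directory):
#
#     with open("translation_pairs.jsonl", encoding="utf-8") as f:
#         pairs = parse_jsonl(f)
#     # Each record might look like {"source": "...", "target": "..."},
#     # but these keys are an assumption.
```

Accepting any iterable of lines rather than a file path keeps the parser easy to test without touching the filesystem.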

Docker (Score Booster)

To run the fully containerized application:

docker build -t translation-rag .
docker run -p 8000:8000 translation-rag

