Enterprise RAG System (v1)

Production-oriented monorepo for a workspace-scoped Retrieval-Augmented Generation (RAG) platform using Supabase Auth, FastAPI, Redis/RQ workers, PostgreSQL + pgvector, and a React client.

This repository follows AGENTS.md as the locked architecture contract. Some modules are scaffolded with TODOs; this README distinguishes current implementation from planned flow.

Project Overview

Enterprise RAG enables a user to:

authenticate with Supabase
create a single workspace (v1 constraint)
upload and index PDFs (pipeline scaffolded)
run grounded queries over selected documents (query pipeline scaffolded)
enforce a strict daily token budget per workspace

Architecture Diagram

flowchart LR
    U[User] --> C[React Client\nVite]
    C -->|Supabase Auth| SA[Supabase Auth]
    C -->|Bearer JWT| API[FastAPI Server]

    API --> DB[(PostgreSQL + pgvector)]
    API --> R[(Redis)]
    API --> ST[(Supabase Storage)]

    API -->|enqueue jobs| Q[RQ Queues\ningest_extract / ingest_index]
    Q --> W1[Worker Extract]
    Q --> W2[Worker Index]

    W1 --> DB
    W2 --> DB
    W1 --> ST
    API --> OAI[OpenAI\nEmbeddings + LLM]
    W2 --> OAI

ASCII view:

Client (React) -> FastAPI -> Postgres(pgvector)
      |              |            ^
      v              v            |
 Supabase Auth    Redis/RQ -> Worker(s)
      |                           |
      +---------------------------+
                 Supabase Storage / OpenAI

Tech Stack

Backend: FastAPI, SQLAlchemy, Pydantic Settings
Database: PostgreSQL + pgvector
Queue: Redis + RQ
Auth/Storage: Supabase Auth + Storage
AI: OpenAI (text-embedding-3-small, gpt-4o-mini per locked architecture)
Frontend: React + TypeScript + Vite + Supabase JS
Infra/Dev: Docker Compose, Nginx (production client image)

High-Level Flow

1) Supabase Auth

Client signs in via Supabase (client/src/lib/supabase.ts).
Client gets access_token from session.
Backend validates bearer token in server/app/core/auth.py using Supabase SDK (with REST fallback).

2) Workspace Creation

POST /workspaces creates one workspace per user.
Enforced uniqueness: if existing workspace owned by user is found, returns 409.
A daily usage row (workspace_daily_usage) is initialized at creation.

3) Token Budget Engine

Budget tracked per workspace/day in workspace_daily_usage.
Implemented operations:
- reserve (reserve_tokens)
- release (release_tokens)
- commit actual usage (commit_usage)
- read status (get_budget_status)
GET /usage/today returns {used,reserved,limit,remaining,resets_at}.

4) Document Ingestion

Locked architecture defines upload-prepare -> upload-complete -> extract -> chunk -> embed -> ready.
Current repo status:
- document/query endpoints are scaffolded placeholders
- worker jobs ingest_extract and ingest_index are TODO stubs
- DB schema and queue wiring are present

5) RAG Query Flow

Locked architecture requires strict grounded retrieval over workspace-scoped chunks.
Current repo status:
- /query route exists but returns Not implemented
- retrieval/chunking/embeddings modules are scaffolded

Environment Variables

Core env is defined in .env.example.

# Supabase
SUPABASE_URL=
SUPABASE_SERVICE_ROLE_KEY=
SUPABASE_ANON_KEY=
SUPABASE_JWT_SECRET=
SUPABASE_KEY=  # compatibility alias

# AI
OPENAI_API_KEY=

# Data/queue
DATABASE_URL=
REDIS_URL=

# App
ENVIRONMENT=development
API_HOST=0.0.0.0
API_PORT=8000
DAILY_TOKEN_LIMIT=100000

# Client
VITE_API_URL=http://localhost:8000
VITE_SUPABASE_URL=
VITE_SUPABASE_ANON_KEY=

What matters most right now:

SUPABASE_URL + service role key for backend token validation
VITE_SUPABASE_URL + anon key for client auth
DATABASE_URL for server + workers
REDIS_URL for workers

Run Locally (Backend + Frontend)

Option A: Docker Compose (recommended)

cp .env.example .env
docker-compose up --build

Services:

API: http://localhost:8000
Client: http://localhost:5173
RQ Dashboard: http://localhost:9181

Option B: Run modules directly

Server:

cd server
python -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt
uvicorn app.main:app --reload --host 0.0.0.0 --port 8000

Client:

cd client
npm install
npm run dev -- --host 0.0.0.0

Worker (example queue):

cd worker
python -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt
QUEUE_NAME=ingest_extract REDIS_URL=redis://localhost:6379/0 python worker.py

Run With Supabase

Create Supabase project.
Fill .env with Supabase URL, service role key, anon key.
Apply schema:

psql "$DATABASE_URL" -f scripts/schema.supabase.sql

Start stack (docker-compose up --build).
Open client and sign in.

Basic API check with JWT:

curl -H "Authorization: Bearer <SUPABASE_ACCESS_TOKEN>" \
  http://localhost:8000/auth/me

Create workspace:

curl -X POST http://localhost:8000/workspaces \
  -H "Authorization: Bearer <SUPABASE_ACCESS_TOKEN>" \
  -H "Content-Type: application/json" \
  -d '{"name":"My Workspace"}'

Get usage today:

curl -H "Authorization: Bearer <SUPABASE_ACCESS_TOKEN>" \
  http://localhost:8000/usage/today

Folder Structure Summary

enterprise-rag/
├── server/           # FastAPI API + token budget + DB models
├── client/           # React/Vite frontend with Supabase auth
├── worker/           # RQ workers (extract/index + maintenance)
├── scripts/          # DB schema/bootstrap scripts
├── infrastructure/   # Terraform/K8s placeholders
├── docker-compose.yml
└── AGENTS.md         # Locked architecture contract

Development Order Roadmap

Complete document API contracts (upload-prepare, upload-complete, list/status).
Implement extraction worker (worker/jobs/ingest_extract.py) and page persistence.
Implement chunking + embeddings pipeline (server/app/core/chunking.py, embeddings.py, worker/jobs/ingest_index.py).
Implement retrieval + grounded query endpoint (server/app/api/query.py, core/retrieval.py).
Add stale reservation scheduled maintenance integration and observability.
Expand client from test harness to full app pages (Documents, Query, Usage, Dashboard).
Harden with integration tests (auth, workspace isolation, ingestion, retrieval, budget edge cases).

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.github/workflows		.github/workflows
client		client
scripts		scripts
server		server
worker		worker
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
README.md		README.md
docker-compose.dev.yml		docker-compose.dev.yml
docker-compose.prod.yml		docker-compose.prod.yml
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Enterprise RAG System (v1)

Project Overview

Architecture Diagram

Tech Stack

High-Level Flow

1) Supabase Auth

2) Workspace Creation

3) Token Budget Engine

4) Document Ingestion

5) RAG Query Flow

Environment Variables

Run Locally (Backend + Frontend)

Option A: Docker Compose (recommended)

Option B: Run modules directly

Run With Supabase

Folder Structure Summary

Development Order Roadmap

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Aryan1718/enterprise-rag-platform

Folders and files

Latest commit

History

Repository files navigation

Enterprise RAG System (v1)

Project Overview

Architecture Diagram

Tech Stack

High-Level Flow

1) Supabase Auth

2) Workspace Creation

3) Token Budget Engine

4) Document Ingestion

5) RAG Query Flow

Environment Variables

Run Locally (Backend + Frontend)

Option A: Docker Compose (recommended)

Option B: Run modules directly

Run With Supabase

Folder Structure Summary

Development Order Roadmap

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages