Turn your documents into a queryable knowledge base — full RAG pipeline with pgvector semantic search, streaming AI responses, and bring-your-own-LLM support.
Screenshot pending — GIF will be added after live deployment.
End-to-end RAG pipeline — Upload Markdown or plain-text files; the system automatically parses, chunks (Markdown-aware recursive splitting), and embeds them into pgvector. Every AI response cites the exact source documents with a link back to the original.
Bring your own LLM — Connect any OpenAI-compatible API (OpenAI, Azure OpenAI, local models via Ollama, etc.). Your API key is AES-encrypted before storage and never sent to the client.
Honest fallback with source attribution — Similarity threshold dynamically adapts to question length. When no relevant chunks are found, the response is explicitly labeled "general answer" rather than silently hallucinating from an empty context.
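The length-adaptive threshold can be sketched roughly like this (the function name and cutoff values are illustrative assumptions, not the app's actual numbers):

```typescript
// Illustrative sketch: shorter questions embed with less signal, so they
// get a stricter similarity cutoff. Exact thresholds are assumptions.
function similarityThreshold(question: string): number {
  const len = question.trim().length;
  if (len < 20) return 0.5; // very short question: demand close matches
  if (len < 80) return 0.4;
  return 0.3;               // long questions carry enough context on their own
}
```

When no chunk clears the threshold, the answer is generated without retrieved context and labeled accordingly.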
| Layer | Choice |
|---|---|
| Framework | Next.js 15 (App Router) + TypeScript |
| UI | shadcn/ui + Tailwind CSS |
| Database | PostgreSQL + pgvector |
| ORM | Drizzle ORM |
| Auth | Better Auth — email/password + Google OAuth |
| AI | Vercel AI SDK + any OpenAI-compatible API |
| Email | Resend |
| Deployment | Vercel |
Knowledge base management
- Nested folder tree — create folders and sub-folders to organize documents
- Upload `.md`/`.txt` files, or create notes in the built-in Markdown editor (split-pane edit + preview)
- Document detail page — view extracted text, file metadata, and processing status
Document processing pipeline
- Text extraction → recursive character text splitting (respects Markdown headings, paragraphs, sentences) → vector embedding via `text-embedding-3-small` → stored in pgvector
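The recursive splitting step can be sketched as follows. This is a simplified illustration (separator list and sizes are assumptions, and the 100-char overlap from the design notes is omitted for brevity):

```typescript
// Try coarse separators first (headings, paragraphs), fall back to finer
// ones; hard-split only when no separator is left. Simplified: no overlap.
const SEPARATORS = ["\n## ", "\n\n", "\n", ". ", " "];

function splitText(text: string, maxLen = 1000, seps = SEPARATORS): string[] {
  if (text.length <= maxLen) return text.trim() ? [text] : [];
  const [sep, ...rest] = seps;
  if (sep === undefined) {
    // No separator left: split at fixed offsets.
    const chunks: string[] = [];
    for (let i = 0; i < text.length; i += maxLen) {
      chunks.push(text.slice(i, i + maxLen));
    }
    return chunks;
  }
  // Split on the current separator, recurse with the finer ones.
  return text.split(sep).flatMap((part) => splitText(part, maxLen, rest));
}
```

Each resulting chunk is then embedded and written to pgvector.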
Semantic chat
- Conversation list with star (pinned), rename, and delete
- Streaming responses with full Markdown rendering (headings, code blocks, lists)
- "Sources referenced" collapsible section — shows cited document names, click to open original
LLM configuration
- Configure base URL, API key, and model per user account
- "Test Connection" validates both the chat and embedding API before saving
Auth & multi-tenancy
- Email + password with email verification (Resend), Google OAuth
- All data scoped by `userId` — no cross-user data leakage
UX
- Dark / light theme toggle
- Collapsible sidebar (icon-only mode)
```
RAG Pipeline

Upload file ──► Parse ──► Chunk ──► Embed ──► pgvector
                                                  │
User question ──► Embed ──► cosine similarity ────┘
                               │
                       Top-K chunks (k=10)
                       + dynamic threshold
                               │
                       LLM (context injection)
                               │
                       Streaming response + sources
```
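The retrieval step that pgvector performs server-side can be sketched in plain TypeScript (function names and the 0.3 default threshold are illustrative; in production the database computes this with the `<=>` cosine-distance operator):

```typescript
// Cosine similarity between two equal-length vectors.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

// Score all chunks against the question embedding, keep the best k
// above the threshold — the "Top-K + dynamic threshold" box above.
function topK(
  query: number[],
  chunks: { text: string; embedding: number[] }[],
  k = 10,
  threshold = 0.3,
): { text: string; score: number }[] {
  return chunks
    .map((c) => ({ text: c.text, score: cosineSimilarity(query, c.embedding) }))
    .filter((c) => c.score >= threshold)
    .sort((a, b) => b.score - a.score)
    .slice(0, k);
}
```

The surviving chunks are injected into the LLM prompt as context, and their source documents become the "Sources referenced" list.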
Key design decisions:
- Server Components by default; `"use client"` only where interaction is required
- Server Actions for all mutations; Route Handlers only for external-facing APIs
- Chunking uses recursive splitting (500–1000 chars, 100-char overlap) — no extra model calls needed
- `DocumentParser` interface makes adding PDF/DOCX parsers a drop-in change
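A plausible shape for the `DocumentParser` interface mentioned above (member names are illustrative; the real interface may differ):

```typescript
// Each parser declares which extensions it handles and how to extract text.
interface DocumentParser {
  /** File extensions this parser handles, e.g. [".md", ".txt"] */
  extensions: string[];
  /** Extract plain text from the raw file bytes */
  parse(buffer: Buffer): Promise<string>;
}

// The existing formats reduce to a UTF-8 decode; adding PDF or DOCX
// support would be one more object implementing the same interface.
const plainTextParser: DocumentParser = {
  extensions: [".md", ".txt"],
  parse: async (buffer) => buffer.toString("utf-8"),
};
```

The upload handler then picks a parser by extension, keeping format support decoupled from the chunking and embedding pipeline.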
Prerequisites: Node.js 18+, pnpm, PostgreSQL with the pgvector extension enabled
1. Clone and install

   ```bash
   git clone https://github.com/your-username/knowledgechat.git
   cd knowledgechat
   pnpm install
   ```

2. Configure environment variables

   ```bash
   cp .env.local.example .env.local
   ```

   Required variables:

   | Variable | Description |
   |---|---|
   | `DATABASE_URL` | PostgreSQL connection string |
   | `BETTER_AUTH_SECRET` | Random secret (≥ 32 chars) |
   | `BETTER_AUTH_URL` | App base URL (`http://localhost:3000` locally) |
   | `GOOGLE_CLIENT_ID` / `GOOGLE_CLIENT_SECRET` | Google OAuth credentials |
   | `RESEND_API_KEY` / `RESEND_FROM` | Transactional email |
   | `ENCRYPTION_KEY` | AES key for LLM API key encryption |

3. Run database migrations

   ```bash
   pnpm db:generate
   pnpm db:migrate
   ```

4. Start the dev server

   ```bash
   pnpm dev
   ```

   Open http://localhost:3000.

5. Configure your LLM — go to Settings, enter your OpenAI-compatible base URL, API key, and model name, then click Test Connection.
- PDF and DOCX support — the `DocumentParser` interface is already defined; implementations pending
- Live demo deployment
MIT
Demo coming soon. The live deployment will be linked here once available.