llm-chat

A minimal, full-stack LLM Chat demo built with TypeScript end-to-end.

Frontend: React + Vite + TypeScript
Backend: Express + TypeScript
Provider: OpenAI-compatible API (switch providers via LLM_BASE_URL)
Modes: Stream (SSE) / Non-stream (JSON)

Quick Start

pnpm install
cp server/.env.example server/.env
# Edit server/.env — set LLM_API_KEY (and optionally LLM_BASE_URL)
pnpm dev

Frontend: http://localhost:5173
Backend: http://localhost:8787

Request Format (Frontend → Backend)

{
  "messages": [{ "role": "user", "content": "Hello" }],
  "model": "auto",
  "mode": "stream",
  "session_id": "s_demo_001",
  "user_id": "u_001",
  "trace_id": "trace_xxx"
}

Core Flow

Frontend sends POST /chat (non-stream) or POST /chat/stream (stream)
Backend runs auth guard (user validation + balance check)
Merges system prompt + user messages + tool definitions + sampling params
Router selects the target model
Calls provider
- Non-stream: returns full completion as JSON
- Stream: yields delta.content chunks via SSE
Response
- Non-stream: JSON body
- Stream: SSE events (event + data)
Logs: latency / tokens / trace_id / error

SSE Event Format

event: message
data: {"type":"delta","text":"Hello"}

event: done
data: {"type":"done","usage":{"prompt_tokens":...}}

Model Router

server/src/core/router.ts

User explicitly specifies a model (not auto) → pass through
Tool call needed (text matches time / 时间) → tool-capable model
Large context (> 12k chars) → long-context model
Default → cheap model

Mock Auth

server/src/mock/db.ts

u_001 — active, pro plan (default test user)
u_003 — inactive, zero balance (for testing error branches)

Key Files

File	Description
`server/src/routes/chat.ts`	Non-stream chat endpoint
`server/src/routes/chat-stream.ts`	Stream chat endpoint (SSE)
`server/src/core/provider.ts`	LLM provider calls
`server/src/core/router.ts`	Model selection logic
`server/src/core/auth.ts`	Auth guard middleware
`server/src/core/sse.ts`	SSE helpers
`server/src/core/logger.ts`	Structured request logging
`web/src/api.ts`	Frontend API + SSE parser
`web/src/App.tsx`	Chat UI + incremental rendering

Roadmap

See Issues for the full development plan, organized in three phases:

Phase 1 — Core completion (persistence, tool calling, context management)
Phase 2 — Production infrastructure (auth, rate limiting, multi-provider, observability)
Phase 3 — Differentiation (model comparison, export, RAG)

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
server		server
web		web
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
README.md		README.md
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

llm-chat

Quick Start

Request Format (Frontend → Backend)

Core Flow

SSE Event Format

Model Router

Mock Auth

Key Files

Roadmap

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

llm-chat

Quick Start

Request Format (Frontend → Backend)

Core Flow

SSE Event Format

Model Router

Mock Auth

Key Files

Roadmap

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages