llm-prism

A lightweight, transparent reverse proxy for LLM API observability. It captures full HTTP request/response lifecycles (including streaming/SSE) without latency impact.

Note: Currently optimized for the DeepSeek API. The design is provider-agnostic and can be extended to other LLM providers in the future.

Features

  • Zero-Latency Streaming: Wraps the response writer with http.Flusher so that Server-Sent Events (SSE) are forwarded instantly.
  • Faithful Capture: Records raw payloads. Automatically handles Gzip decompression and JSON validation for logging.
  • Log Separation:
    • System Logs: Console (Stderr) for operational status.
    • Data Logs: File (JSONL) for traffic analysis.

Install

go install github.com/wangyihang/llm-prism@latest

Usage

  1. Start the gateway
$ llm-prism run --help
Usage: llm-prism run --api-key=STRING [flags]

Run proxy

Flags:
  -h, --help                          Show context-sensitive help.
      --log-file="llm-prism.jsonl"    Log file ($LLM_PRISM_LOG_FILE)

      --api-url="https://api.deepseek.com/anthropic"
                                      API URL ($LLM_PRISM_API_URL)
      --api-key=STRING                API Key ($LLM_PRISM_API_KEY)
      --provider="deepseek"           Provider ($LLM_PRISM_PROVIDER)
      --host="0.0.0.0"                Host ($LLM_PRISM_HOST)
      --port=4000                     Port ($LLM_PRISM_PORT)
# Basic usage (DeepSeek)
export LLM_PRISM_API_URL=https://api.deepseek.com/anthropic
export LLM_PRISM_API_KEY=sk-deepseek-sample-api-key
export LLM_PRISM_PROVIDER=deepseek
llm-prism run

# Basic usage (Kimi)
export LLM_PRISM_API_URL=https://api.moonshot.cn/anthropic/
export LLM_PRISM_API_KEY=sk-kimi-sample-api-key
export LLM_PRISM_PROVIDER=kimi
llm-prism run

  2. Run Claude Code
export ANTHROPIC_BASE_URL=http://localhost:4000
export ANTHROPIC_AUTH_TOKEN=
claude

Log Format

Data logs are stored in JSONL format. Each line represents a completed HTTP interaction.
The example below shows a DeepSeek chat completion, but the schema is generic enough to support other providers later.

{
  "level": "info",
  "time": "2023-10-27T10:00:00.123Z",
  "duration": 150.5,
  "http": {
    "request": {
      "method": "POST",
      "path": "/v1/chat/completions",
      "body": {
        "model": "deepseek-chat",
        "messages": [...]
      }
    },
    "response": {
      "status": 200,
      "body": "data: {...}\n\ndata: [DONE]" // Raw string for SSE streams
    }
  }
}

Architecture

  • Provider-Agnostic Core: Core proxy and logging pipeline are independent of any specific LLM provider.
  • DeepSeek Adapter (current): Request/response examples focus on DeepSeek endpoints and models.
  • Future Providers: Additional adapters can be added to normalize request/response shapes for other providers while keeping the logging format stable.
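One way the adapter idea above could be modeled is as a small interface the core proxy depends on, with one implementation per vendor. This is a hypothetical sketch of the design, not the project's actual API; the interface name and methods are assumptions:

```go
package main

import "fmt"

// Provider is a hypothetical adapter interface: the proxy core only
// needs a name (for logs) and an upstream base URL, so vendor
// specifics stay out of the core and the logging format stays stable.
type Provider interface {
	Name() string
	BaseURL() string
}

type deepseek struct{}

func (deepseek) Name() string    { return "deepseek" }
func (deepseek) BaseURL() string { return "https://api.deepseek.com/anthropic" }

type kimi struct{}

func (kimi) Name() string    { return "kimi" }
func (kimi) BaseURL() string { return "https://api.moonshot.cn/anthropic/" }

func main() {
	// The core iterates over adapters without knowing their concrete types.
	for _, p := range []Provider{deepseek{}, kimi{}} {
		fmt.Printf("%s -> %s\n", p.Name(), p.BaseURL())
	}
}
```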
