Eclaire System Architecture

This document describes the overall system architecture of Eclaire, an AI assistant focused on privacy and self-hosting.

Architecture Overview

📝 View Mermaid source code

graph TB
    %% Users
    User[👤 User]

    %% Frontend Layer
    subgraph "Frontend Layer"
        Frontend[🌐 Vite Frontend<br/>Port 3000<br/>React, TanStack Router, Radix UI]
    end

    %% Backend Layer
    subgraph "Backend Layer"
        Backend[⚙️ Node.js Backend API<br/>Port 3001<br/>Hono, Zod, Better Auth]
    end

    %% Workers Layer
    subgraph "Workers Layer"
        Workers[🔄 Background Workers<br/>Node.js<br/>Job Processing]

        subgraph "Worker Jobs"
            BookmarkJob[📎 Bookmark Processor]
            ImageJob[🖼️ Image Processor]
            DocJob[📄 Document Processor]
            NoteJob[📝 Note Processor]
            TaskJob[✅ Task Processor]
            TaskExecJob[🤖 Task Execution Processor]
        end
    end

    %% Data Layer
    subgraph "Data Layer"
        Postgres[(🗄️ PostgreSQL<br/>Port 5432<br/>Database + Job Queue<br/>Drizzle ORM)]
    end

    %% External Services
    subgraph "AI & External Services"
        LlamaCppBackend[🧠 llama-server<br/>Port 11500<br/>Backend AI Model<br/>Qwen3-14B]
        LlamaCppWorkers[🧠 llama-server<br/>Port 11501<br/>Workers AI Model<br/>Gemma-3-4B]
        Docling[📑 Docling<br/>Port 5001<br/>Document Processing<br/>PDF, RTF, etc.]
        ExtAPIs[🌐 External APIs<br/>GitHub, Reddit<br/>Rate Limited]
    end

    %% File System
    subgraph "Storage"
        DataVol[📁 ./data Volume<br/>Persistent Storage<br/>Config, Logs, Files]
        BrowserData[🌐 ./data/browser-data<br/>Playwright Cache]
    end

    %% User Interactions
    User --> Frontend

    %% Frontend to Backend
    Frontend -->|HTTP REST API<br/>WebSocket/SSE Streaming<br/>Authentication| Backend

    %% Backend to Data Layer
    Backend -->|SQL Queries<br/>Drizzle ORM| Postgres
    Backend -->|Auth Sessions<br/>Better Auth| Postgres
    Backend -->|AI Inference<br/>OpenAI Compatible| LlamaCppBackend
    Backend -->|Job Queue| Postgres

    %% Backend to Workers (unified or separate)
    Backend -->|Enqueue Jobs| Workers

    %% Worker Jobs to Workers
    BookmarkJob --> Workers
    ImageJob --> Workers
    DocJob --> Workers
    NoteJob --> Workers
    TaskJob --> Workers
    TaskExecJob --> Workers

    %% Workers to External Services
    Workers -->|HTTP Requests<br/>Rate Limited| ExtAPIs
    Workers -->|AI Inference<br/>OpenAI Compatible| LlamaCppWorkers
    Workers -->|Document Conversion<br/>HTTP API| Docling
    Workers -->|Database Updates<br/>Job Results| Postgres

    %% Storage Access
    Workers -->|File I/O<br/>Screenshots, PDFs| DataVol
    Workers -->|Browser Cache<br/>Playwright Data| BrowserData
    Backend -->|Config Files<br/>models.json| DataVol
    Backend -->|Logs, Uploads| DataVol

    %% Styling
    classDef frontend fill:#e1f5fe,stroke:#0277bd,stroke-width:2px
    classDef backend fill:#f3e5f5,stroke:#7b1fa2,stroke-width:2px
    classDef workers fill:#fff3e0,stroke:#f57c00,stroke-width:2px
    classDef data fill:#e8f5e8,stroke:#388e3c,stroke-width:2px
    classDef external fill:#fce4ec,stroke:#c2185b,stroke-width:2px
    classDef storage fill:#f1f8e9,stroke:#689f38,stroke-width:2px

    class Frontend frontend
    class Backend backend
    class Workers,BookmarkJob,ImageJob,DocJob,NoteJob,TaskJob,TaskExecJob workers
    class Postgres data
    class LlamaCppBackend,LlamaCppWorkers,Docling,ExtAPIs external
    class DataVol,BrowserData storage

Note: To regenerate the SVG after modifying the diagram, run:
mmdc -i docs/architecture.mmd -o docs/assets/architecture.svg

Component Details

Frontend Layer

Technology: Vite with React 19 and TanStack Router
UI Framework: Radix UI components with Tailwind CSS
Features:
- Progressive Web App (PWA) support
- Dark/light theme support
- Real-time updates via WebSocket/SSE
- Authentication with Better Auth
- API documentation with Scalar

Backend API Layer

Technology: Node.js with Hono web framework
Key Features:
- RESTful API with OpenAPI specification
- WebSocket and Server-Sent Events for real-time features
- Authentication and session management
- File upload and processing
- Job scheduling and queue management
- Rate limiting and security middleware

Workers Layer

Technology: Node.js background workers (runs unified with backend by default)
Queue Types:
- Bookmark Processing: Web scraping, content extraction, screenshots
- Image Processing: AI-powered image analysis and metadata extraction
- Document Processing: PDF generation, format conversion via Docling
- Note Processing: AI-powered note enhancement and organization
- Task Processing: General task management and automation
- Task Execution: AI assistant interactions and complex workflows

Data Layer

PostgreSQL Database

Purpose: Primary persistent storage and job queue
Schema: Managed with Drizzle ORM
Features:
- User accounts and authentication
- Content storage (bookmarks, notes, tasks, documents)
- Metadata and relationships
- Full-text search capabilities
- Database-backed job queue (default mode)

AI & External Services

llama.cpp Server

Purpose: Local AI model inference
Model: Gemma-3-4b-it (quantized)
API: OpenAI-compatible HTTP interface
Port: 11500 (backend), 11501 (workers)

Docling Service

Purpose: Document processing and conversion
Capabilities: PDF, RTF, DOCX, and other format processing
Port: 5001

External APIs

Services: Twitter/X, GitHub, Reddit, and other web services
Features: Rate-limited access with domain-specific configurations
Authentication: API keys and tokens managed securely

Storage & File System

Data Volume: Persistent storage for configuration, logs, and user files
Browser Data: Playwright browser cache and session data
Configuration: JSON-based model and service configuration

Deployment Architecture

The system supports multiple deployment modes:

Development Mode

Backend and workers run locally with hot reloading
PostgreSQL runs in Docker container
No Redis required (database queue by default)

Production Mode (Docker Compose)

All application services containerized
Shared Docker network for service communication
External volumes for data persistence
Health checks and restart policies

Key Design Principles

Privacy First: All data processing happens locally or on self-hosted infrastructure
Simplicity: Single container deployment with database-backed queues by default
Reliability: Job queues provide retry logic and error handling
Observability: Comprehensive logging with Pino logger
Security: No external data transmission except for explicitly configured APIs
Modularity: Clean separation between API, workers, and data layers

Network Communication

Frontend ↔ Backend: HTTP REST API, WebSocket for real-time features
Backend ↔ Database: PostgreSQL connections via Drizzle ORM
Backend ↔ Workers: Database-backed job queue (or Redis/BullMQ for scaling)
Workers ↔ AI Services: HTTP APIs for model inference
Workers ↔ External APIs: HTTP with rate limiting and error handling

This architecture provides a robust, privacy-focused AI assistant platform suitable for self-hosting.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Eclaire System Architecture

Architecture Overview

Component Details

Frontend Layer

Backend API Layer

Workers Layer

Data Layer

PostgreSQL Database

AI & External Services

llama.cpp Server

Docling Service

External APIs

Storage & File System

Deployment Architecture

Development Mode

Production Mode (Docker Compose)

Key Design Principles

Network Communication

FilesExpand file tree

architecture.md

Latest commit

History

architecture.md

File metadata and controls

Eclaire System Architecture

Architecture Overview

Component Details

Frontend Layer

Backend API Layer

Workers Layer

Data Layer

PostgreSQL Database

AI & External Services

llama.cpp Server

Docling Service

External APIs

Storage & File System

Deployment Architecture

Development Mode

Production Mode (Docker Compose)

Key Design Principles

Network Communication