CLAUDE.md

Project context for Claude Code.

Project Goal

coOCR/HTR is an Editor-in-the-Loop tool for OCR/HTR verification and correction.

Input: Image (generate OCR) OR PAGE-XML (correct existing transcription)
Output: Correct OCR/HTR in exportable format (PAGE-XML, TXT, JSON)
Target audience: Domain experts (Digital Humanists, archivists, historians)

Success Criteria:

Self-explanatory (usable without instructions)
Complete workflow (Upload → Edit → Export)
Workflow integration (output usable in downstream processes)

See VISION.md for details.

Technology Stack

Component	Technology
Runtime	Vanilla JavaScript (ES6+)
Dependencies	None (Tests: Vitest)
Storage	LocalStorage + IndexedDB (v2)
API	Fetch API (Gemini, OpenAI, Anthropic, Mistral, Azure Mistral, Ollama)
i18n	JSON dictionaries (DE/EN switchable)
UI	Plain HTML/CSS, Dark Mode, Glass Morphism
Hosting	GitHub Pages (`docs/` folder)

Project Structure

co-ocr-htr/
├── README.md              # Project overview (English)
├── CLAUDE.md              # This document
├── knowledge/             # Consolidated knowledge base (Obsidian Vault)
│   ├── INDEX.md           # Navigation, document matrix
│   ├── VISION.md          # Project goal, success criteria
│   ├── METHODOLOGY.md     # Scientific foundations
│   ├── MODEL-LANDSCAPE.md # OCR/HTR model comparison
│   ├── DESIGN-SYSTEM.md   # UI/UX specification
│   ├── ARCHITECTURE.md    # Technical architecture
│   ├── VALIDATION.md      # Hybrid validation
│   ├── DATA-SCHEMA.md     # Data structures
│   ├── IMPLEMENTATION-PLAN.md # Roadmap (completed)
│   └── JOURNAL.md         # Development log
├── docs/                  # GitHub Pages deployment
│   ├── index.html         # Main application
│   ├── css/               # Modular CSS files
│   │   ├── variables.css  # Design tokens
│   │   ├── base.css       # Reset, typography
│   │   ├── layout.css     # Grid, header
│   │   ├── components.css # Buttons, cards
│   │   ├── viewer.css     # Document viewer
│   │   ├── editor.css     # Transcription editor
│   │   ├── validation.css # Validation panel
│   │   └── dialogs.css    # Modal dialogs
│   ├── js/
│   │   ├── main.js        # Entry point
│   │   ├── state.js       # Central state (EventTarget)
│   │   ├── viewer.js      # Document viewer
│   │   ├── editor.js      # Transcription editor
│   │   ├── components/    # UI components
│   │   ├── utils/         # Shared utilities
│   │   └── services/      # LLM, i18n, Storage, Validation, Export
│   ├── i18n/              # Translation dictionaries (en.json, de.json)
│   ├── samples/           # Demo documents
│   └── tests/             # Vitest tests (567 tests)
└── data/                  # Development data (not deployed)
    └── ocr-examples/      # Complete datasets

Knowledge Base (knowledge/)

All design decisions are documented and justified in knowledge/.

Question	Document
What is the goal?	VISION
Why categorical confidence?	METHODOLOGY
Which models?	MODEL-LANDSCAPE
What does the UI look like?	DESIGN-SYSTEM
How is it built?	ARCHITECTURE
How does validation work?	VALIDATION
Which data structures?	DATA-SCHEMA

Development Methodology: Promptotyping

Documentation before code
Iteration through dialogue
Early validation
Minimal, readable code

Core Concepts

Concept	Meaning
Critical Expert in the Loop	Human validates, machine assists
Categorical Confidence	confident / uncertain / problematic (no percentages)
Hybrid Validation	Deterministic rules + LLM-Judge
Custom Validation Prompt	Optional user-defined validation prompt

Conventions

No build process
ES6 Modules (native)
CSS Custom Properties for theming
Comments explain "why", code explains "what"
No emojis - Use instead:
- [x] for completed
- [~] for in progress
- [ ] for planned
- (green), (yellow), (red) for status colors in documentation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLAUDE.md

Project Goal

Technology Stack

Project Structure

Knowledge Base (knowledge/)

Development Methodology: Promptotyping

Core Concepts

Conventions

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

CLAUDE.md

Project Goal

Technology Stack

Project Structure

Knowledge Base (knowledge/)

Development Methodology: Promptotyping

Core Concepts

Conventions