This guide covers how to use Mistral AI's dedicated OCR service in paperless-gpt.
Mistral AI provides a purpose-built OCR endpoint optimized for document processing. The Mistral LLM interface in the underlying library does not currently support image uploads, so, unlike other providers, Mistral is available only through the dedicated OCR provider.
- Cost-effective document processing
- Optimized for text extraction from documents
- Built-in document structure preservation
- Returns markdown-formatted text
- Best for standard OCR needs
- Limited to 50MB and 1,000 pages per document
environment:
  # OCR Configuration
  OCR_PROVIDER: "mistral_ocr"
  MISTRAL_API_KEY: "your_mistral_api_key"
  # Optional: specify model version
  MISTRAL_MODEL: "mistral-ocr-latest"

- Maximum file size: 50MB
- Maximum page count: 1,000 pages
- Supported formats: PDF, images (JPEG, PNG)
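
The limits above can be checked locally before a document is submitted. The following is a minimal sketch, not part of paperless-gpt: the function name `check_ocr_limits` is hypothetical, the page count is assumed to be known already (for example from a PDF library), and reading "50MB" as 50 × 1024 × 1024 bytes is an assumption.

```python
import os

# Limits quoted in this guide. Interpreting "50MB" as 50 * 1024 * 1024 bytes
# is an assumption; the service may count megabytes differently.
MAX_FILE_BYTES = 50 * 1024 * 1024
MAX_PAGES = 1000
SUPPORTED_EXTENSIONS = {".pdf", ".jpg", ".jpeg", ".png"}


def check_ocr_limits(filename, size_bytes, page_count):
    """Return a list of problems; an empty list means the file fits the limits."""
    problems = []
    ext = os.path.splitext(filename)[1].lower()
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {ext or '(none)'}")
    if size_bytes > MAX_FILE_BYTES:
        problems.append(f"file too large: {size_bytes} bytes")
    if page_count > MAX_PAGES:
        problems.append(f"too many pages: {page_count}")
    return problems


# A small PDF passes; an oversized TIFF fails all three checks.
print(check_ocr_limits("invoice.pdf", 2_000_000, 12))
print(check_ocr_limits("archive.tiff", 60 * 1024 * 1024, 1500))
```

For a file on disk, `os.path.getsize(path)` would supply `size_bytes`.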

- Performance Optimization
  - Stay within size limits
  - Use markdown output for structured text
- Cost Optimization
  - The dedicated OCR provider is cost-effective for bulk processing

- Returns markdown-formatted text
- Preserves document structure
- Maintains formatting like headers and lists
- Handles tables and columns
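
Because the output is markdown, the preserved headers can be used to break a document into sections downstream. The sketch below is illustrative only: `split_markdown_sections` is a hypothetical helper, the sample text is made up rather than real OCR output, and the header detection is deliberately simple (a `#` line inside a code fence would be mistaken for a header).

```python
import re

# Matches ATX-style markdown headers such as "# Title" or "## Section".
HEADER_RE = re.compile(r"^(#{1,6})\s+(.*)$")


def split_markdown_sections(markdown):
    """Group lines under their nearest preceding header.

    Returns a list of (level, title, body) tuples. Text that appears before
    the first header is returned with level 0 and an empty title.
    """
    sections = []
    level, title, body = 0, "", []

    def flush():
        # Skip the implicit leading section when it holds no text at all.
        if title or any(line.strip() for line in body):
            sections.append((level, title, "\n".join(body).strip()))

    for line in markdown.splitlines():
        match = HEADER_RE.match(line)
        if match:
            flush()
            level, title, body = len(match.group(1)), match.group(2).strip(), []
        else:
            body.append(line)
    flush()
    return sections


# Made-up sample standing in for OCR output.
sample = "# Invoice\n\nDate: 2024-01-01\n\n## Items\n\n- Widget\n- Gadget\n"
for level, title, body in split_markdown_sections(sample):
    print(level, title, repr(body))
```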
Common issues and solutions:
- Authentication
  - Error: Invalid API key
  - Solution: Verify your MISTRAL_API_KEY is correctly set
- Model Availability
  - Error: Model not available
  - Solution: Check model name and your subscription level
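
Some of these problems come from simple configuration slips that can be caught before starting the container. The following is a rough sketch under stated assumptions: `check_mistral_ocr_config` is a hypothetical helper, not part of paperless-gpt, and it only inspects local environment values. It cannot confirm that a key is valid or that a model is available to your account.

```python
import os


def check_mistral_ocr_config(env=None):
    """Return a list of likely configuration problems (empty if none found)."""
    env = os.environ if env is None else env
    problems = []

    if env.get("OCR_PROVIDER") != "mistral_ocr":
        problems.append('OCR_PROVIDER is not set to "mistral_ocr"')

    api_key = env.get("MISTRAL_API_KEY", "")
    if not api_key.strip():
        problems.append("MISTRAL_API_KEY is missing or empty")
    elif api_key != api_key.strip():
        problems.append("MISTRAL_API_KEY has leading or trailing whitespace")
    elif api_key == "your_mistral_api_key":
        problems.append("MISTRAL_API_KEY still holds the placeholder value")

    # MISTRAL_MODEL is optional, so only flag it when set to an empty value.
    model = env.get("MISTRAL_MODEL")
    if model is not None and not model.strip():
        problems.append("MISTRAL_MODEL is set but empty")

    return problems


# With no argument the helper reads the current process environment.
for problem in check_mistral_ocr_config():
    print("config problem:", problem)
```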
For additional help:
- Check the paperless-gpt issues
- Join our Discord community