RFC: Let models declare accepted file MIME types for native file handling

## Summary

Models should be able to declare which file MIME types they accept (e.g. `application/pdf`, `image/*`) so the frontend can adapt the upload UI and the backend can deliver files in the provider-native format.

## Related issues

- #482 — general file upload
- #609 — PDFs/text/images
- #1505 — PDF support
- #1652 — file upload for assistants

## Current state

- `multimodal: true` + `multimodalAcceptedMimetypes` works well for images
- There's no equivalent for documents (PDF, DOCX, etc.)
- Binary files are currently base64-encoded and wrapped in XML tags inside the prompt text, a format most models can't interpret

## Proposal

Add an `acceptedFileMimetypes` field to the model config:

```json
{
  "name": "gpt-4o",
  "multimodal": true,
  "acceptedFileMimetypes": ["image/*", "application/pdf"]
}
```
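To make the shape of the new field concrete, here is a minimal TypeScript sketch of how the config entry could be typed and sanity-checked at load time. Only `name`, `multimodal` and `acceptedFileMimetypes` come from this proposal; the `ModelFileConfig` interface, the `MIME_PATTERN` regex and the `invalidMimetypes` helper are hypothetical names used for illustration.

```typescript
// Hypothetical typing of the config entry shown above.
interface ModelFileConfig {
  name: string;
  multimodal?: boolean;
  acceptedFileMimetypes?: string[];
}

// Accepts "type/subtype" or "type/*" (e.g. "application/pdf", "image/*").
// Deliberately simplified; it does not cover every valid MIME token.
const MIME_PATTERN = /^[a-z0-9!#$&^_.+-]+\/(\*|[a-z0-9!#$&^_.+-]+)$/i;

// Returns the entries that do not look like a MIME type or wildcard,
// so a config loader could reject or warn about them.
function invalidMimetypes(config: ModelFileConfig): string[] {
  return (config.acceptedFileMimetypes ?? []).filter(
    (pattern) => !MIME_PATTERN.test(pattern)
  );
}

const example: ModelFileConfig = {
  name: "gpt-4o",
  multimodal: true,
  acceptedFileMimetypes: ["image/*", "application/pdf"],
};

console.log(invalidMimetypes(example)); // no invalid entries for this config
```

Validating at config load would surface typos such as `"pdf"` early, rather than at upload time.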

**How it works:**
1. Each model declares which file MIME types it accepts
2. The frontend merges `acceptedFileMimetypes` with `multimodalAcceptedMimetypes` to determine which upload options to show
3. The endpoint adapter delivers files in the provider-native format (e.g., OpenAI's `file` content part for PDFs, `image_url` for images)
4. For models/providers that don't natively handle a file type, the existing text extraction fallback is used unchanged
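The four steps above can be sketched roughly as follows. This is an illustration under stated assumptions, not the planned implementation: `mimeMatches`, `mergeAccepted`, `toOpenAIPart`, `UploadedFile` and the `extractText` callback are hypothetical names, and the content part shapes follow my understanding of OpenAI's Chat Completions format (`image_url` with a data URL, `file` with `filename` and `file_data`), which should be checked against the provider docs.

```typescript
interface UploadedFile {
  name: string;
  mime: string;
  base64: string; // file bytes, base64-encoded
}

// Assumed OpenAI-style content parts; verify against the provider reference.
type ContentPart =
  | { type: "image_url"; image_url: { url: string } }
  | { type: "file"; file: { filename: string; file_data: string } }
  | { type: "text"; text: string };

// True when `mime` matches `pattern`, where pattern may use "*" wildcards.
function mimeMatches(pattern: string, mime: string): boolean {
  const bare = (mime.split(";")[0] ?? "").trim().toLowerCase(); // drop parameters
  const [pType, pSub] = pattern.trim().toLowerCase().split("/");
  const [mType, mSub] = bare.split("/");
  if (!pType || !pSub || !mType || !mSub) return false;
  return (pType === "*" || pType === mType) && (pSub === "*" || pSub === mSub);
}

// Step 2: merge both lists (deduplicated) to decide what the upload UI offers.
function mergeAccepted(model: {
  multimodalAcceptedMimetypes?: string[];
  acceptedFileMimetypes?: string[];
}): string[] {
  return [
    ...new Set([
      ...(model.multimodalAcceptedMimetypes ?? []),
      ...(model.acceptedFileMimetypes ?? []),
    ]),
  ];
}

// Steps 3 and 4: native delivery when declared, extracted text otherwise.
function toOpenAIPart(
  file: UploadedFile,
  accepted: string[],
  extractText: (f: UploadedFile) => string
): ContentPart {
  const isAccepted = accepted.some((p) => mimeMatches(p, file.mime));
  if (!isAccepted) {
    return { type: "text", text: extractText(file) };
  }
  const dataUrl = `data:${file.mime};base64,${file.base64}`;
  if (mimeMatches("image/*", file.mime)) {
    return { type: "image_url", image_url: { url: dataUrl } };
  }
  return { type: "file", file: { filename: file.name, file_data: dataUrl } };
}
```

Other endpoint adapters would implement their own equivalent of `toOpenAIPart`; the matching and merging logic could be shared between frontend and backend.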

**Why this approach:**
- Works for **any provider** — OpenAI, Anthropic, self-hosted via vLLM/Ollama, HF Inference API
- Models that natively support PDFs (GPT-4o, Claude, Gemini) get native handling
- Self-hosted models can still receive extracted text as fallback
- No heavy dependencies (no LibreOffice, no server-side PDF parsing required in core)
- Backward compatible — `supportsBinaryDocs: true` can be mapped to `acceptedFileMimetypes: [...]`
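As a rough sketch of the backward-compatibility point, a config normalizer could translate the legacy flag into the new field. The `LegacyModelConfig` type and `normalizeAcceptedFileMimetypes` function are hypothetical, and the list that `supportsBinaryDocs: true` should expand to is not decided here, so it is passed in as a parameter rather than hard-coded.

```typescript
// Hypothetical config shape covering both the legacy flag and the new field.
interface LegacyModelConfig {
  name: string;
  supportsBinaryDocs?: boolean;
  acceptedFileMimetypes?: string[];
}

// An explicit `acceptedFileMimetypes` always wins; otherwise the legacy flag
// expands to whatever default list the maintainers settle on.
function normalizeAcceptedFileMimetypes(
  config: LegacyModelConfig,
  legacyDefault: string[]
): string[] {
  if (config.acceptedFileMimetypes !== undefined) {
    return config.acceptedFileMimetypes;
  }
  return config.supportsBinaryDocs ? [...legacyDefault] : [];
}
```

Letting the explicit field take precedence means existing configs keep working while new configs can narrow or widen the set per model.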

**Comparison with other projects:**
- LibreChat has a multi-stage file processor pipeline with per-endpoint config
- Open WebUI has pluggable storage backends and document RAG workflows
- This proposal is lighter: trust the model/provider to handle what it declares it supports

I'm preparing PRs for this. Would love feedback from maintainers on the approach before finalizing.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

RFC: Let models declare accepted file MIME types for native file handling #2188

Summary

Related issues

Current state

Proposal

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

RFC: Let models declare accepted file MIME types for native file handling #2188

Description

Summary

Related issues

Current state

Proposal

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions