[ENHANCEMENT] Native PDF multimodal analysis (upload + text/visual understanding)

## Type
Enhancement

## Problem / Value
Roo Code cannot upload/analyze PDFs for multimodal analysis, blocking users from leveraging model-native capabilities to:
- Understand document structure and layout
- Analyze charts, diagrams, and tables
- Extract information from forms and technical documents
- Comprehend visual elements like flowcharts and architectural diagrams

Who is affected: All users working with PDFs containing visual content  
Current behavior: PDF files cannot be uploaded or analyzed  
Expected behavior: Users can upload PDFs and receive analysis of both text and visual content

## Model Support Status (2024–2025)
All major providers support native PDF multimodal analysis:
- Claude: PDF upload with image/table analysis
- ChatGPT: PDF upload with multimodal interpretation
- Gemini 2.5: PDF upload with comprehensive multimodal capabilities

## Use Cases
- Analyze research papers with charts and diagrams
- Understand technical documentation with flowcharts
- Process forms and structured documents
- Review presentations and reports with visual content
- Analyze code documentation with UML diagrams

## Acceptance Criteria
Given a user has selected a model that supports PDF multimodal analysis (Claude, ChatGPT, or Gemini),  
When they upload a PDF containing visual elements,  
Then the AI analyzes both text and visual content (charts, diagrams, tables).

Given a user uploads a PDF with complex layouts,  
When they ask questions about the document structure,  
Then the AI understands and responds based on visual layout and organization.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ENHANCEMENT] Native PDF multimodal analysis (upload + text/visual understanding) #7266

Type

Problem / Value

Model Support Status (2024–2025)

Use Cases

Acceptance Criteria

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[ENHANCEMENT] Native PDF multimodal analysis (upload + text/visual understanding) #7266

Description

Type

Problem / Value

Model Support Status (2024–2025)

Use Cases

Acceptance Criteria

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions