[FEATURE] Implement Vision Capabilities for A2A servers OpenAI-Compatible Agent

## Summary

Some OpenAI-compatible providers models support vision, it would be great if an A2A server gets out of the box vision capabilities. For example a browser-agent might want to solve captchas, and it needs vision.

It's important to note that while the Inference Gateway supports those models, not all providers have them, so an error will be thrown when the chosen model is attaching an image to the payload - the operator have to choose the right model for the right tasks.

### Acceptance Criteria

- [ ] The A2A internal agent supports sending images as base64 or image urls to OpenAI-compatible APIs
- [ ] The default agent support it without the need to implement custom code
- [ ] It's also possible to run it with custom tasks - the user gets the OpenAI compatible agent and they can decide how to use it
- [ ] It's documented
- [ ] It's tested


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] Implement Vision Capabilities for A2A servers OpenAI-Compatible Agent #117

Summary

Acceptance Criteria

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[FEATURE] Implement Vision Capabilities for A2A servers OpenAI-Compatible Agent #117

Description

Summary

Acceptance Criteria

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions