Skip to content

FCE-2759 Add support for agent image capture#228

Merged
Qizot merged 2 commits intomainfrom
agent-image-capture
Feb 9, 2026
Merged

FCE-2759 Add support for agent image capture#228
Qizot merged 2 commits intomainfrom
agent-image-capture

Conversation

@Qizot
Copy link
Contributor

@Qizot Qizot commented Feb 4, 2026

Description

This PR adds support for agent image capture.

Motivation and Context

Why is this change required? What problem does it solve? If it fixes an open
issue, please link to the issue here.

Documentation impact

  • Documentation update required
  • Documentation updated in another PR
  • No documentation update required

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to
    not work as expected)

@linear
Copy link

linear bot commented Feb 4, 2026

Copy link

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR adds support for agent image capture functionality, enabling agents to request and receive captured images from video tracks. The implementation includes new protobuf message types, SDK methods for image capture requests, and a multimodal example demonstrating audio+video interactions with the Gemini Live API.

Changes:

  • Added captureImage method to FishjamAgent for requesting track images
  • Extended protobuf definitions with AgentRequest_CaptureImage and AgentResponse_TrackImage messages
  • Implemented periodic image capture in a new multimodal example that integrates Fishjam with Gemini Live API

Reviewed changes

Copilot reviewed 14 out of 16 changed files in this pull request and generated 1 comment.

Show a summary per file
File Description
packages/js-server-sdk/src/agent.ts Added captureImage method and trackImage event support
packages/js-server-sdk/src/integrations/gemini.ts Added metadata field to audio codec parameters
packages/fishjam-proto/src/fishjam/agent_notifications.ts Added protobuf message types for image capture requests and responses
packages/fishjam-proto/src/fishjam/server_notifications.ts Added channel management notifications and fixed typos in comments
examples/multimodal/* New example demonstrating multimodal interactions with Gemini
.gitmodules Updated protos submodule branch reference

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@Qizot Qizot force-pushed the agent-image-capture branch from c576d44 to 87216be Compare February 9, 2026 11:02
@Qizot Qizot merged commit e95069b into main Feb 9, 2026
1 check passed
@Qizot Qizot deleted the agent-image-capture branch February 9, 2026 12:37
PiotrWodecki added a commit that referenced this pull request Feb 9, 2026
PiotrWodecki added a commit that referenced this pull request Feb 9, 2026
@Qizot Qizot restored the agent-image-capture branch February 9, 2026 13:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants