feat: Enable browser use for Gemini models with image support #5026

daniel-lxs · 2025-06-22T22:42:30Z

Description

This PR enables browser use functionality for Gemini models that support images, aligning with Cline's approach where browser interaction works through screenshot analysis rather than direct computer control.

Changes Made

**Refactored to **: Updated the entire codebase to use more accurate terminology
Updated browser capability logic: Changed from checking to checking in
Added browser support to Gemini models: All Gemini models with now have
Updated UI labels: Changed references from "computer use" to "browser use" in settings and model info
Added comprehensive tests: Created new test suite to verify browser capability logic

Testing

All existing tests pass
Added new test file
Verified that Gemini models with image support can now use browser tools
UI correctly displays browser use capability based on image support

Files Changed

Core type definition and all provider configurations
Browser capability check logic
UI components and translations
Test files updated to reflect new property names

Checklist

Code follows project style guidelines
Self-review completed
Tests added for new functionality
All tests passing
No breaking changes for existing functionality

Screenshots

The browser use capability is now enabled for Gemini models like that support images.

Related Issues

Fixes Gemini model Browser Use #5003 - Gemini model Browser Use
Based on approach from Cline PR ENG-524 Remove supportsComputerUse restriction and support browser use through any model that supports images cline/cline#3048

- Replace supportsComputerUse with supportsBrowserUse throughout codebase - Enable browser use for any model that supports images - Update Gemini models configuration to include supportsBrowserUse - Update UI labels from 'computer use' to 'browser use' - Add comprehensive tests for browser capability logic This change aligns with Cline's approach where browser interaction works through screenshot analysis rather than direct computer control.

RooCodeInc/Roo-Code#5026 does this more thoroughly, but limits browser use to Claude and Gemini for some reason. From my testing it additionally also works with GPT-4.1, Mistral Medium 3 and Qwen 2.5 VL

chrarnoldus · 2025-06-30T07:51:02Z

Any reason to only enable browser use for Gemini (in addition to Claude) and not for all models that support images (as is done in Cline)?

I tested it with other models like GPT-4.1 and Mistral Medium 3 and it seems to work fine.

daniel-lxs · 2025-07-01T22:03:15Z

@chrarnoldus
No reason, the original issue just mentioned Gemini models, This PR is just a POC we probably need to handle this differently.

github-project-automation bot added this to Roo Code Roadmap and Roo Code Roadmap Jun 22, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Jun 22, 2025

github-project-automation bot moved this to Triage in Roo Code Roadmap Jun 22, 2025

daniel-lxs moved this from Triage to PR [Draft / In Progress] in Roo Code Roadmap Jun 22, 2025

hannesrudolph added the PR - Draft / In Progress label Jun 22, 2025

chrarnoldus mentioned this pull request Jun 29, 2025

Enable browser use for all browsers that support images Kilo-Org/kilocode#921

Merged

daniel-lxs closed this Jul 1, 2025

github-project-automation bot moved this from New to Done in Roo Code Roadmap Jul 1, 2025

github-project-automation bot moved this from PR [Draft / In Progress] to Done in Roo Code Roadmap Jul 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Enable browser use for Gemini models with image support #5026

feat: Enable browser use for Gemini models with image support #5026

Uh oh!

daniel-lxs commented Jun 22, 2025 •

edited

Loading

Uh oh!

chrarnoldus commented Jun 30, 2025

Uh oh!

daniel-lxs commented Jul 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

feat: Enable browser use for Gemini models with image support #5026

feat: Enable browser use for Gemini models with image support #5026

Uh oh!

Conversation

daniel-lxs commented Jun 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Changes Made

Testing

Files Changed

Checklist

Screenshots

Related Issues

Uh oh!

chrarnoldus commented Jun 30, 2025

Uh oh!

daniel-lxs commented Jul 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

daniel-lxs commented Jun 22, 2025 •

edited

Loading