Merged
3 changes: 2 additions & 1 deletion docs.json
@@ -105,7 +105,8 @@
"group": "Computer Use",
"pages": [
"integrations/computer-use/anthropic",
"integrations/computer-use/openai"
"integrations/computer-use/openai",
"integrations/computer-use/gemini"
]
},
"integrations/magnitude",
31 changes: 31 additions & 0 deletions integrations/computer-use/gemini.mdx
@@ -0,0 +1,31 @@
---
title: "Gemini"
---

[Gemini 2.5 Computer Use](https://ai.google.dev/gemini-api/docs/multimodal-agents) is a Google capability that lets AI models operate a computer through screen observation, cursor control, and keyboard input. With it, AI agents can navigate websites, interact with web applications, and carry out multi-step tasks autonomously.

By integrating Gemini 2.5 Computer Use with Kernel (via Stagehand), you can run these AI-powered browser automations on cloud-hosted infrastructure, so you don't have to manage browsers locally and can scale agents across many browser sessions.

## Quick setup with our example template

The fastest way to get started is our TypeScript template, [ts-stagehand-google-cua-agent](https://github.com/onkernel/ts-stagehand-google-cua-agent), a complete working example that shows how to:
- Set up Gemini 2.5 Computer Use with Kernel
- Use Stagehand for browser automation
- Run AI-powered web interactions on cloud infrastructure
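
As a rough sketch of the wiring the template covers, the snippet below maps a cloud browser session onto options for a browser automation client that attaches over the Chrome DevTools Protocol. The `BrowserSession` and `RemoteBrowserOptions` shapes, their field names, and the `toRemoteBrowserOptions` helper are assumptions made for illustration. They are not taken from the Kernel or Stagehand SDKs, so check the template repository for the actual calls.

```typescript
// Hypothetical shape of a cloud browser session (illustrative only).
interface BrowserSession {
  sessionId: string;
  cdpWsUrl: string; // WebSocket endpoint for the Chrome DevTools Protocol
}

// Hypothetical options for an automation client that attaches to an
// already-running browser instead of launching one locally.
interface RemoteBrowserOptions {
  env: "LOCAL";
  localBrowserLaunchOptions: { cdpUrl: string };
}

// Build client options that point at the remote browser.
// Rejects anything that is not a ws:// or wss:// URL so that a
// misconfigured session fails early rather than at connect time.
function toRemoteBrowserOptions(session: BrowserSession): RemoteBrowserOptions {
  if (!/^wss?:\/\//.test(session.cdpWsUrl)) {
    throw new Error(`Expected a WebSocket CDP URL, got: ${session.cdpWsUrl}`);
  }
  return {
    env: "LOCAL",
    localBrowserLaunchOptions: { cdpUrl: session.cdpWsUrl },
  };
}
```

In a real app, the session would come from the Kernel SDK and the resulting options would be passed to the automation client before starting the Gemini agent.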

## Benefits of using Kernel with Gemini Computer Use

- **No local browser management**: Run Computer Use automations without installing or maintaining browsers locally
- **Scalability**: Launch multiple browser sessions in parallel for concurrent automations
- **Stealth mode**: Built-in anti-detection features for web interactions
- **Session persistence**: Maintain browser state across automation runs
- **Live view**: Debug your automations with real-time browser viewing
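
To illustrate the scalability point, here is a minimal sketch of running several automations concurrently while capping how many browser sessions are open at once. `runWithConcurrency` is a hypothetical helper written for this example, not part of Kernel or Stagehand. The `worker` callback stands in for whatever creates a browser session, runs the agent, and cleans up.

```typescript
// Run `worker` over `items` with at most `limit` calls in flight.
// Results are returned in the same order as `items`.
async function runWithConcurrency<T, R>(
  items: readonly T[],
  limit: number,
  worker: (item: T, index: number) => Promise<R>,
): Promise<R[]> {
  if (!Number.isInteger(limit) || limit < 1) {
    throw new RangeError("limit must be a positive integer");
  }
  const results: R[] = new Array(items.length);
  let next = 0; // index of the next item to claim

  // Each lane repeatedly claims the next unprocessed item until none remain.
  async function lane(): Promise<void> {
    while (next < items.length) {
      const i = next++;
      results[i] = await worker(items[i], i);
    }
  }

  const laneCount = Math.min(limit, items.length);
  await Promise.all(Array.from({ length: laneCount }, () => lane()));
  return results;
}
```

A call such as `runWithConcurrency(tasks, 5, runOneAutomation)` would keep up to five sessions busy at a time, where `runOneAutomation` is your own function. If any worker rejects, the returned promise rejects too, so wrap session cleanup in `try`/`finally` inside the worker.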

## Next steps

- Check out [live view](/browsers/live-view) for debugging your automations
- Learn about [stealth mode](/browsers/stealth) for avoiding detection
- Learn how to properly [terminate browser sessions](/browsers/termination)
- Learn how to [deploy](/apps/deploy) your Computer Use app to Kernel