Commit f9361a8

docs: Add Gemini Computer Use integration page
1 parent 6853e1f commit f9361a8

2 files changed, +33 -1 lines changed

docs.json

Lines changed: 2 additions & 1 deletion
@@ -105,7 +105,8 @@
 "group": "Computer Use",
 "pages": [
 "integrations/computer-use/anthropic",
-"integrations/computer-use/openai"
+"integrations/computer-use/openai",
+"integrations/computer-use/gemini"
 ]
 },
 "integrations/magnitude",
Lines changed: 31 additions & 0 deletions
@@ -0,0 +1,31 @@
+---
+title: "Gemini"
+---
+
+[Gemini 2.5 Computer Use](https://ai.google.dev/gemini-api/docs/multimodal-agents) is Google's powerful capability that enables AI models to interact with computers through screen observation, cursor control, and keyboard input. This feature allows AI agents to autonomously navigate websites, interact with web applications, and perform complex multi-step tasks.
+
+By integrating Gemini 2.5 Computer Use with Kernel (via Stagehand), you can run these AI-powered browser automations on cloud-hosted infrastructure, eliminating the need for local browser management and enabling scalable, reliable AI agents.
+
+## Quick setup with our example template
+
+Get started quickly with our TypeScript template that demonstrates Gemini 2.5 Computer Use with Kernel:
+
+Check out the [ts-stagehand-google-cua-agent](https://github.com/onkernel/ts-stagehand-google-cua-agent) repository for a complete working example that shows how to:
+- Set up Gemini 2.5 Computer Use with Kernel
+- Use Stagehand for browser automation
+- Run AI-powered web interactions on cloud infrastructure
+
+## Benefits of using Kernel with Gemini Computer Use
+
+- **No local browser management**: Run Computer Use automations without installing or maintaining browsers locally
+- **Scalability**: Launch multiple browser sessions in parallel for concurrent automations
+- **Stealth mode**: Built-in anti-detection features for web interactions
+- **Session persistence**: Maintain browser state across automation runs
+- **Live view**: Debug your automations with real-time browser viewing
+
+## Next steps
+
+- Check out [live view](/browsers/live-view) for debugging your automations
+- Learn about [stealth mode](/browsers/stealth) for avoiding detection
+- Learn how to properly [terminate browser sessions](/browsers/termination)
+- Learn how to [deploy](/apps/deploy) your Computer Use app to Kernel
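
The new page mentions launching multiple browser sessions in parallel and terminating sessions properly. As a rough illustration of that pattern only, the sketch below runs each task in its own session and always closes the session afterwards. Every name in it (`BrowserSession`, `SessionFactory`, `runInParallel`, `fakeFactory`) is hypothetical and is not part of the Kernel, Stagehand, or Gemini APIs; a real integration would supply its own session factory in place of the fake one.

```typescript
// Hypothetical minimal session shape; NOT the Kernel or Stagehand API.
interface BrowserSession {
  id: string;
  close(): Promise<void>;
}

type SessionFactory = () => Promise<BrowserSession>;
type Task<T> = (session: BrowserSession) => Promise<T>;
type Outcome<T> = { ok: true; value: T } | { ok: false; error: unknown };

// Runs one task in a fresh session and closes the session even if the task throws.
async function runOne<T>(createSession: SessionFactory, task: Task<T>): Promise<Outcome<T>> {
  let session: BrowserSession;
  try {
    session = await createSession();
  } catch (error) {
    return { ok: false, error };
  }
  try {
    return { ok: true, value: await task(session) };
  } catch (error) {
    return { ok: false, error };
  } finally {
    await session.close();
  }
}

// Starts all tasks concurrently; one failing task does not cancel the others.
function runInParallel<T>(createSession: SessionFactory, tasks: Task<T>[]): Promise<Outcome<T>[]> {
  return Promise.all(tasks.map((task) => runOne(createSession, task)));
}

// Stand-in factory for local experimentation; it only counts opened and closed sessions.
let opened = 0;
let closed = 0;
const fakeFactory: SessionFactory = async () => {
  opened += 1;
  return { id: `session-${opened}`, close: async () => { closed += 1; } };
};
```

Returning a per-task outcome rather than rejecting the whole batch keeps one failed automation from hiding the results of the others, and the `finally` block is what ensures no session is left running.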
