Skip to content

A Kernel application that demonstrates Computer Use Agent (CUA) capabilities using Google's Gemini 2.5 model with Stagehand for browser automation.

Notifications You must be signed in to change notification settings

kernel/ts-stagehand-google-cua-agent

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Kernel TypeScript SDK + Stagehand + Gemini Computer Use Agent

A Kernel application that demonstrates Computer Use Agent (CUA) capabilities using Google's Gemini 2.5 model with Stagehand for browser automation.

gemini-2.5-cua-demo.mp4

What It Does

This app uses Gemini 2.5's computer use model capabilities to autonomously navigate websites and complete tasks. The example task searches for Kernel's company page on YCombinator and writes a blog post about their product.

Setup

  1. Copy the environment file:

    cp .env-example .env
  2. Add your API keys to .env:

Running Locally

Execute the script directly with tsx:

npx tsx index.ts

This runs the agent without a Kernel invocation context and provides the browser live view URL for debugging.

Deploying to Kernel

  1. Deploy the application:

    kernel deploy index.ts --env-file .env
  2. Invoke the action:

    kernel invoke ts-stagehand-google-cua-agent google-cua-agent-task

The action creates a Kernel-managed browser and associates it with the invocation for tracking and monitoring.

Documentation

About

A Kernel application that demonstrates Computer Use Agent (CUA) capabilities using Google's Gemini 2.5 model with Stagehand for browser automation.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published