Closed as duplicate of #1460
Labels
Issue - Unassigned / Actionable: Clear and approved. Available for contributors to pick up.
bug: Something isn't working
Description
Which version of the app are you using?
3.7.12
Which API Provider are you using?
OpenRouter
Which Model are you using?
R1
What happened?
"browser", // Browser group (browser_action)
which you can open in a browser.
## browser_action
Description: Request to interact with a Puppeteer-controlled browser. Every action, except `close`, will be responded to with a screenshot of the browser's current state, along with any new console logs. You may only perform one browser action per message, and wait for the user's response including a screenshot and logs to determine the next action.
- The sequence of actions must always start with launching the browser at a URL, and must always end with closing the browser. If you need to visit a new URL that is not possible to navigate to from the current webpage, you must first close the browser, then launch again at the new URL.
- While the browser is active, only the `browser_action` tool can be used. No other tools should be called during this time. You may proceed to use other tools only after closing the browser. For example if you run into an error and need to fix a file, you must close the browser, then use other tools to make the necessary changes, then re-launch the browser to verify the result.
- The browser window has a resolution of 1280x800 pixels. When performing any click actions, ensure the coordinates are within this resolution range.
- Before clicking on any elements such as icons, links, or buttons, you must consult the provided screenshot of the page to determine the coordinates of the element. The click should be targeted at the center of the element, not on its edges.
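The rules above can be checked mechanically. The following is a minimal Python sketch, not part of the extension: `BrowserAction`, `center_of`, and `validate_sequence` are hypothetical names, the sketch covers a single launch-to-close session, and it assumes the coordinate bounds are inclusive (0 to 1280 and 0 to 800), which the prompt does not state.

```python
from dataclasses import dataclass
from typing import Optional, Sequence

# Resolution stated in the rules; treating the bounds as inclusive is an assumption.
VIEWPORT_WIDTH, VIEWPORT_HEIGHT = 1280, 800


@dataclass(frozen=True)
class BrowserAction:
    """One browser_action request (hypothetical type; fields mirror the parameters)."""
    action: str
    url: Optional[str] = None
    coordinate: Optional[tuple[int, int]] = None
    text: Optional[str] = None


def center_of(left: int, top: int, width: int, height: int) -> tuple[int, int]:
    """Center of an element's bounding box, the click target the rules recommend."""
    return (left + width // 2, top + height // 2)


def validate_sequence(actions: Sequence[BrowserAction]) -> list[str]:
    """Return rule violations for one browser session; an empty list means valid."""
    if not actions:
        return ["sequence is empty"]
    errors: list[str] = []
    if actions[0].action != "launch":
        errors.append("first action must be launch")
    if actions[-1].action != "close":
        errors.append("last action must be close")
    last = len(actions) - 1
    for i, a in enumerate(actions):
        if a.action == "launch" and i != 0:
            errors.append(f"action {i}: close the browser before launching again")
        if a.action == "close" and i != last:
            errors.append(f"action {i}: close must be the final action")
        if a.action == "launch" and a.url is None:
            errors.append(f"action {i}: launch requires a url")
        if a.action == "type" and a.text is None:
            errors.append(f"action {i}: type requires text")
        if a.action == "click":
            if a.coordinate is None:
                errors.append(f"action {i}: click requires a coordinate")
            else:
                x, y = a.coordinate
                if not (0 <= x <= VIEWPORT_WIDTH and 0 <= y <= VIEWPORT_HEIGHT):
                    errors.append(f"action {i}: coordinate {x},{y} is outside 1280x800")
    return errors
```

A session such as launch, click at 450,300, close passes with no violations, while a lone click at 2000,300 is flagged for the missing launch, the missing close, and the out-of-range coordinate.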
Parameters:
- action: (required) The action to perform. The available actions are:
    - launch: Launch a new Puppeteer-controlled browser instance at the specified URL. This must always be the first action.
        - Use with the `url` parameter to provide the URL.
        - Ensure the URL is valid and includes the appropriate protocol (e.g. http://localhost:3000/page, file:///path/to/file.html, etc.)
    - click: Click at a specific x,y coordinate.
        - Use with the `coordinate` parameter to specify the location.
        - Always click in the center of an element (icon, button, link, etc.) based on coordinates derived from a screenshot.
    - type: Type a string of text on the keyboard. You might use this after clicking on a text field to input text.
        - Use with the `text` parameter to provide the string to type.
    - scroll_down: Scroll down the page by one page height.
    - scroll_up: Scroll up the page by one page height.
    - close: Close the Puppeteer-controlled browser instance. This must always be the final browser action.
        - Example: <action>close</action>
- url: (optional) Use this for providing the URL for the `launch` action.
    - Example: https://example.com
- coordinate: (optional) The X and Y coordinates for the `click` action. Coordinates should be within the 1280x800 resolution.
    - Example: 450,300
- text: (optional) Use this for providing the text for the `type` action.
    - Example: Hello, world!
Usage:
<browser_action>
<action>Action to perform (e.g., launch, click, type, scroll_down, scroll_up, close)</action>
<url>URL to launch the browser at (optional)</url>
<coordinate>x,y coordinates (optional)</coordinate>
<text>Text to type (optional)</text>
</browser_action>
Example: Requesting to launch a browser at https://example.com
<browser_action>
<action>launch</action>
<url>https://example.com</url>
</browser_action>
Example: Requesting to click on the element at coordinates 450,300
<browser_action>
<action>click</action>
<coordinate>450,300</coordinate>
</browser_action>
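For reference, requests in this shape could be assembled by a small helper. This is a hedged Python sketch: `render_browser_action` is a hypothetical name, and it assumes each parameter is wrapped in a tag named after it (as in the `<action>close</action>` example) and that values are inserted verbatim, without XML escaping.

```python
from typing import Optional


def render_browser_action(
    action: str,
    url: Optional[str] = None,
    coordinate: Optional[tuple[int, int]] = None,
    text: Optional[str] = None,
) -> str:
    """Render one browser_action request, emitting only the parameters provided."""
    lines = ["<browser_action>", f"<action>{action}</action>"]
    if url is not None:
        lines.append(f"<url>{url}</url>")
    if coordinate is not None:
        x, y = coordinate
        lines.append(f"<coordinate>{x},{y}</coordinate>")  # "x,y" as in the examples
    if text is not None:
        lines.append(f"<text>{text}</text>")
    lines.append("</browser_action>")
    return "\n".join(lines)


print(render_browser_action("launch", url="https://example.com"))
print(render_browser_action("click", coordinate=(450, 300)))
```

Omitting unused parameters keeps each request limited to the fields its action needs, matching the launch and click examples.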