Commit 3c0190c

docs: Add documentation for Cursor's browser control feature and update journal entry
- Created a new page detailing the capabilities and use cases of the Cursor AI browser control feature through the MCP server
- Updated journal entry to reference the new documentation and provide context on Gordon Pedersen's contributions
1 parent e8995d8 commit 3c0190c

2 files changed: +55 -1 lines changed

journals/2025_11_04.md

Lines changed: 2 additions & 1 deletion
@@ -1,4 +1,5 @@
  ## Perennial Challenges
  - [[Chrome/Extension]] [[Browser/Extension]] #Markdown
  - Evaluating [GitHub - deathau/markdownload: A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file.](https://github.com/deathau/markdownload)
- - [[Person/Gordon Pedersen]] created - Software developer and maintainer of markdownload browser extension, known online as death.au
+ - [[Person/Gordon Pedersen]] created - Software developer and maintainer of markdownload browser extension, known online as death.au
+ - [[CursorAI/Feature/Browser Control]] - Created page documenting Cursor's browser control feature through MCP server, allowing AI to interact with web pages directly
Lines changed: 53 additions & 0 deletions
@@ -0,0 +1,53 @@
+ alias:: [[CursorAI/Feature/Browser Control]], [[CursorAI/MCP/Browser Extension]]
+ - # Cursor Browser Control Feature
+ - ## Overview
+ - Cursor AI can control web browsers directly through the [[CursorAI/MCP/Browser Extension]] MCP server
+ - This feature allows the AI agent to interact with web pages for automated testing, debugging, and UI improvements
+ - Previously in beta, the Browser feature reached General Availability in [[CursorAI/v/2.0]]
+ - ## Capabilities
+ - ### Navigation
+ - Navigate to URLs
+ - Navigate back through browser history
+ - Resize browser window
+ - ### Interaction
+ - Click elements on pages
+ - Type into form fields
+ - Fill forms with multiple fields
+ - Select dropdown options
+ - Hover over elements
+ - Drag and drop between elements
+ - ### Inspection
+ - Take screenshots of pages or specific elements
+ - Capture accessibility snapshots (better than screenshots for understanding page structure)
+ - View console messages
+ - Monitor network requests
+ - Evaluate JavaScript expressions
+ - ### Advanced Features
+ - Wait for text to appear or disappear
+ - Handle dialogs (alerts, confirms, prompts)
+ - Manage multiple browser tabs
+ - Handle forms with validation
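
As a rough illustration of how an agent might invoke the capabilities listed above: MCP clients call server tools with JSON-RPC 2.0 `tools/call` requests. The sketch below only builds such requests. The tool names and argument keys (`browser_navigate`, `browser_click`, `browser_snapshot`, `url`, `selector`) and the localhost URL are hypothetical placeholders, not the documented tool names of `cursor-browser-extension`.

```python
import itertools
import json

# Monotonically increasing JSON-RPC request ids.
_ids = itertools.count(1)


def tool_call(name: str, arguments: dict) -> dict:
    """Build an MCP `tools/call` request in a JSON-RPC 2.0 envelope."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


# Hypothetical tool names: one navigation, one interaction, one inspection call.
requests = [
    tool_call("browser_navigate", {"url": "http://localhost:3000"}),
    tool_call("browser_click", {"selector": "#submit"}),
    tool_call("browser_snapshot", {}),
]

for request in requests:
    print(json.dumps(request))
```

Only the envelope (`jsonrpc`, `id`, `method`, `params.name`, `params.arguments`) follows the MCP specification; the actual tool names should be read from the server's own tool list.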
+ - ## Use Cases
+ - ### Automated Testing
+ - Execute end-to-end tests by navigating through user flows
+ - Fill forms and verify outputs
+ - Streamline the testing phase
+ - ### UI Enhancements
+ - Analyze web interfaces
+ - Propose or implement refinements directly within the browser
+ - Reduce need for manual back-and-forth adjustments
+ - ### Client-Side Debugging
+ - Detect and address client-side issues
+ - Access console logs and network activities
+ - Facilitate efficient debugging
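
The testing and debugging use cases above can be sketched as a small flow: navigate, fill a form, submit, verify the resulting text, and read the console on failure. `FakeBrowser` is an in-memory stand-in invented for this example so the flow is runnable; it is not a Cursor or MCP API, and the login page it simulates is hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class FakeBrowser:
    """In-memory stand-in for a browser session (hypothetical, for illustration)."""

    url: str = ""
    fields: dict = field(default_factory=dict)
    page_text: str = ""
    console: list = field(default_factory=list)

    def navigate(self, url: str) -> None:
        self.url = url
        self.page_text = "Sign in"

    def type_into(self, name: str, value: str) -> None:
        self.fields[name] = value

    def click(self, target: str) -> None:
        # Simulate the app's response to submitting the form.
        if target == "submit" and self.fields.get("email"):
            self.page_text = f"Welcome, {self.fields['email']}"
        else:
            self.page_text = "Sign in"
            self.console.append("error: email is required")


def run_login_flow(browser: FakeBrowser, email: str) -> bool:
    """Navigate, fill the form, submit, and verify the resulting page text."""
    browser.navigate("http://localhost:3000/login")
    browser.type_into("email", email)
    browser.click("submit")
    return "Welcome" in browser.page_text


ok_browser = FakeBrowser()
print(run_login_flow(ok_browser, "a@example.com"))

failing_browser = FakeBrowser()
print(run_login_flow(failing_browser, ""), failing_browser.console)
```

In a real session the agent would issue the equivalent browser tool calls and inspect actual console output rather than a simulated list.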
+ - ## Technical Details
+ - Implemented as an MCP server: `cursor-browser-extension`
+ - Provides browser control tools through the Model Context Protocol
+ - Integrated into Cursor's agent interface
+ - Includes embedded in-editor support with tools to select elements and forward DOM information to the agent
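
Because the tools are exposed through the Model Context Protocol, a client can in principle discover them with a `tools/list` request. The sketch below shows the request shape and how a response might be read. The sample response is invented for illustration, and the tool names in it are assumptions rather than the actual tool list of `cursor-browser-extension`.

```python
import json

# MCP tool discovery request (JSON-RPC 2.0).
LIST_TOOLS_REQUEST = {"jsonrpc": "2.0", "id": 1, "method": "tools/list"}

# Hypothetical response; real tool names and schemas come from the server.
SAMPLE_RESPONSE = json.loads(
    """
    {
      "jsonrpc": "2.0",
      "id": 1,
      "result": {
        "tools": [
          {"name": "browser_snapshot", "description": "Capture page structure",
           "inputSchema": {"type": "object", "properties": {}}},
          {"name": "browser_navigate", "description": "Open a URL",
           "inputSchema": {"type": "object",
                           "properties": {"url": {"type": "string"}},
                           "required": ["url"]}}
        ]
      }
    }
    """
)


def tool_names(response: dict) -> list[str]:
    """Return the sorted tool names from a `tools/list` response."""
    return sorted(tool["name"] for tool in response["result"]["tools"])


print(tool_names(SAMPLE_RESPONSE))
```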
+ - ## Related
+ - [[CursorAI/Settings/MCP]] - Configure MCP servers
+ - [[CursorAI/Docs/Context/Model Context Protocol]] - MCP documentation
+ - [[CursorAI/v/2.0]] - Browser feature GA announcement
+ - [[MCP Server]] - Model Context Protocol overview
