|
| 1 | +alias:: [[CursorAI/Feature/Browser Control]], [[CursorAI/MCP/Browser Extension]] |
| 2 | +- # Cursor Browser Control Feature |
| 3 | + - ## Overview |
| 4 | + - Cursor AI can control web browsers directly through the [[CursorAI/MCP/Browser Extension]] MCP server |
| 5 | + - This feature lets the AI agent interact with web pages to support automated testing, debugging, and UI improvements |
| 6 | + - Previously in beta, the Browser feature reached General Availability in [[CursorAI/v/2.0]] |
| 7 | + - ## Capabilities |
| 8 | + - ### Navigation |
| 9 | + - Navigate to URLs |
| 10 | + - Navigate back through browser history |
| 11 | + - Resize browser window |
| 12 | + - ### Interaction |
| 13 | + - Click elements on pages |
| 14 | + - Type into form fields |
| 15 | + - Fill forms with multiple fields |
| 16 | + - Select dropdown options |
| 17 | + - Hover over elements |
| 18 | + - Drag and drop between elements |
| 19 | + - ### Inspection |
| 20 | + - Take screenshots of pages or specific elements |
| 21 | + - Capture accessibility snapshots (usually more useful than screenshots for understanding page structure) |
| 22 | + - View console messages |
| 23 | + - Monitor network requests |
| 24 | + - Evaluate JavaScript expressions |
| 25 | + - ### Advanced Features |
| 26 | + - Wait for text to appear or disappear |
| 27 | + - Handle dialogs (alerts, confirms, prompts) |
| 28 | + - Manage multiple browser tabs |
| 29 | + - Handle forms with validation |
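The "wait for text to appear or disappear" capability can be pictured as a polling loop with a timeout. The sketch below is only an illustration of that idea, not Cursor's implementation: `wait_for_text`, `get_page_text`, and `fake_page` are hypothetical names, and the page is simulated by a plain Python function rather than a real browser.

```python
import time
from typing import Callable


def wait_for_text(
    get_page_text: Callable[[], str],
    text: str,
    *,
    present: bool = True,
    timeout: float = 5.0,
    interval: float = 0.1,
) -> bool:
    """Poll until `text` is present (or absent, if present=False).

    Returns True if the condition was met before the timeout, else False.
    `get_page_text` is a hypothetical callable standing in for whatever
    returns the current page text.
    """
    deadline = time.monotonic() + timeout
    while True:
        if (text in get_page_text()) == present:
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)


# Simulated page: shows a loading message twice, then the final content.
_states = iter(["Loading...", "Loading..."])


def fake_page() -> str:
    return next(_states, "Welcome back")


appeared = wait_for_text(fake_page, "Welcome back", timeout=1.0, interval=0.01)
print(appeared)
```

Using a monotonic clock for the deadline keeps the timeout unaffected by system clock adjustments, and the `present` flag lets one helper cover both the "appear" and "disappear" cases.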
| 30 | + - ## Use Cases |
| 31 | + - ### Automated Testing |
| 32 | + - Execute end-to-end tests by navigating through user flows |
| 33 | + - Fill forms and verify outputs |
| 34 | + - Streamline the testing phase |
| 35 | + - ### UI Enhancements |
| 36 | + - Analyze web interfaces |
| 37 | + - Propose or implement refinements directly within the browser |
| 38 | + - Reduce the need for manual back-and-forth adjustments |
| 39 | + - ### Client-Side Debugging |
| 40 | + - Detect and address client-side issues |
| 41 | + - Access console logs and network activity |
| 42 | + - Facilitate efficient debugging |
| 43 | + - ## Technical Details |
| 44 | + - Implemented as an MCP server: `cursor-browser-extension` |
| 45 | + - Provides browser control tools through the Model Context Protocol |
| 46 | + - Integrated into Cursor's agent interface |
| 47 | + - Includes embedded in-editor support with tools to select elements and forward DOM information to the agent |
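Because the browser tools are exposed through the Model Context Protocol, each action reaches the server as a JSON-RPC 2.0 `tools/call` request. The sketch below builds such a message by hand to show its shape. The tool name `browser_navigate`, its `url` argument, and the localhost address are assumptions for illustration; the names the `cursor-browser-extension` server actually exposes should be read from its `tools/list` response.

```python
import itertools
import json

# Monotonically increasing JSON-RPC request ids.
_ids = itertools.count(1)


def build_tool_call(tool_name: str, arguments: dict) -> dict:
    """Build an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Hypothetical tool name and argument, used only to illustrate the message shape.
request = build_tool_call("browser_navigate", {"url": "http://localhost:3000"})
print(json.dumps(request, indent=2))
```

In normal use the Cursor agent issues these calls itself; constructing one manually is mainly useful for understanding or debugging what passes between the agent and the MCP server.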
| 48 | + - ## Related |
| 49 | + - [[CursorAI/Settings/MCP]] - Configure MCP servers |
| 50 | + - [[CursorAI/Docs/Context/Model Context Protocol]] - MCP documentation |
| 51 | + - [[CursorAI/v/2.0]] - Browser feature GA announcement |
| 52 | + - [[MCP Server]] - Model Context Protocol overview |
| 53 | + |