You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This project builds upon the foundation of the [browser-use](https://github.com/browser-use/browser-use), which is designed to make websites accessible for AI agents. We have enhanced the original capabilities by providing:
6
+
7
+
1.**A Brand New WebUI:** We offer a comprehensive web interface that supports a wide range of `browser-use` functionalities. This UI is designed to be user-friendly and enables easy interaction with the browser agent.
8
+
9
+
2.**Expanded LLM Support:** We've integrated support for various Large Language Models (LLMs), including: Gemini, OpenAI, Azure OpenAI, Anthropic etc. And we plan to add support for even more models in the future.
10
+
11
+
3.**Custom Browser Support:** You can use your own browser with our tool, eliminating the need to re-login to sites or deal with other authentication challenges. This feature also supports high-definition screen recording.
12
+
13
+
4.**Customized Agent:** We've implemented a custom agent that enhances `browser-use` with Optimized prompts.
14
+
15
+
<videosrc="https://github.com/user-attachments/assets/cc4ca59f-e4a5-43d8-86db-bb0e6edbedef"controls="controls"width="500"height="300" >Your browser does not support playing this video!</video>
16
+
17
+
## Environment Installation
18
+
19
+
1.**Python Version:** Ensure you have Python 3.11 or higher installed.
20
+
2.**Install `browser-use`:**
21
+
```bash
22
+
pip install browser-use
23
+
```
24
+
3. **Install Playwright:**
25
+
```bash
26
+
playwright install
27
+
```
28
+
4. **Install Dependencies:**
29
+
```bash
30
+
pip install -r requirements.txt
31
+
```
32
+
5. **Configure Environment Variables:**
33
+
- Copy `.env.example` to `.env` and set your environment variables, including API keys for the LLM.
34
+
- **If using your own browser:**
35
+
- Set `CHROME_PATH` to the executable path of your browser (e.g., `C:\Program Files\Google\Chrome\Application\chrome.exe` on Windows).
36
+
- Set `CHROME_USER_DATA` to the user data directory of your browser (e.g.,`C:\Users\<YourUsername>\AppData\Local\Google\Chrome\User Data`).
37
+
38
+
## Usage
39
+
40
+
1. **Run the WebUI:**
41
+
```bash
42
+
python webui.py --ip 127.0.0.1 --port 7788
43
+
```
44
+
2. **Access the WebUI:** Open your web browser and navigate to `http://127.0.0.1:7788`.
45
+
3. **Using Your Own Browser:**
46
+
- Open the WebUI in a non-Chrome browser, such as Firefox or Edge. This is important because the persistent browser context will use the Chrome data when running the agent.
47
+
- Check the "Use Own Browser" option within the Browser Settings.
0 commit comments