You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
7. Start recording. Once recording begins, navigate to the Chrome browser, browse some pages, and perform a few clicks. Then, stop the recording and let it complete successfully.
195
-
196
-
8. After recording, check the `openadapt.db` table `browser_event`. It should contain all your browser activity logs. You can verify the data's correctness using the `sqlite3` CLI or an extension like `SQLite Viewer` in VS Code to open `data/openadapt.db`.
197
-
198
-
199
167
### Visualize
200
168
201
169
Quickly visualize the latest recording you created by running the following command:
@@ -243,6 +211,7 @@ Other replay strategies include:
243
211
-[`StatefulReplayStrategy`](https://github.com/OpenAdaptAI/OpenAdapt/blob/main/openadapt/strategies/stateful.py): Early proof-of-concept which uses the OpenAI GPT-4 API with prompts constructed via OS-level window data.
244
212
- (*)[`VisualReplayStrategy`](https://github.com/OpenAdaptAI/OpenAdapt/blob/main/openadapt/strategies/visual.py): Uses [Fast Segment Anything Model (FastSAM)](https://github.com/CASIA-IVA-Lab/FastSAM) to segment active window.
245
213
- (*)[`VanillaReplayStrategy`](https://github.com/OpenAdaptAI/OpenAdapt/blob/main/openadapt/strategies/vanilla.py): Assumes the model is capable of directly reasoning on states and actions accurately. With future frontier models, we hope that this script will suddenly work a lot better.
214
+
- (*)[`VisualBrowserReplayStrategy`](https://github.com/OpenAdaptAI/OpenAdapt/blob/main/openadapt/strategies/visual_browser.py): Like VisualReplayStrategy but generates segments from the visible DOM read by the browser extension.
246
215
247
216
248
217
The (*) prefix indicates strategies which accept an "instructions" parameter that is used to modify the recording, e.g.:
See https://github.com/OpenAdaptAI/OpenAdapt/tree/main/openadapt/strategies for a complete list. More ReplayStrategies coming soon! (see [Contributing](#Contributing)).
255
224
225
+
### Browser integration
226
+
227
+
To record browser events in Google Chrome (required by the `BrowserReplayStrategy`), follow these steps:
228
+
229
+
1. Go to your Chrome extensions page by entering [chrome://extensions](chrome://extensions/) in your address bar.
230
+
231
+
2. Enable `Developer mode` (located at the top right).
232
+
233
+
3. Click `Load unpacked` (located at the top left).
234
+
235
+
4. Select the `chrome_extension` directory in the OpenAdapt repo.
236
+
237
+
5. Make sure the Chrome extension is enabled (the switch to the right of the OpenAdapt extension widget is turned on).
238
+
239
+
6. Set the `RECORD_BROWSER_EVENTS` flag to `true` in `openadapt/data/config.json`.
240
+
256
241
## Features
257
242
258
243
### State-of-the-art GUI understanding via [Segment Anything in High Quality](https://github.com/SysCV/sam-hq):
@@ -306,13 +291,6 @@ We're looking forward to your contributions. Let's build the future 🚀
306
291
307
292
## Contributing
308
293
309
-
### Notable Works-in-progress (incomplete, see https://github.com/OpenAdaptAI/OpenAdapt/pulls and https://github.com/OpenAdaptAI/OpenAdapt/issues/ for more)
0 commit comments