You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Please note that the WorkArena benchmark is still undergoing minor bug fixes and updates, which may cause discrepancies with results reported in our latest arXiv preprint. We plan to release soon a stable version of WorkArena v0.1.0 with enhanced stability, and a final version v1.0.0 with a new suite of tasks.
14
+
Please note that the WorkArena benchmark is still undergoing minor bug fixes and updates, which may cause discrepancies with results reported in our latest arXiv preprint. We plan to release soon a stable version of WorkArena with enhanced stability, and a final version v1.0.0 with a new suite of tasks.
15
15
16
16
## Benchmark Contents
17
17
18
-
At the moment, WorkArena includes `18,050` task instances drawn from `29` tasks that cover the main components of the ServiceNow user interface. The following videos show an agent built on `GPT-4-vision` interacting with every such component. As emphasized by our results, this benchmark is not solved and thus, the performance of the agent is not always on point.
18
+
At the moment, WorkArena includes `18,050` task instances drawn from `33` tasks that cover the main components of the ServiceNow user interface. The following videos show an agent built on `GPT-4-vision` interacting with every such component. As emphasized by our results, this benchmark is not solved and thus, the performance of the agent is not always on point.
**Goal:** The agent must extract information from a dashboard.
57
+
58
+
59
+
54
60
## Getting Started
55
61
56
62
To setup WorkArena, you will need to get your own ServiceNow instance, install our Python package, and upload some data to your instance. Follow the steps below to achieve this.
57
63
58
64
### a) Create a ServiceNow Developer Instance
59
65
60
66
1. Go to https://developer.servicenow.com/ and create an account.
61
-
2. Click on `Request an instance` and select the `Utah` release (initializing the instance will take a few minutes)
67
+
2. Click on `Request an instance` and select the `Washington` release (initializing the instance will take a few minutes)
62
68
3. Once the instance is ready, you should see your instance URL and credentials. If not, click _Return to the Developer Portal_, then navigate to _Manage instance password_ and click _Reset instance password_.
63
69
4. You should now see your URL and credentials. Based on this information, set the following environment variables:
64
70
*`SNOW_INSTANCE_URL`: The URL of your ServiceNow developer instance
0 commit comments