Skip to content

Commit fa0bfb3

Browse files
recursixgasse
authored andcommitted
imrove readme
1 parent 197b251 commit fa0bfb3

File tree

1 file changed

+16
-4
lines changed

1 file changed

+16
-4
lines changed

README.md

Lines changed: 16 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -9,10 +9,22 @@
99
AgentLab is a framework for developing and evaluating agents on a variety of
1010
benchmarks supported by [BrowserGym](https://github.com/ServiceNow/BrowserGym).
1111
This includes:
12-
* WebArena
13-
* WorkArena.L1, L2, L3
14-
* VisualWebArena (coming soon...)
15-
* MiniWoB
12+
* [WebArena](https://webarena.dev/)
13+
* [WorkArena](https://github.com/ServiceNow/WorkArena) L1, L2, L3
14+
* [WebLinx](https://mcgill-nlp.github.io/weblinx/)
15+
* [VisualWebArena](https://github.com/web-arena-x/visualwebarena)
16+
* Assistant Bench
17+
* GAIA
18+
* Mind2Web-live (coming soon ...)
19+
* [MiniWoB](https://miniwob.farama.org/index.html)
20+
21+
AgentLab Features:
22+
* Easy large scale parallel agent experiments using [ray](https://www.ray.io/)
23+
* Building blocks for making agents
24+
* Unified LLM api for OpenRouter, OpenAI, Azure, Self hosted using TGI.
25+
* Prefered way for running benchmarks like WebArena
26+
* Various Reproducibility features
27+
* Unified LeaderBoard
1628

1729
The framework enables the desing of rich hyperparameter spaces and the launch of
1830
parallel experiments using ablation studies or random searches. It also provides

0 commit comments

Comments
 (0)