File tree Expand file tree Collapse file tree 1 file changed +16
-4
lines changed
Expand file tree Collapse file tree 1 file changed +16
-4
lines changed Original file line number Diff line number Diff line change 99AgentLab is a framework for developing and evaluating agents on a variety of
1010benchmarks supported by [ BrowserGym] ( https://github.com/ServiceNow/BrowserGym ) .
1111This includes:
12- * WebArena
13- * WorkArena.L1, L2, L3
14- * VisualWebArena (coming soon...)
15- * MiniWoB
12+ * [ WebArena] ( https://webarena.dev/ )
13+ * [ WorkArena] ( https://github.com/ServiceNow/WorkArena ) L1, L2, L3
14+ * [ WebLinx] ( https://mcgill-nlp.github.io/weblinx/ )
15+ * [ VisualWebArena] ( https://github.com/web-arena-x/visualwebarena )
16+ * Assistant Bench
17+ * GAIA
18+ * Mind2Web-live (coming soon ...)
19+ * [ MiniWoB] ( https://miniwob.farama.org/index.html )
20+
21+ AgentLab Features:
22+ * Easy large scale parallel agent experiments using [ ray] ( https://www.ray.io/ )
23+ * Building blocks for making agents
24+ * Unified LLM api for OpenRouter, OpenAI, Azure, Self hosted using TGI.
25+ * Prefered way for running benchmarks like WebArena
26+ * Various Reproducibility features
27+ * Unified LeaderBoard
1628
1729The framework enables the desing of rich hyperparameter spaces and the launch of
1830parallel experiments using ablation studies or random searches. It also provides
You can’t perform that action at this time.
0 commit comments