You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -235,7 +235,7 @@ dynamic benchmarks.
235
235
***Reproduced results in the leaderboard**. For agents that are repdocudibile, we encourage users
236
236
to try to reproduce the results and upload them to the leaderboard. There is a special column
237
237
containing information about all reproduced results of an agent on a benchmark.
238
-
***ReproducibilityAgent**: You can run this agent on an existing study and it will try to re-run
238
+
***ReproducibilityAgent**: [You can run this agent](src/agentlab/agents/generic_agent/reproducibility_agent.py) on an existing study and it will try to re-run
239
239
the same actions on the same task seeds. A vsiual diff of the two prompts will be displayed in the
240
240
AgentInfo HTML tab of AgentXray. You will be able to inspect on some tasks what kind of changes
241
241
between to two executions. **Note**: this is a beta feature and will need some adaptation for your
0 commit comments