Commit a40022c

Merge pull request #26 from MiroMindAI/test/yfd
refactor(config): simplify config and script naming
2 parents 6a132cb + 0978b8a commit a40022c

13 files changed, +6 -164 lines changed

config/benchmark/browsecomp-en-200.yaml

Lines changed: 2 additions & 0 deletions
@@ -14,6 +14,8 @@ execution:
   max_tasks: null # null = no limit, or specify a number
   max_concurrent: 5 # Number of parallel tasks
   pass_at_k: 1 # Number of attempts per task
+  max_retry: 5
+  exceed_max_turn_summary: true

 # OpenAI API key for evaluation (required for browsecomp since it has ground truth)
 openai_api_key: "${oc.env:OPENAI_API_KEY,???}"
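
The `openai_api_key` value above appears to use OmegaConf's `oc.env` resolver with `???` as the fallback, which would mark the key as a mandatory missing value when `OPENAI_API_KEY` is not exported. Under that assumption, a pre-flight check in the calling shell could fail early instead of at config resolution time. The helper name `require_env` is hypothetical and not part of this commit.

```shell
# Hypothetical helper: succeed only if the named variable is exported and non-empty.
# printenv only sees exported variables, which is what a child process would inherit.
require_env() {
  if [ -z "$(printenv "$1")" ]; then
    echo "Missing required environment variable: $1" >&2
    return 1
  fi
}

# Example usage before launching an evaluation run:
#   require_env OPENAI_API_KEY || exit 1
```

This is only a sketch of a guard around the existing configuration; the commit itself does not add such a check.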

config/fangda_agent_browsecomp-en-200_mirothinker_single_agent_rollback_new_tools_toolblacklist.yaml renamed to config/fangda_agent_browsecomp-en-200_mirothinker.yaml

File renamed without changes.

config/fangda_agent_browsecomp-zh_mirothinker_single_agent_rollback_new_tools_toolblacklist.yaml renamed to config/fangda_agent_browsecomp-zh_mirothinker.yaml

File renamed without changes.

config/fangda_agent_gaia-validation-text-only_mirothinker_single_agent_rollback_new_tools_toolblacklist.yaml renamed to config/fangda_agent_gaia-validation-text-only_mirothinker.yaml

File renamed without changes.

config/fangda_agent_gaia-validation-text-only_mirothinker_single_agent.yaml

Lines changed: 0 additions & 36 deletions
This file was deleted.

config/fangda_agent_gaia-validation-text-only_mirothinker_single_agent_new_tools.yaml

Lines changed: 0 additions & 37 deletions
This file was deleted.

config/fangda_agent_gaia-validation-text-only_mirothinker_single_agent_rollback.yaml

Lines changed: 0 additions & 43 deletions
This file was deleted.

config/fangda_agent_gaia-validation-text-only_mirothinker_single_agent_rollback_new_tools.yaml

Lines changed: 0 additions & 44 deletions
This file was deleted.

config/fangda_agent_gaia-validation_mirothinker_single_agent_rollback_new_tools_toolblacklist.yaml renamed to config/fangda_agent_gaia-validation_mirothinker.yaml

File renamed without changes.
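
The four renames listed above all drop the same `_single_agent_rollback_new_tools_toolblacklist` suffix. As an illustration of that naming rule only, the sketch below derives the simplified file name from an old one. The function name `simplified_name` is hypothetical, and nothing here claims to be how the renames were actually performed.

```shell
# Suffix removed by the renames in this commit.
SUFFIX="_single_agent_rollback_new_tools_toolblacklist"

# Hypothetical helper: print the simplified config name for a given .yaml path.
# Names that do not end in the suffix are printed unchanged.
simplified_name() {
  local base="${1%.yaml}"                 # drop the .yaml extension
  printf '%s.yaml\n' "${base%"$SUFFIX"}"  # drop the suffix if present
}

# Example usage (dry run, prints the rename that would be applied):
#   for f in config/*"$SUFFIX".yaml; do
#     echo "git mv $f $(simplified_name "$f")"
#   done
```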

scripts/fangda_run_evaluate_multiple_runs_mirothinker_browsecomp-en-200.sh

Lines changed: 1 addition & 1 deletion
@@ -7,7 +7,7 @@
 # Configuration parameters
 NUM_RUNS=3
 BENCHMARK_NAME="browsecomp-en-200"
-AGENT_SET="fangda_agent_browsecomp-en-200_mirothinker_single_agent_rollback_new_tools_toolblacklist"
+AGENT_SET="fangda_agent_browsecomp-en-200_mirothinker"
 MAX_CONCURRENT=50

 # Set results directory with timestamp
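
Because `AGENT_SET` now has to match the renamed file `config/fangda_agent_browsecomp-en-200_mirothinker.yaml`, a stale name would point at a config that no longer exists. The sketch below shows one way to catch that before a run starts. It assumes that `AGENT_SET` maps to `config/<AGENT_SET>.yaml`, which the diff suggests but does not state, and both function names are hypothetical.

```shell
# Hypothetical helper: map an AGENT_SET name to the config path it is assumed to select.
# The second argument (config directory) defaults to "config".
agent_config_path() {
  printf '%s/%s.yaml\n' "${2:-config}" "$1"
}

# Hypothetical pre-flight check: fail early if the expected config file is missing.
check_agent_config() {
  local path
  path="$(agent_config_path "$@")"
  if [ ! -f "$path" ]; then
    echo "Config not found: $path" >&2
    return 1
  fi
  echo "Using config: $path"
}

# Example usage near the top of the run script:
#   check_agent_config "$AGENT_SET" || exit 1
```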
