You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/ROAD-TO-V2-Overhaul.md
+6Lines changed: 6 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -64,6 +64,12 @@ Tasks updated in this pass:
64
64
65
65
The intent here was high-signal evaluation: remove "mind-reading" requirements, but do not turn tasks into copy-paste exercises.
66
66
67
+
## What changed in MCP tools mode
68
+
69
+
The `--use-mcp-tools` prompt was rewritten to be minimal and neutral. The previous prompt included workflow coaching (read files, search code, don't guess) that could inflate scores independent of actual MCP tool usage. The new prompt is a single sentence nudging the agent to use its MCP tools proactively, without teaching problem-solving strategy. This makes with-vs-without comparisons fairer.
70
+
71
+
A new `mcp_prompt` config field was added to `AgentConfig`, allowing per-agent MCP tool guidance (e.g., telling Gemini to use `@web` search). This is appended under an `AGENT-SPECIFIC TOOLS:` header when `--use-mcp-tools` is set.
72
+
67
73
## Compatibility and comparing old runs
68
74
69
75
`1.7.x` is intentionally not identical to `v1.6.1` behavior. If you are comparing against historical leaderboard-era runs, use legacy mode:
You have access to MCP (Model Context Protocol) tools. Use them proactively:
1510
-
- Use file reading tools to examine stub files and test files thoroughly
1511
-
- Use code search tools to find patterns, helper functions, or related implementations
1512
-
- Use any available analysis tools to understand the codebase structure
1513
-
- Prefer using tools to gather context over making assumptions
1514
-
1515
-
Do NOT guess at implementation details that tools can help you discover.`
1509
+
You have access to MCP tools. Carefully assess what they do and how they can be used as effectively as possible, then use them as proactively as you can wherever and whenever most suitable.`
0 commit comments