Skip to content

Commit 517af96

Browse files
authored
Update README.md
1 parent 8aea8dd commit 517af96

File tree

1 file changed

+1
-2
lines changed

1 file changed

+1
-2
lines changed

README.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,8 +6,7 @@ _For the implementation used in the paper [SWE-Search: Enhancing Software Agents
66
## SWE-Bench
77
I use the [SWE-bench benchmark](https://www.swebench.com/) as a way to verify my ideas.
88

9-
* [Claude 3.5 Sonnet v20241022 evaluation results](https://experiments.moatless.ai/evaluations/20250113_claude_3_5_sonnet_20241022_temp_0_0_iter_20_fmt_tool_call_hist_messages_lite) - 39% solve rate, 2.7 resolved instances per dollar
10-
* [Deepseek V3](https://experiments.moatless.ai/evaluations/20250111_deepseek_chat_v3_temp_0_0_iter_20_fmt_react_hist_react) - 30.7% solve rate, 24 resolved instances per dollar
9+
* Claude 4 Sonnet - 70.8% solve rate, $0.63 per instance.
1110

1211
# Try it out
1312

0 commit comments

Comments
 (0)