You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+4-1Lines changed: 4 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,6 +8,8 @@ Deep research has broken out as one of the most popular agent applications. This
8
8
9
9
### 🔥 Recent Updates
10
10
11
+
**August 7, 2025**: Added support for GPT-5 models and updated the Deep Research Bench evaluation to use GPT-5.
12
+
11
13
**August 2, 2025**: Achieved #6 ranking on the [Deep Research Bench Leaderboard](https://huggingface.co/spaces/Ayanami0730/DeepResearch-Leaderboard) with an overall score of 0.4344.
12
14
13
15
**July 30, 2025**: Read about the evolution from our original implementations to the current version in our [blog post](https://rlancemartin.github.io/2025/07/30/bitter_lesson/).
@@ -103,9 +105,10 @@ This creates `tests/expt_results/deep_research_bench_model-name.jsonl` with the
103
105
104
106
| Name | Commit | Summarization | Research | Compression | Total Cost | Total Tokens | RACE Score | Experiment |
0 commit comments