Skip to content

Commit ceb9672

Browse files
committed
Update README.md
1 parent 4ef46fb commit ceb9672

File tree

1 file changed

+20
-1
lines changed

1 file changed

+20
-1
lines changed

optillm/plugins/deepthink/README.md

Lines changed: 20 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -98,7 +98,26 @@ The plugin will:
9898

9999
## Performance
100100

101-
The Deep Think approach has shown significant improvements on complex reasoning tasks:
101+
The Deep Think approach has shown significant improvements on complex reasoning tasks, with particularly strong results on mathematical competition problems.
102+
103+
### AIME 2025 Results
104+
105+
| Model | Approach | Accuracy | Improvement |
106+
|-------|----------|----------|-------------|
107+
| qwen-3-32b | Baseline | 43.33% | - |
108+
| qwen-3-32b | Deep Think | **63.33%** | **+20.00pp** |
109+
110+
*Experimental settings: max_completion_tokens=16382, temperature=0.7, top_p=0.95*
111+
112+
**Key Findings:**
113+
- **46% relative improvement** over baseline on mathematical reasoning
114+
- **Cerebras inference** was crucial for enabling high inference-time compute without latency penalty
115+
- The combination of SELF-DISCOVER structure discovery and uncertainty-routed sampling proved particularly effective for competition mathematics
116+
- Enhanced accuracy on multi-step problems requiring systematic reasoning
117+
118+
### Other Improvements
119+
120+
The Deep Think approach has also demonstrated:
102121
- Enhanced accuracy on multi-step problems
103122
- Better handling of ambiguous or open-ended questions
104123
- Improved consistency across different problem types

0 commit comments

Comments
 (0)