-<strong>MiroFlow</strong> is an open-source research agent framework that achieves <strong>#1 ranking</strong> across representative benchmarks (FutureX, GAIA, HLE, xBench-DeepSearch, BrowseComp).<br>
-It powers <a href="https://github.com/MiroMindAI/mirothinker">MiroThinker</a>, our open-source agent foundation model with native tool-assisted reasoning.
+<strong>MiroFlow</strong> is the open-source agent framework that maximizes any model's agent performance, and proves it across 9+ benchmarks with reproducible results.<br>
+Plug in GPT-5, Claude, <a href="https://github.com/MiroMindAI/mirothinker">MiroThinker</a>, Kimi, DeepSeek, or any OpenAI-compatible model. Same tools. Same environment. Better results.
 </div>

 <br>
@@ -34,22 +34,61 @@ It powers <a href="https://github.com/MiroMindAI/mirothinker">MiroThinker</a>, o

 - **[2025-09-15]**: **MiroFlow v0.3**: Enhanced codebase architecture and significantly improved benchmark performance, boosting GPT-5's prediction accuracy for future events by 11%. MiroFlow now ranks #1 in the future prediction benchmark. See [FutureX](https://futurex-ai.github.io/).
 - **[2025-08-27]**: **MiroFlow v0.2**: Achieves state-of-the-art performance across [multiple agentic benchmarks](https://miromind.ai), including HLE (27.2%), HLE-Text-Only (29.5%), BrowseComp-EN (33.2%), BrowseComp-ZH (47.1%), and xBench-DeepSearch (72.0%).
-- **[2025-08-26]**: Released [GAIA Validation Trace](docs/public_trace.md) (73.94% pass@1) and [Gradio Demo](https://github.com/MiroMindAI/MiroThinker/tree/main/apps/gradio-demo) for local deployment.
+- **[2025-08-26]**: Released GAIA Validation Trace (73.94% pass@1) and [Gradio Demo](https://github.com/MiroMindAI/MiroThinker/tree/main/apps/gradio-demo) for local deployment.
 - **[2025-08-08]**: **MiroFlow v0.1**: Complete open-source release of the research agent framework.

 </details>

 ---

-## Highlights
+## Architecture

-- **Reproducible State-of-the-Art Performance**: #1 ranking across [multiple representative agentic benchmarks](https://miromindai.github.io/miroflow/evaluation_overview/), including FutureX, GAIA, HLE, xBench-DeepSearch, and BrowseComp.
-- **High Concurrency & Reliability**: Robust concurrency management and fault-tolerant design for handling rate-limited APIs and unstable networks.
-- **Cost-Effective Deployment**: Run a research agent service on a single RTX 4090 with the open-source [MiroThinker](https://github.com/MiroMindAI/mirothinker) model and free tools.
+- **Model-Agnostic Performance**: Plug in any LLM (GPT-5, Claude, MiroThinker, Kimi K2.5, DeepSeek) and get better agent performance through smart rollback, iterative reasoning, and optimized tool orchestration.
+- **#1 Across 9+ Benchmarks**: Reproducible state-of-the-art on FutureX, GAIA, HLE, xBench-DeepSearch, BrowseComp, and more.
+- **One-Line Model Switching**: Change `provider_class` and `model_name` in YAML. Same tools, same prompts, same environment.
+
+### Prove It
+- **Standardized Evaluation**: Fair model comparison with identical infrastructure. The framework is the constant; the model is the variable.
+- **Automated Multi-Run Evaluation**: Parallel runs with statistical aggregation (mean, std dev, min/max). Every result reproducible from config to score.
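
The statistical aggregation mentioned in the multi-run evaluation item (mean, std dev, min/max) could be sketched roughly as follows. This is a minimal illustration, not MiroFlow's implementation: the `aggregate_runs` helper and the three sample scores are hypothetical, and the parallel execution of the runs themselves is left out.

```python
import statistics


def aggregate_runs(scores):
    """Summarize per-run benchmark scores.

    Hypothetical helper for illustration; not part of MiroFlow's API.
    """
    if not scores:
        raise ValueError("at least one run is required")
    return {
        "runs": len(scores),
        "mean": statistics.mean(scores),
        # Sample standard deviation needs two or more runs; report 0.0 otherwise.
        "std": statistics.stdev(scores) if len(scores) > 1 else 0.0,
        "min": min(scores),
        "max": max(scores),
    }


# Made-up accuracies from three runs of the same configuration.
summary = aggregate_runs([0.70, 0.74, 0.72])
print(summary)
```

Reporting the spread alongside the mean is what makes a single headline number comparable across models: two configurations with the same mean but very different `std` are not equally reliable.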
+
+### Build With It
+- **Skill System**: Define agent skills via `SKILL.md`, with no code changes needed.
+- **Agent Graph**: Compose multi-agent workflows with hierarchical graphs.
+- **Web Application**: FastAPI + React interface out of the box.
+- **Plugin Architecture**: Use the `@register` decorator to extend the framework without touching core code.
+- **Zero-Code Prompts**: YAML + Jinja2 templates.
+- **Cost-Effective**: Runs on a single RTX 4090 with the open-source [MiroThinker](https://github.com/MiroMindAI/mirothinker) model.
+
+---
+
+## Any Model, Better Results
+
+### Cross-Model Performance (MiroFlow Framework)
+
+| Benchmark | MiroThinker | Claude 3.7 Sonnet | Kimi K2.5 |