Skip to content

fix(error): error in performance report#27

Merged
shiqian-su merged 1 commit intomainfrom
update-readme
Aug 27, 2025
Merged

fix(error): error in performance report#27
shiqian-su merged 1 commit intomainfrom
update-readme

Conversation

@BinWang28
Copy link
Member

deepseek v3.1 should be hle-text performance

This comment was marked as outdated.

@BinWang28 BinWang28 requested a review from Copilot August 27, 2025 06:14
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR corrects performance benchmark data in the README, specifically fixing misallocated scores for DeepSeek v3.1 and adjusting formatting for MiroFlow's results.

  • Moves DeepSeek v3.1's 29.8% score from HLE column to HLE-Text column
  • Removes bold formatting from MiroFlow's HLE score while adding it to HLE-Text score

Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.

| Model/Framework | GAIA Val | HLE | HLE-Text | BrowserComp-EN | BrowserComp-ZH | xBench-DeepSearch |
|----------------|----------|-----|----------|----------------|----------------|-------------------|
| **MiroFlow** | **82.4%** | 27.2% | **29.5%** | 33.2% | **47.1%** | **72.0%** |
| **MiroFlow** | **82.4%** | **27.2%** | 29.5% | 33.2% | **47.1%** | **72.0%** |
Copy link

Copilot AI Aug 27, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[nitpick] The bold formatting change for MiroFlow's HLE score appears inconsistent with the PR description which focuses on DeepSeek v3.1. Consider whether this formatting change is intentional or if it should be reverted to maintain consistency.

Suggested change
| **MiroFlow** | **82.4%** | **27.2%** | 29.5% | 33.2% | **47.1%** | **72.0%** |
| **MiroFlow** | **82.4%** | 27.2% | 29.5% | 33.2% | **47.1%** | **72.0%** |

Copilot uses AI. Check for mistakes.
@shiqian-su shiqian-su merged commit f304655 into main Aug 27, 2025
8 checks passed
@shiqian-su shiqian-su deleted the update-readme branch August 27, 2025 06:34
Zhudongsheng75 pushed a commit to open-compass/MiroFlow that referenced this pull request Dec 27, 2025
BinWang28 added a commit that referenced this pull request Mar 11, 2026
feat(agent): update README and single task running scripts to support the demo.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants