Skip to content

The scores for K2-thinking and MiMo-V2-Flash were mixed up #118

@yetlinghao

Description

@yetlinghao

Hi,

In the benchmark scores you reported, K2-thinking’s SWE-bench Verified score is 73.4. However, this score currently only appears in the results reported for MiMo-V2-Flash. I believe this is a mistake :)

Image Image Image

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions