Skip to content

Commit f4cfaf6

Browse files
YihengWangYihengWang
authored andcommitted
update README by 12-16
1 parent fc5e2e5 commit f4cfaf6

File tree

1 file changed

+5
-5
lines changed

1 file changed

+5
-5
lines changed

README.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -71,14 +71,14 @@ Its design is shaped by following core ideas:
7171

7272
## <img src="assets/icon/news.png" alt="news" height="28" style="vertical-align:middle;" />&nbsp;News
7373
* **[2025‑12‑12] · 📰 Evaluation Published on OpenCompass**
74-
&nbsp;&nbsp;• SciEval’s benchmark results are now live on the [OpenCompass](https://opencompass.org.cn/Intern-Discovery-Eval) platform, providing broader community visibility and comparison.
74+
- SciEval’s benchmark results are now live on the [OpenCompass](https://opencompass.org.cn/Intern-Discovery-Eval) platform, providing broader community visibility and comparison.
7575

7676
* **[2025‑12‑05] · 🚀 SciEval v1 Launch**
77-
&nbsp;&nbsp;• Initial public release of a science‑focused evaluation toolkit and leaderboard devoted to realistic research workflows.
77+
- Initial public release of a science‑focused evaluation toolkit and leaderboard devoted to realistic research workflows.
78+
- Coverage: seven scientific capability dimensions × six major disciplines in the initial benchmark suite.
7879

79-
&nbsp;&nbsp;• Coverage: seven scientific capability dimensions × six major disciplines in the initial benchmark suite.
8080
* **[2025‑12‑05] · 🌟 Community Submissions Open**
81-
&nbsp;&nbsp;• Submit your benchmarks via pull request to appear on the official leaderboard.
81+
- Submit your benchmarks via pull request to appear on the official leaderboard.
8282

8383
## <img src="assets/icon/start.png" alt="start" height="28" style="vertical-align:middle;" />&nbsp;Quick Start
8484

@@ -134,7 +134,7 @@ python run.py \
134134
## <img src="assets/icon/update.png" alt="update" height="28" style="vertical-align:middle;" />&nbsp;Codebase Updates
135135

136136
* **Execution‑based Scoring**
137-
&nbsp;&nbsp;• Code‑generation tasks (SciCode, AstroVisBench) are now graded via sandboxed unit tests.
137+
- Code‑generation tasks (SciCode, AstroVisBench) are now graded via sandboxed unit tests.
138138

139139

140140

0 commit comments

Comments
 (0)