Skip to content

Add ds distill collection#298

Merged
Yunnglin merged 11 commits intomainfrom
add/math_collection
Feb 8, 2025
Merged

Add ds distill collection#298
Yunnglin merged 11 commits intomainfrom
add/math_collection

Conversation

@Yunnglin
Copy link
Collaborator

@Yunnglin Yunnglin commented Feb 7, 2025

No description provided.

@@ -9,6 +9,7 @@ You can also use other tools supported by this framework for evaluation, such as

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

update news for reasoning model eval collection

@@ -9,6 +9,7 @@ You can also use other tools supported by this framework for evaluation, such as

| Name | Dataset ID | Task Category | Remarks |
|-------------------|------------------------------------------------------------------------------------------------------|------------------|--------------------------------------------------------------------------------------------------------------------------|
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

可以跑一个完整的distill模型评测实验来验证一下跟r1 paper中report的结论是否对齐。

@Yunnglin Yunnglin merged commit 478bb4f into main Feb 8, 2025
1 check failed
Yunnglin added a commit that referenced this pull request Mar 7, 2025
* set padding false

* add math benchmark

* fix lint

* add doc

* add level math competition

* add batch infer

* add generation n support

* update eval batch

* add example

* add example

* add example
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants