Replies: 1 comment 1 reply
-
@Consistos is there a quantitative comparison of R1 vs models trained with CoCoNut? Also, open reproductions are 100% good science and pushing the field forward (more than closed models arguably). |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
This leads to a noticeable performance improvement and flexibility over forcing the model to output language at each reasoning step, as showcased in this publication. Code for an implementation has recently been open-sourced.
I think this is an improvement of R1, and as such begs whether this project aims to merely be a 1:1 reproduction or improve over it? The latter is IMO more exciting.
Edit: another related publication
Edit 2: yet another one
Beta Was this translation helpful? Give feedback.
All reactions