Replies: 1 comment
-
@samuk The datasets requested for Open-R1 are domain specific datasets that will be used for fine tuning through RLHF, GRPO methods etc vs common corpus which is generally for pre-training. The following domain specific datasets are currently requested for Open-R1 |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Is the intention to train this on the common corpus/ open training data? https://huggingface.co/blog/Pclanglais/common-models
Beta Was this translation helpful? Give feedback.
All reactions