Replies: 1 comment
-
I haven't tried rlhf datasets yet. You may be able to run preprocess with |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Dataset Used: argilla/dpo-mix-7k
!! The fields are dict instead of strings. !!
So I have written a script to tokenize them and then save the tokenized prompts in a dataset, which I am trying to then load and do the training with custom dataset type.
The training is running currently, but I am not sure if this is the correct way to do this. Can anyone confirm?
Beta Was this translation helpful? Give feedback.
All reactions