Skip to content
Discussion options

You must be logged in to vote

Hello!

datasets:
  - path: microsoft/orca-agentinstruct-1M-v1
    type: chat_template
    split: rag
-    conversation_column: messages
+    field_messages: messages
    message_field_role: role
    message_field_content: content

We don't have a conversation_column field, it's called field_messages, although the default should work for this dataset.


I took a look at this split in particular. Their dataset is a bit weird.

Instead of messages: List[dict] (list of conversation), it's messages: str (stringified list).

https://huggingface.co/datasets/microsoft/orca-agentinstruct-1M-v1/viewer/default/rag?row=0

You'll need to do a small patch to axolotl if you want to get this to work.

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
0 replies
Answer selected by NanoCode012
Comment options

You must be logged in to vote
1 reply
@NanoCode012
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants