Number of question-answer pairs needed to build a private model for an enterprise #737
sunilswain
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
I've been really enjoying how the community helps answer my questions, and I'm learning more about H2o.ai every day. Does anyone know how many question-answer pairs are needed to fine-tune a model effectively? I'm using h2ogpt-llama2-7b as the backbone, and I'm training the model on a single document that's about 20 pages long. Additionally, how many question-answer pairs would be required if I were to train the model on hundreds of documents of the same length?
Also please specify how many epochs are generally considered to be a good number?
Thanks
Beta Was this translation helpful? Give feedback.
All reactions