Skip to content
Discussion options

You must be logged in to vote
  1. Will you share the train code?

  2. How many A100s did you use to train the model?

  3. Can you give information about the dataset size?

  1. Not this time.
  2. We have done a lot of experiments, it depends on how many parameters the model has, we try 1-4 A100-80gb
  3. The more the better.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by baizh0u
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #8 on September 04, 2024 13:41.