Replies: 1 comment
-
Hey, can you try enable cpu offload and reduce batch size? Theoretically, it should fit. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi team and @winglian ,
I get OOM error when pretraining Yi-34 with 8*A100 80GB, flash_attn, deepspeed zero3, sequence_len=2048. So what is minimum gpus for this?
Beta Was this translation helpful? Give feedback.
All reactions