Testing muP. #4
cloneofsimo
started this conversation in
Show and tell
Replies: 2 comments 1 reply
-
Looks correct to me! |
Beta Was this translation helpful? Give feedback.
0 replies
-
Could you please share the loss curves? with respect to training steps |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Initial Tests on muP running from 110M ~ 4.4B models.
Our goal is to see if results from TP-V5 and muScaling results are reproducible, and test to see if muP provides stable scaling-law predictions.
Current Status:
Beta Was this translation helpful? Give feedback.
All reactions