Training Log 2022-11-22 #246
zh-zheng announced in Training Logs 训练日志
Replies: 1 comment
-
Could you explain in detail how scaling weights is applied? What exactly was done, and why does this approach solve the NaN problem? Thanks in advance.
-
CPM-Live Training Log (November 22)
Time: November 22, 2022, 19:00
Recorder: @zh-zheng
Loss
Completed Data
Average Grad Norm
Progress
Comment
Today, the training loss became NaN at around 12:00. We resolved this by scaling the weights, and we'll keep an eye on the model over the next few days.
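The log does not say how the weights were scaled, so the following is only a hypothetical sketch of one way such a fix can work, not the CPM-Live procedure. It assumes the NaN came from half-precision overflow in an intermediate activation. For two linear layers separated by a ReLU, dividing the first weight matrix by a positive factor and multiplying the second by the same factor leaves the output unchanged while shrinking the hidden activations. All names and values here (`rescale`, `alpha`, the toy weights) are illustrative assumptions.

```python
# Hypothetical illustration only; not taken from the CPM-Live codebase.
FP16_MAX = 65504.0  # largest finite float16 value


def matvec(w, x):
    """Multiply matrix w (list of rows) by vector x."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


def relu(v):
    return [max(0.0, a) for a in v]


def forward(w1, w2, x):
    """Two-layer MLP: returns (hidden activations, output)."""
    hidden = relu(matvec(w1, x))
    return hidden, matvec(w2, hidden)


def rescale(w1, w2, alpha):
    """Divide w1 by alpha and multiply w2 by alpha (alpha > 0).

    ReLU satisfies relu(h / alpha) == relu(h) / alpha for alpha > 0,
    so the network output is mathematically unchanged.
    """
    w1_scaled = [[a / alpha for a in row] for row in w1]
    w2_scaled = [[a * alpha for a in row] for row in w2]
    return w1_scaled, w2_scaled


# Toy weights chosen so that one hidden activation exceeds the float16 range.
w1 = [[300.0, -200.0], [150.0, 400.0]]
w2 = [[0.001, -0.002]]
x = [200.0, 100.0]

hidden, out = forward(w1, w2, x)
w1_s, w2_s = rescale(w1, w2, alpha=4.0)
hidden_s, out_s = forward(w1_s, w2_s, x)

print("max hidden before:", max(hidden), "overflows fp16:", max(hidden) > FP16_MAX)
print("max hidden after: ", max(hidden_s), "overflows fp16:", max(hidden_s) > FP16_MAX)
print("outputs:", out[0], out_s[0])
```

In a real transformer the same idea would have to account for residual connections, normalization layers, and optimizer state, which this toy example ignores.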