Training Log 2022-06-28 #46
zh-zheng
announced in
Training Logs 训练日志
Replies: 1 comment 1 reply
-
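Hi, I have a question: the plan mentions that "the entire training cycle is expected to take 5 months." If the loss keeps fluctuating like this, under what conditions do we actually plan to stop training?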
-
CPM-Live Training Log (June 28)
Time: June 28, 2022 16:00
Recorder: @zh-zheng
Loss
Completed Data
Average Grad Norm
Progress
Comment
Another peaceful day😁. Although the loss value still fluctuates, its downward trend remains unchanged. Today I noticed an interesting paper in which researchers use OpenAI Codex to solve university-level mathematics problems. What's more, explanations of the solution code can also be generated automatically. As you can see, when a model performs well and is explainable, it looks charming! We will also make this our goal when training CPM-Live.💪