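Your question is about the model after swapping the dictionary and fine-tuning. Possible causes and optimization suggestions:
1. Dictionary issues
2. Data quality
3. Amount of training data
4. Learning rate
5. Pretrained model
6. Validation set evaluation
Final recommendation
You can try these methods and see whether they help.
Response generated by feifei-bot | chatgpt-4o-latest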
An excerpt from the training log:
[2025/02/10 12:41:00] ppocr INFO: epoch: [650/800], global_step: 6150, lr: 0.000010, acc: 0.585937, norm_edit_dis: 0.847266, Teacher_acc: 0.578125, Teacher_norm_edit_dis: 0.842225, dml_ctc_0: 5.081125, loss: 53.012030, dml_sar_0: 2.156744, loss_distance_l2_Student_Teacher_0: 0.041590, loss_ctc_Student_0: 19.968964, loss_ctc_Teacher_1: 20.565991, loss_sar_Student_0: 2.449266, loss_sar_Teacher_1: 2.413702, avg_reader_cost: 0.43734 s, avg_batch_cost: 2.03909 s, avg_samples: 64.0, ips: 31.38649 samples/s, eta: 1:53:00, max_mem_reserved: 13078 MB, max_mem_allocated: 12140 MB
[2025/02/10 12:41:07] ppocr INFO: save model in ./output/rec_ppocr_v3_distillation/latest
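Log lines in this format pack many metrics into a single line, which makes it hard to compare epochs by eye. As a minimal sketch (the helper name `parse_metrics` and the regular expression are illustrative and not part of PaddleOCR), the numeric fields could be pulled out like this:

```python
import re

# Matches "key: value" pairs whose value is a plain number followed by a
# comma, whitespace, or end of line, e.g. "acc: 0.585937," or
# "avg_reader_cost: 0.43734 s". Fields such as "epoch: [650/800]" and
# "eta: 1:53:00" are skipped on purpose because they are not plain numbers.
PAIR_RE = re.compile(r"(\w+): (-?\d+(?:\.\d+)?)(?=,|\s|$)")


def parse_metrics(line: str) -> dict:
    """Return the numeric metrics found in one training log line."""
    return {key: float(value) for key, value in PAIR_RE.findall(line)}


# Abbreviated copy of the log line quoted above.
sample = (
    "[2025/02/10 12:41:00] ppocr INFO: epoch: [650/800], global_step: 6150, "
    "lr: 0.000010, acc: 0.585937, norm_edit_dis: 0.847266, "
    "Teacher_acc: 0.578125, Teacher_norm_edit_dis: 0.842225"
)
metrics = parse_metrics(sample)
```

Applying this to every `ppocr INFO` line that contains `global_step` gives one dictionary per logging step, so that `acc`, `norm_edit_dis`, and the individual loss terms can be plotted against `global_step`.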
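The dictionary was replaced with ppocr/dicts/chinese_cht_dict.txt, the Traditional Chinese dictionary. The training data consists of 8,000 images, all synthesized with the TextRecognitionDataGenerator tool. The pretrained model loaded is ch_PP-OCRv3_rec_train.
The validation set has 1,000 images, and the evaluated acc is 74%.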
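With a swapped dictionary and fully synthetic labels, one quick sanity check is whether every character in the labels actually appears in the dictionary. The sketch below assumes the dictionary file holds one character per line and that the label file uses the tab-separated `image_path<TAB>label` layout. The function names are hypothetical helpers, not PaddleOCR APIs.

```python
from collections import Counter


def load_dict(dict_path):
    """Read a character dictionary with one entry per line."""
    with open(dict_path, encoding="utf-8") as f:
        entries = (line.rstrip("\r\n") for line in f)
        return {entry for entry in entries if entry}


def find_missing_chars(label_path, charset, delimiter="\t"):
    """Count label characters that are absent from the dictionary.

    Assumes each line looks like "<image path><delimiter><label text>".
    """
    missing = Counter()
    with open(label_path, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip("\r\n").split(delimiter, 1)
            if len(parts) != 2:
                continue  # skip lines that do not match the assumed layout
            missing.update(ch for ch in parts[1] if ch not in charset)
    return missing
```

For example, `find_missing_chars("train_list.txt", load_dict("ppocr/dicts/chinese_cht_dict.txt"))` returns a `Counter` of out-of-dictionary characters, where `train_list.txt` is a placeholder for the actual label file. A non-empty result would suggest that some label characters cannot be predicted with this dictionary. Whether a space character counts as missing depends on the training configuration.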