Continue training on existing model with new train data #9076

suheylkiris · 2023-02-14T20:02:24Z

suheylkiris
Feb 14, 2023

Hello,

I am using some models which provides from PaddleOCR like en_PP-OCRv3_xx or ch_ppocr_server_v2.0_xx. Actually they are really useful but i have new some special cases and so i want to implement my new train data to this models. I try to train new model with Backbones but my local computer GPU is insufficient for train big datasets. Is it possible? If it is, should i use inference models? Could anybody share any guide about this. I have read almost all english documents but i was really confused and feeling like im lost.

Answered by TuanBC

Feb 16, 2023

If you have any change in terms of char_dict, language, or if the accuracy of the inference model is not good for you, then it maybe worth it to try finetune the pretrained model. With the original PPOCRv3 small model config (image size 48x320), it consumes about 19GB of memory on the GPU with batch_size=128, and the memory consumption will increase or decrease linearly with the batch_size. If there is no GPU available, you could try some online resources from Baidu or Google Colab.

View full answer

TuanBC · 2023-02-16T05:43:45Z

TuanBC
Feb 16, 2023

If you have any change in terms of char_dict, language, or if the accuracy of the inference model is not good for you, then it maybe worth it to try finetune the pretrained model. With the original PPOCRv3 small model config (image size 48x320), it consumes about 19GB of memory on the GPU with batch_size=128, and the memory consumption will increase or decrease linearly with the batch_size. If there is no GPU available, you could try some online resources from Baidu or Google Colab.

1 reply

suheylkiris Feb 17, 2023
Author

Thank you so much. I spent a lot of time with github documentations and E-Book, i figured out how can i train or fine-tune new model. But still i need some reading/explanation resources about two subject:

Where can i learn resource(GPU, CPU) consumption of models and how effects their parameters(batch_size, lr) etc... which like is exampled in your comment.
I did not understand teacher-student relations and algorithms. How it works actually and how can i configure config files? In what situations should I use it? How can i decide which algorithm is a teacher or student? Can each model be used as a teacher or student? etc etc....

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Continue training on existing model with new train data #9076

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Continue training on existing model with new train data #9076

Uh oh!

suheylkiris Feb 14, 2023

Replies: 1 comment · 1 reply

Uh oh!

TuanBC Feb 16, 2023

Uh oh!

suheylkiris Feb 17, 2023 Author

suheylkiris
Feb 14, 2023

Replies: 1 comment 1 reply

TuanBC
Feb 16, 2023

suheylkiris Feb 17, 2023
Author