Continue training on existing model with new train data #9076
-
Hello, I am using some models which provides from PaddleOCR like en_PP-OCRv3_xx or ch_ppocr_server_v2.0_xx. Actually they are really useful but i have new some special cases and so i want to implement my new train data to this models. I try to train new model with Backbones but my local computer GPU is insufficient for train big datasets. Is it possible? If it is, should i use inference models? Could anybody share any guide about this. I have read almost all english documents but i was really confused and feeling like im lost. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
If you have any change in terms of char_dict, language, or if the accuracy of the inference model is not good for you, then it maybe worth it to try finetune the pretrained model. With the original PPOCRv3 small model config (image size 48x320), it consumes about 19GB of memory on the GPU with batch_size=128, and the memory consumption will increase or decrease linearly with the batch_size. If there is no GPU available, you could try some online resources from Baidu or Google Colab. |
Beta Was this translation helpful? Give feedback.
If you have any change in terms of char_dict, language, or if the accuracy of the inference model is not good for you, then it maybe worth it to try finetune the pretrained model. With the original PPOCRv3 small model config (image size 48x320), it consumes about 19GB of memory on the GPU with batch_size=128, and the memory consumption will increase or decrease linearly with the batch_size. If there is no GPU available, you could try some online resources from Baidu or Google Colab.