Replies: 3 comments
-
You can add the required characters to the bottom of the original dictionary. Then fine-tune the model based on your own data. |
Beta Was this translation helpful? Give feedback.
-
what do you think is it ok? Here's a step-by-step guide on fine-tuning the Russian Cyrillic model in PaddleOCR for recognizing Tajik text:
Additional Tips:
Remember, fine-tuning requires careful data preparation, training adjustments, and evaluation to achieve optimal accuracy for your specific needs. Consider starting with a small-scale experiment and adjust based on your findings. |
Beta Was this translation helpful? Give feedback.
-
@tink2123 Hello mate,
Global:
use_gpu: true
epoch_num: 10
log_smooth_window: 20
print_batch_step: 10
save_model_dir: ./output/rec_ben
save_epoch_step: 3
eval_batch_step:
- 0
- 100
cal_metric_during_train: true
pretrained_model: null
checkpoints: null
save_inference_dir: /content/inference
use_visualdl: false
infer_img:
character_dict_path: /content/bn_dict.txt
max_text_length: 25
infer_mode: false
use_space_char: true
Optimizer:
name: Adam
beta1: 0.9
beta2: 0.999
lr:
name: Cosine
learning_rate: 0.001
regularizer:
name: L2
factor: 1.0e-05
Architecture:
model_type: rec
algorithm: CRNN
Transform: null
Backbone:
name: MobileNetV3
scale: 0.5
model_name: small
small_stride:
- 1
- 2
- 2
- 2
Neck:
name: SequenceEncoder
encoder_type: rnn
hidden_size: 48
Head:
name: CTCHead
fc_decay: 1.0e-05
Loss:
name: CTCLoss
PostProcess:
name: CTCLabelDecode
Metric:
name: RecMetric
main_indicator: acc
Train:
dataset:
name: SimpleDataSet
data_dir: /content/content/final_dataset/rec/train/train
label_file_list:
- /content/content/final_dataset/rec/train/rec_gt_train.txt
transforms:
- DecodeImage:
img_mode: BGR
channel_first: false
- RecAug: null
- CTCLabelEncode: null
- RecResizeImg:
image_shape:
- 3
- 32
- 320
- KeepKeys:
keep_keys:
- image
- label
- length
loader:
shuffle: true
batch_size_per_card: 256
drop_last: true
num_workers: 8
Eval:
dataset:
name: SimpleDataSet
data_dir: /content/content/final_dataset/rec/test/test
label_file_list:
- /content/content/final_dataset/rec/test/rec_gt_test.txt
transforms:
- DecodeImage:
img_mode: BGR
channel_first: false
- CTCLabelEncode: null
- RecResizeImg:
image_shape:
- 3
- 32
- 320
- KeepKeys:
keep_keys:
- image
- label
- length
loader:
shuffle: false
drop_last: false
batch_size_per_card: 256
num_workers: 8
2. Using a dataset of ~80k one word images. After training for 100+ epochs, the training loss looks like this-
My questions are-
Would love to know your opinion. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Tajik language uses cyrillic, do I have to create new model from scratch or fine tune Russian cyrillic model?
Tajik has extra letters ӣ, ҳ, қ, ҷ, ӯ, ғ
Tajik and the Russian language are different
Beta Was this translation helpful? Give feedback.
All reactions