Repeating characters in a word are being suppressed by paddleOCR model #12841
Unanswered
raushani2v
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
I have trained a custom paddleocr model for character recognition. The repeating characters in a word are being suppressed by the model. for e.g, 40799932 is predicted as 4079932.
What is the reason for this problem? Is there any solution?
I am using following configuration:
Global:
debug: false
use_gpu: true
epoch_num: 5000
log_smooth_window: 20
print_batch_step: 10
save_model_dir: ./output/v3_en_mobile
save_epoch_step: 500
eval_batch_step:
cal_metric_during_train: true
pretrained_model: null
checkpoints: null
save_inference_dir: null
use_visualdl: false
infer_img: doc/imgs_words/ch/word_1.jpg
character_dict_path: ../dataset/wagon/training_data_05-09-2023/char_dict.txt
max_text_length: 25
infer_mode: false
use_space_char: true
distributed: true
save_res_path: ./output/rec/predicts_ppocrv3_en.txt
Optimizer:
name: Adam
beta1: 0.9
beta2: 0.999
lr:
name: Cosine
learning_rate: 0.001
warmup_epoch: 5
regularizer:
name: L2
factor: 3.0e-05
Architecture:
model_type: rec
algorithm: SVTR_LCNet
Transform: null
Backbone:
name: MobileNetV1Enhance
scale: 0.5
last_conv_stride:
last_pool_type: avg
last_pool_kernel_size:
Head:
name: MultiHead
head_list:
Neck:
name: svtr
dims: 64
depth: 2
hidden_dims: 120
use_guide: true
Head:
fc_decay: 1.0e-05
enc_dim: 512
max_text_length: 25
Loss:
name: MultiLoss
loss_config_list:
PostProcess:
name: CTCLabelDecode
Metric:
name: RecMetric
main_indicator: acc
ignore_space: false
Train:
dataset:
name: SimpleDataSet
data_dir: ../dataset/wagon/training_data_05-09-2023
ext_op_transform_idx: 1
label_file_list:
transforms:
img_mode: BGR
channel_first: false
prob: 0.5
ext_data_num: 2
image_shape:
max_text_length: 25
image_shape:
keep_keys:
loader:
shuffle: true
batch_size_per_card: 128
drop_last: true
num_workers: 4
Eval:
dataset:
name: SimpleDataSet
data_dir: ../dataset/wagon/training_data_05-09-2023
label_file_list:
transforms:
img_mode: BGR
channel_first: false
image_shape:
keep_keys:
loader:
shuffle: false
drop_last: false
batch_size_per_card: 128
num_workers: 4
profiler_options: null
Beta Was this translation helpful? Give feedback.
All reactions