模型微调 #15000
Unanswered
daijun-hub
asked this question in
Q&A
模型微调
#15000
Replies: 1 comment
-
根据你描述的问题,训练数据主要集中在新增的「橖」字,导致模型在识别其他文本时也倾向输出「橖」,这属于训练数据中类别分布严重不平衡导致的过拟合现象。为了解决该问题,建议你在PP-OCRv4识别模型微调时:
以下是一个修改后的完整微调配置文件(基于 PP-OCRv3 识别模型): 🔧 config.yaml Global:
use_gpu: true
epoch_num: 100
log_smooth_window: 20
print_batch_step: 10
save_model_dir: ./output/rec_ppocrv3_custom
save_epoch_step: 1
eval_batch_step: [0, 2000]
cal_metric_during_train: True
pretrained_model: ./pretrain/ch_PP-OCRv3_rec_train/best_accuracy.pdparams
checkpoints:
save_inference_dir:
use_visualdl: False
infer_img:
character_dict_path: ./ppocr/utils/my_dict.txt
max_text_length: 25
infer_mode: False
use_space_char: True
distributed: False
save_res_path: ./output/rec/predicts.txt
use_amp: False
Architecture:
model_type: rec
algorithm: SVTR
Transform:
Backbone:
name: MobileNetV1Enhance
scale: 0.5
last_conv_stride: [1, 2]
last_pool_type: avg
Neck:
name: SequenceEncoder
encoder_type: svtr
dims: 64
depth: 2
hidden_dims: 120
use_guide: False
Head:
name: CTCHead
fc_decay: 0.00001
Loss:
name: CTCLoss
Optimizer:
name: Adam
beta1: 0.9
beta2: 0.999
lr:
name: Piecewise
decay_epochs: [50, 80]
values: [0.0001, 0.00002]
warmup_epoch: 5
regularizer:
name: 'L2'
factor: 0.00001
Train:
dataset:
name: SimpleDataSet
data_dir: ./train_data/
label_file_list:
- ./train_data/custom橖.txt
- ./train_data/general.txt
ratio_list: [0.2, 0.8] # 保证橖字样本不过多
transforms:
- RecAug: {}
- CTCLabelEncode: {}
- KeepKeys:
keep_keys: ['image', 'label', 'length']
loader:
shuffle: True
drop_last: False
batch_size_per_card: 64
num_workers: 4
use_shared_memory: False
Eval:
dataset:
name: SimpleDataSet
data_dir: ./val_data/
label_file_list: [./val_data/val.txt]
transforms:
- CTCLabelEncode: {}
- KeepKeys:
keep_keys: ['image', 'label', 'length']
loader:
shuffle: False
drop_last: False
batch_size_per_card: 64
num_workers: 4 📌 注意事项:
通过上述配置,你的模型将更好地兼顾新字的识别能力与整体识别性能。 Response generated by 🤖 feifei-bot | chatgpt-4o-latest |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
我在训练PP-OCRv4模型时,我在字典文件里增加了一个橖字,准备了带橖字的图片1000张左右,主要就是橖字,用了PP-OCRv4的预训练模型,训练出的模型在识别其他文字时,都识别成了橖字.我该如何调整我的训练配置文件,请给出完整的配置文件内容
Beta Was this translation helpful? Give feedback.
All reactions