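Your training-set accuracy has already reached 0.89. If you want to improve it further, you can try the following:
1. Data augmentation: your training config already includes some data augmentation (such as …
2. Training hyperparameter tuning
3. Model architecture optimization
4. Training data optimization
5. Compute resource optimization
6. Adjusting …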
Training parameters
Global:
  debug: false
  use_gpu: true
  epoch_num: 50
  log_smooth_window: 20
  print_batch_step: 20
  save_model_dir: ./output/rec_ppocr_v4_hgnet
  save_epoch_step: 3
  eval_batch_step: [0, 500]
  cal_metric_during_train: true
  pretrained_model: https://paddleocr.bj.bcebos.com/pretrained/ch_PP-OCRv4_rec_server_trained.pdparams
  checkpoints:
  save_inference_dir:
  use_visualdl: true
  infer_img: doc/imgs_words/ch/word_1.jpg
  character_dict_path: ppocr/utils/ppocr_keys_v1.txt
  max_text_length: &max_text_length 25
  infer_mode: false
  use_space_char: true
  distributed: true
  save_res_path: ./output/rec/predicts_ppocrv3.txt
Optimizer:
  name: AdamW
  beta1: 0.9
  beta2: 0.999
  lr:
    name: Cosine
    learning_rate: 0.0005
    warmup_epoch: 5
    warmup_start_lr: 1e-6
  regularizer:
    name: L2
    factor: 1e-05
Architecture:
  model_type: rec
  algorithm: SVTR_HGNet
  Transform:
  Backbone:
    name: PPHGNet_small
  Head:
    name: MultiHead
    head_list:
      - CTCHead:
          Neck:
            name: svtr
            dims: 128
            depth: 3
            hidden_dims: 128
            kernel_size: [3, 3]
            use_guide: True
          Head:
            fc_decay: 0.00001
      - NRTRHead:
          nrtr_dim: 256
          max_text_length: *max_text_length
Loss:
  name: MultiLoss
  loss_config_list:
    - CTCLoss:
        weight: 0.7
    - NRTRLoss:
        weight: 0.3
PostProcess:
  name: CTCLabelDecode
Metric:
  name: RecMetric
  main_indicator: acc
Train:
  dataset:
    name: MSTextRecDataset
    ds_width: false
    data_dir: ./train_data/
    ext_op_transform_idx: 1
    label_file_list:
      - ./train_data/train_list.txt
    transforms:
      - DecodeImage:
          img_mode: BGR
          channel_first: false
      - RecConAug:
          prob: 0.5
          ext_data_num: 2
          image_shape: [48, 320, 3]
          max_text_length: *max_text_length
      - RecAug:
      - MultiLabelEncode:
          gtc_encode: NRTRLabelEncode
      - KeepKeys:
          keep_keys:
            - image
            - label_ctc
            - label_gtc
            - length
            - valid_ratio
  sampler:
    name: MultiScaleSampler
    scales: [[320, 32], [320, 48], [320, 64]]
    first_bs: 16
    fix_bs: false
    divided_factor: [16, 16] # w, h
    is_training: True
  loader:
    shuffle: true
    batch_size_per_card: 24
    drop_last: true
    num_workers: 8
Eval:
  dataset:
    name: TextRecDataset
    data_dir: ./train_data
    label_file_list:
      - ./train_data/val_list.txt
    transforms:
      - DecodeImage:
          img_mode: BGR
          channel_first: false
      - MultiLabelEncode:
          gtc_encode: NRTRLabelEncode
      - RecResizeImg:
          image_shape: [3, 48, 320]
      - KeepKeys:
          keep_keys:
            - image
            - label_ctc
            - label_gtc
            - length
            - valid_ratio
  loader:
    shuffle: false
    drop_last: false
    batch_size_per_card: 128
    num_workers: 4
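To make the Optimizer settings above easier to reason about, here is a minimal sketch of the learning-rate curve they describe: linear warmup from `warmup_start_lr` to `learning_rate` over `warmup_epoch` epochs, then cosine decay until `epoch_num`. This is an illustration under assumptions, not PaddleOCR's own scheduler: the function name `lr_at_epoch` is hypothetical, the curve is evaluated per epoch rather than per iteration, and the real implementation may handle `warmup_start_lr` or the cosine period differently.

```python
import math


def lr_at_epoch(epoch, total_epochs=50, base_lr=0.0005,
                warmup_epochs=5, warmup_start_lr=1e-6):
    """Hypothetical helper: approximate LR at the start of a given epoch.

    Defaults mirror epoch_num, learning_rate, warmup_epoch and
    warmup_start_lr from the config posted above.
    """
    if epoch < warmup_epochs:
        # Linear ramp from warmup_start_lr up to base_lr.
        frac = epoch / warmup_epochs
        return warmup_start_lr + (base_lr - warmup_start_lr) * frac
    # Cosine decay from base_lr towards 0 over the remaining epochs
    # (assumption: the cosine period covers only the post-warmup epochs).
    progress = (epoch - warmup_epochs) / (total_epochs - warmup_epochs)
    return 0.5 * base_lr * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    for e in (0, 1, 5, 10, 25, 40, 50):
        print(f"epoch {e:2d}: lr = {lr_at_epoch(e):.6f}")
```

With only 50 epochs, the first 5 are spent below the peak rate, so plotting this curve next to the accuracy log can help judge whether the schedule or the epoch count is the limiting factor.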
Training sample examples