Repeating words distribution in train set #12071

pooja-kabra · 2024-05-08T00:37:14Z

pooja-kabra
May 8, 2024

I am finetuning the text recognition network on product labels. Some text fields occur repeatedly on the labels. For example, PRODUCT_ID: 343765. All labels will have the text 'PRODUCT_ID'. How should the word distribution be in the training set for these texts? For example in a 5k dataset, how many samples should these repeating words take up?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repeating words distribution in train set #12071

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Repeating words distribution in train set #12071

Uh oh!

pooja-kabra May 8, 2024

Replies: 0 comments

pooja-kabra
May 8, 2024