Custom trained NER tags differently for the same entity #10101

shrinidhin · 2022-01-20T15:08:22Z

Hello. I have trained a custom NER model with spaCy on a small dataset to detect custom entities. I have 80 training data files and 20 test data files. For now it is a basic model without any hyperparameter tuning, until I get the complete pipeline in place. At test time, in one example where the entity to be detected appears twice, the model tags it correctly once and incorrectly the other time. In one occurrence the word is tagged as

INFOSYS ENTITY

which is correct, and at another occurrence of the same word in the sentence it is tagged as

INFOSYS ORG

Will this be fixed with more training data? How much data would be sufficient to obtain a decent model with good recognition?

thomashacker · 2022-01-20T22:28:58Z

Hello,
In terms of how much data you need to get good results, there's no definite answer; it depends heavily on the type of data. However, there are guidelines that set rough thresholds: we've made a flow chart for Prodigy annotations, and its rules can also be applied to other ML tasks.

I think 80 training examples is a bit low, and increasing your dataset would most likely result in better prediction accuracy.
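
While more data is being collected, it may help to measure how often the model labels the same surface text inconsistently, as in the INFOSYS case from the question. The following is only a minimal sketch, not something from spaCy itself: it assumes the predictions are available as `(text, label)` pairs, which in spaCy could be built with `[(ent.text, ent.label_) for ent in doc.ents]`. The helper name `find_inconsistent_entities` and the sample data are hypothetical.

```python
from collections import Counter, defaultdict


def find_inconsistent_entities(predictions):
    """Return entity texts that received more than one distinct label.

    `predictions` is an iterable of (text, label) pairs. Texts are
    compared case-insensitively, which is an assumption of this sketch.
    """
    labels_by_text = defaultdict(Counter)
    for text, label in predictions:
        labels_by_text[text.casefold()][label] += 1
    # Keep only texts that were tagged with two or more different labels.
    return {
        text: dict(counts)
        for text, counts in labels_by_text.items()
        if len(counts) > 1
    }


# Illustrative predictions (made up for this sketch).
predictions = [
    ("INFOSYS", "ENTITY"),
    ("INFOSYS", "ORG"),
    ("Bangalore", "GPE"),
]
inconsistent = find_inconsistent_entities(predictions)
print(inconsistent)
```

Running this over the test set before and after adding training data gives a rough indication of whether the inconsistency is shrinking.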

1 reply

shrinidhin Jan 21, 2022
Author

@thomashacker Thank you!
