NER Training data with no labels to reduce false positive #11131

gunalanlakshmanan · 2022-07-14T19:10:26Z

gunalanlakshmanan
Jul 14, 2022

I am trying to create an NER model to identify one entity. I am facing with the problem of lot of false positives. Because the training data only contains data where the entity is present. During real time, I will get the data where we should not predict any entity. But I am getting lot of false positives on that.

How can I reduce the false positive? Can I add data where I should not predict anything in the training data without any labels? Will that work? Is there a way to add negative samples?

Answered by polm

Jul 19, 2022

One of the golden rules of models is that your training data should be as much like your real data as possible. For NER you should definitely have sentences with no entities (assuming any of your input data will be like that, which is typical).

It is also possible to add negative examples, like @kinghuang mentioned, though sometimes it's hard to get the balance right.

View full answer

kinghuang · 2022-07-18T20:01:42Z

kinghuang
Jul 18, 2022

You can pass negatives via a SpanGroup. See the incorrect_spans_key config.

EntityRecognizer — Config and implementation

0 replies

polm · 2022-07-19T03:55:08Z

polm
Jul 19, 2022

One of the golden rules of models is that your training data should be as much like your real data as possible. For NER you should definitely have sentences with no entities (assuming any of your input data will be like that, which is typical).

It is also possible to add negative examples, like @kinghuang mentioned, though sometimes it's hard to get the balance right.

0 replies

gunalanlakshmanan · 2022-07-19T06:07:45Z

gunalanlakshmanan
Jul 19, 2022
Author

@kinghuang @polm Thanks for the answer.

I have one doubt regarding incorrect_spans_key.
In the documentation for incorrect_spans_key (This key refers to a SpanGroup in doc.spans that specifies incorrect spans. The NER will learn not to predict (exactly) those spans), it is mentioned as "exactly". Will the model learn to generalise on those incorrect spans or is it just the rule based matching to filter out incorrect predictions?

2 replies

polm Jul 20, 2022

I'm not super familiar with the negative annotations code, but looking at the implementation, the model doesn't learn things not to tag (there is no model of "negative entities"), rather internally it makes sure to avoid reproducing those annotations. The NER model uses the same architecture as the transition parser and negative samples affect the cost calculations so that other predictions get made instead (whatever they may be).

gunalanlakshmanan Aug 3, 2022
Author

Thanks @polm.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

NER Training data with no labels to reduce false positive #11131

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 3 comments 2 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Uh oh!

NER Training data with no labels to reduce false positive #11131

Uh oh!

Uh oh!

gunalanlakshmanan Jul 14, 2022

Replies: 3 comments · 2 replies

Uh oh!

kinghuang Jul 18, 2022

Uh oh!

polm Jul 19, 2022

Uh oh!

gunalanlakshmanan Jul 19, 2022 Author

Uh oh!

polm Jul 20, 2022

Uh oh!

gunalanlakshmanan Aug 3, 2022 Author

gunalanlakshmanan
Jul 14, 2022

Replies: 3 comments 2 replies

kinghuang
Jul 18, 2022

polm
Jul 19, 2022

gunalanlakshmanan
Jul 19, 2022
Author

gunalanlakshmanan Aug 3, 2022
Author