Training the model to NOT recognize certain NE #4865

Erin59 · 2020-01-02T18:02:10Z

Erin59
Jan 2, 2020

Hi! Please forgive me if this question was already raised somewhere, I tried searching first but couldn't formulate it properly for Google. Also, sorry if this is not formatted correctly.

My question is - I know that you can train the model to recognize new named entities, however, my problem is the opposite. Is there any way to tell the model to "forget" certain entities that are not recognized correctly and not to recognize them anymore? For example, I was going through some law voting transcripts, there are words "Aye" and "No", and "Aye" gets recognized as a person, even when it's lowcase. Is there any better solution for this apart from just building a stop-word list for such cases?

svlandeg · 2020-01-02T22:51:35Z

svlandeg
Jan 2, 2020

Hi @Erin59. In my opinion, building a stop-word list is really only feasible if you notice a small variety of words that got wrongly tagged, and only if you're sure that these could never really be entities. Like, there could be a company called "Aye" but maybe that's a sacrifice you're willing to make ;-)

However if you see a larger lexical variety, ánd if you feel like there's a sort of systematic error, then it might be worth retraining your NER model. Basically what you want to do is take the sentences where you have mistakes, correct the annotation, and feed the sentence back into the classifier. You won't specifically have an annotation for "NOT AN ENTITY", but the model will (should) learn not to predict entities for those tokens that are not annotated in the "gold" entities training data you provided. More specifically, those tokens would internally get the annotation O meaning Outside in the BILUO tagging scheme.

Basically what you are doing then is (re)training or updating the Named Entity Recognizer with custom examples to make it fit your dataset better. While you're doing this, make sure to heed this advice:

You should avoid iterating over the same few examples multiple times, or the model is likely to “forget” how to annotate other examples. If you iterate over the same few examples, you’re effectively changing the loss function. The optimizer will find a way to minimize the loss on your examples, without regard for the consequences on the examples it’s no longer paying attention to. One way to avoid this “catastrophic forgetting” problem is to “remind” the model of other examples by augmenting your annotations with sentences annotated with entities automatically recognized by the original model.

Basically, make sure that while retraining, the model does not forget to make those predictions it already had correct originally.

If you run into any technical difficulties implementing this retraining loop (if that's the route you chose to take) - feel free to open a new issue!

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Training the model to NOT recognize certain NE #4865

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Select a reply

Uh oh!

Uh oh!

Training the model to NOT recognize certain NE #4865

Uh oh!

Erin59 Jan 2, 2020

Replies: 1 comment

Uh oh!

Uh oh!

svlandeg Jan 2, 2020

Erin59
Jan 2, 2020

svlandeg
Jan 2, 2020