Spacy Custom Name Entity Recognition (NER) 'catastrophic forgetting' issue #6846
-
Beta Was this translation helpful? Give feedback.
Replies: 5 comments 5 replies
-
Can you explain in more detail which model you're starting from, which entities it already is able to predict, which type of (new and old) entities you're feeding into the model for retraining, and which problems in accuracy you're seeing? The code above is not really a minimal reproducible snippet. The way you've pasted it, it looks like
Note that in the second definition of To avoid the catastrophic forgetting problem, you need to create realistic training examples for all the entity types that you want the ML algorithm to learn / not forget, and feed those in when you're retraining the model. Can you verify whether that is indeed what you're doing? |
Beta Was this translation helpful? Give feedback.
-
Also, just FYI, if you're mainly interested in recognizing a list of common synonyms, you might also consider a more rule-based approach, e.g. https://spacy.io/usage/rule-based-matching#phrasematcher. This might be particularly useful for words that are not ambiguous / not dependent on context. |
Beta Was this translation helpful? Give feedback.
-
@svlandeg I have initially trained a model on recognizing links ( Reddit, stack-overflow and Twitter links) and after that i want to add another entity in the above case petroleum products. and as far as the HORSE labeling is concerned it was the default train data given by Spacy. |
Beta Was this translation helpful? Give feedback.
-
FYI - I've transferred this issue to the discussion forum, which is better suited for usage questions and community discussions! |
Beta Was this translation helpful? Give feedback.
-
Hi!I have a similar use case I am trying to implement. I am trying to build an entity recognizer with a vast set of entities of a specific type. It is quite possible I might not have training data to cover all entity labels. Is is possible to include it in a vocab of some sort in SPACY's NLP NER? |
Beta Was this translation helpful? Give feedback.
Can you explain in more detail which model you're starting from, which entities it already is able to predict, which type of (new and old) entities you're feeding into the model for retraining, and which problems in accuracy you're seeing?
The code above is not really a minimal reproducible snippet. The way you've pasted it, it looks like
TRAIN_DATA
is defined several times:Note that in the second definition of
TRA…