How does spaCy perform NER? #11278
If all the data were lowercase, would it affect NER? I ran a simple experiment using the displaCy Named Entity Visualizer. Cased sentence: "when Jason started working on self-driving cars at Google in 2007." displaCy was able to identify the Organization and Person entities in the cased sentence, but no entities were identified in the uncased sentence. I'm wondering why and how the spaCy model identifies 'Google' but can't identify 'google'.
Replies: 1 comment
Yes, the NER architecture uses features like `NORM` and `SHAPE`, which are both case-sensitive. The pre-trained models are not 100% accurate, and mistakes like these can happen. We have a master thread for these cases.
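To illustrate why casing matters for a shape-style feature, here is a minimal sketch in plain Python. The function `word_shape` is a hypothetical helper written for this example: it approximates the kind of orthographic shape that spaCy exposes as `token.shape_`, but it is an assumption about the general behaviour, not spaCy's own code, and it says nothing about how much weight a trained model gives the feature.

```python
def word_shape(text: str) -> str:
    """Approximate an orthographic word shape (hypothetical helper).

    Uppercase letters map to "X", lowercase letters to "x", digits to "d",
    and any other character is kept as-is. Runs of the same shape character
    are truncated after four repeats.
    """
    shape = []
    last = ""
    run = 0
    for char in text:
        if char.isalpha():
            shape_char = "X" if char.isupper() else "x"
        elif char.isdigit():
            shape_char = "d"
        else:
            shape_char = char
        if shape_char == last:
            run += 1
        else:
            run = 0
            last = shape_char
        # Keep at most four consecutive copies of the same shape character.
        if run < 4:
            shape.append(shape_char)
    return "".join(shape)


# Compare the cased and uncased forms of the words from the question.
for word in ["Google", "google", "Jason", "jason", "2007"]:
    print(f"{word!r:10} -> {word_shape(word)}")
```

Under this sketch, 'Google' and 'google' produce different shapes ("Xxxxx" versus "xxxx"), so a model that learned to associate the capitalised shape with entity names receives a different input signal for the lowercased word.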