How spaCy perform NER? #11278

teohsinyee · 2022-08-08T12:57:07Z

teohsinyee
Aug 8, 2022

If all data were lowercase, will it affect NER?
Is casing important to NER?

I did simple experiment using displaCy Named Entity Visualizer

Cased sentence: when Jason started working on self-driving cars at Google in 2007.
Uncased sentence: when jason started working on self-driving cars at google in 2007.

Displacy able to identify Organization & Person from Cased sentence. But NO entities were identified from Uncased sentence.

I'm wondering why & how spaCy model identify 'Google' but can't identify 'google'?

Answered by thomashacker

Aug 12, 2022

Yes, the NER architecture uses features likes NORM and SHAPE which are both case-sensitive. The pre-trained models are not 100% accurate, and mistakes like these can happen. We have a master thread for these cases.

View full answer

thomashacker · 2022-08-12T08:55:19Z

thomashacker
Aug 12, 2022

Yes, the NER architecture uses features likes NORM and SHAPE which are both case-sensitive. The pre-trained models are not 100% accurate, and mistakes like these can happen. We have a master thread for these cases.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

How spaCy perform NER? #11278

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Uh oh!

How spaCy perform NER? #11278

Uh oh!

teohsinyee Aug 8, 2022

Replies: 1 comment

Uh oh!

thomashacker Aug 12, 2022

teohsinyee
Aug 8, 2022

thomashacker
Aug 12, 2022