SPACY3.1 Failing on simple sentences #8981
-
How to reproduce the behaviourYour Environment
I am trying NER tagging in latest spacy. Here are few examples:
Output
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Are you possibly using the v3.0.0 model rather than the v3.1.0 model (look at the version for The import spacy
from spacy import displacy
nlp = spacy.load("en_core_web_trf")
text = "my name is richard and i will be taking over from here My name is Richard and I will be taking over from here."
doc = nlp(text)
displacy.serve(doc, style='ent') Some users have also reported worse performance on identifying standalone first names in the |
Beta Was this translation helpful? Give feedback.
Are you possibly using the v3.0.0 model rather than the v3.1.0 model (look at the version for
en_core_web_trf
inpip freeze
and/or runspacy validate
)? If so, I think you've just lucked into an example that doesn't work well for this model, which is primarily trained on newspaper-style text with standard capitalization. See more details about the statistical models in #3052.The
en_core_web_trf
v3.1.0 model has some lowercase augmentation that should improve the performance on texts without newspaper-style capitalization. With spacy v3.1.1 anden_core_web_trf
v3.1.0, both versions of "Richard" are shown asPERSON
for me: