"Burkina Faso" not detected #8140
-
Spacy does not detect I work for an NGO that does data analysis on violent incidents regarding the humanitarian and education sectors. We are using Spacy to automate our data entry pipeline. It works amazing most of the times, but several of our events are in Burkina Faso and Spacy does not seem to be able to detect this country in specific. I think all other countries are detected. How to reproduce the behaviour
Returns
Your Environment
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 4 replies
-
This is deeply unfortunate but not a bug (error in code) or something we can fix directly. The models are statistical and make mistakes. Given the training data, it would be unsurprising if Burkina Faso wasn't mentioned that frequently, and it's also two separate unusual words, which is unusual for a country name, so it's kind of hard for the model to recognize. You can read more about errors in the models in #3052. The good news is it's detected in the medium and large models.
Because this is not a bug I'm going to move it to a discussion, please feel free to follow up there if you want. |
Beta Was this translation helpful? Give feedback.
This is deeply unfortunate but not a bug (error in code) or something we can fix directly.
The models are statistical and make mistakes. Given the training data, it would be unsurprising if Burkina Faso wasn't mentioned that frequently, and it's also two separate unusual words, which is unusual for a country name, so it's kind of hard for the model to recognize. You can read more about errors in the models in #3052.
The good news is it's detected in the medium and large models.
Because this is not a bug I'm going to move it to a dis…