Skip to content
Discussion options

You must be logged in to vote

This is deeply unfortunate but not a bug (error in code) or something we can fix directly.

The models are statistical and make mistakes. Given the training data, it would be unsurprising if Burkina Faso wasn't mentioned that frequently, and it's also two separate unusual words, which is unusual for a country name, so it's kind of hard for the model to recognize. You can read more about errors in the models in #3052.

The good news is it's detected in the medium and large models.

import spacy
nlp = spacy.load("en_core_web_md")
text = "I went to Burkina Faso for vacation."
for ent in nlp(text).ents:
    print(ent.label_, ent, sep="\t")

Because this is not a bug I'm going to move it to a dis…

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@polm
Comment options

@joaobarcia
Comment options

@joaobarcia
Comment options

@polm
Comment options

Answer selected by adrianeboyd
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / ner Feature: Named Entity Recognizer
2 participants
Converted from issue

This discussion was converted from issue #8133 on May 19, 2021 04:18.