Skip to content
Discussion options

You must be logged in to vote

If you wanted to go the NER route, you could do it as you propose and use spaCy to label as 'ents' the examples you want it to capture and not label the examples you don't want it to capture (like your example sentence), and that becomes your training data and hopefully the model can learn to tell the difference. I think what you have to watch out for are, first, that you have enough examples. You'd be training the entity recognizer from scratch so you would need quite a large training set. Second, if the the pitfall sentences like the example you gave are rare, the model may have trouble learning to recognize them. You would know better obviously, but if the sentence structure in the exa…

Replies: 2 comments 2 replies

Comment options

You must be logged in to vote
2 replies
@jamiehannaford
Comment options

@svlandeg
Comment options

Answer selected by svlandeg
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
usage General spaCy usage
4 participants