Skip to content
Discussion options

You must be logged in to vote

One of the golden rules of models is that your training data should be as much like your real data as possible. For NER you should definitely have sentences with no entities (assuming any of your input data will be like that, which is typical).

It is also possible to add negative examples, like @kinghuang mentioned, though sometimes it's hard to get the balance right.

Replies: 3 comments 2 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by gunalanlakshmanan
Comment options

You must be logged in to vote
2 replies
@polm
Comment options

@gunalanlakshmanan
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / ner Feature: Named Entity Recognizer perf / accuracy Performance: accuracy
3 participants