While you can update an existing model to predict new entity types (this demo project does just that), it can be tricky to train it in such a way that the performance on existing entities doesn't suffer.

So, it's usually easier to train the model from scratch. If you need training data for the existing labels, you can run the English model over raw text and treat its predictions as annotations. These predictions can then be combined with your annotated data for the new labels, and the new model trained on the combined set.
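
As a rough sketch of that combination step (not the demo project's actual code): the helper below merges predicted spans for existing labels with hand-annotated spans for a new label, letting the hand-annotated spans win on overlap, since NER training data cannot contain overlapping entities. The function names, the `GADGET` label, the `en_core_web_sm` pipeline name, and the `train.spacy` output path are all assumptions for illustration. The spaCy part assumes spaCy v3 with that pipeline installed.

```python
from typing import Iterable, List, Tuple

# (start_char, end_char, label) character offsets into the raw text
Span = Tuple[int, int, str]


def merge_annotations(predicted: Iterable[Span], gold_new: Iterable[Span]) -> List[Span]:
    """Combine predicted spans (existing labels) with hand-made spans (new labels).

    Hand-made spans take priority: a prediction is dropped if it overlaps one.
    Assumes the hand-made spans do not overlap each other.
    """
    merged = list(gold_new)
    for start, end, label in predicted:
        overlaps = any(start < m_end and m_start < end for m_start, m_end, _ in merged)
        if not overlaps:
            merged.append((start, end, label))
    return sorted(merged)


def build_training_docs(examples, model_name="en_core_web_sm", output_path="train.spacy"):
    """Write a DocBin with predicted + hand-made entities.

    `examples` is an iterable of (text, gold_new_spans) pairs.
    Hypothetical helper; requires spaCy v3 and the named pipeline.
    """
    import spacy
    from spacy.tokens import DocBin

    nlp = spacy.load(model_name)
    doc_bin = DocBin()
    for text, gold_new in examples:
        doc = nlp(text)
        predicted = [(e.start_char, e.end_char, e.label_) for e in doc.ents]
        spans = []
        for start, end, label in merge_annotations(predicted, gold_new):
            span = doc.char_span(start, end, label=label)
            if span is not None:  # None: offsets do not align with token boundaries
                spans.append(span)
        doc.ents = spans
        doc_bin.add(doc)
    doc_bin.to_disk(output_path)
```

For example, `build_training_docs([("I bought a Fairphone in Berlin.", [(11, 20, "GADGET")])])` would keep the hand-made `GADGET` span and add whatever non-overlapping entities the pretrained pipeline predicts. The predictions are only as good as the pretrained model, so spot-checking a sample before training is probably worthwhile.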

The example pipelines are implemented as spaCy Projects; the documentation explains how they work.

Answer selected by svlandeg
Labels
feat / ner Feature: Named Entity Recognizer