Skip to content
Discussion options

You must be logged in to vote

Multi-task learning is the default in spaCy pipelines, see the docs on shared embeddings.

It sounds like your pipeline would be a tok2vec (or transformer) with two textcat components, but I need to look at how that works - the issue is that textcat components use the .cats attribute, which is unique, so there's no place to separate your annotations.

What you can do out of the box is train a multilabel textcat with non-exclusive categories. If you label your data consistently the classifier shouldn't violate your constraints even if it's technically possible; you can try that to get started with the spaCy training process at least.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by svlandeg
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
training Training and updating models feat / textcat Feature: Text Classifier
2 participants