Skip to content
Discussion options

You must be logged in to vote

Hello,
you can add your components (parser, tagger, ner) to the annotating_components variable in the config.cfg (https://spacy.io/api/data-formats#config) that will set their annotations during training. In order to have an effect on the performance you have to use a tok2vec instead of a transformer and add the new features to the attrs variable under [components.textcat.model.tok2vec.embed]
However, I'm unsure if this will have a big impact on your textcat.

Alternatively, I suggest that you have a look at your dataset and figure out if your "problematic label" is equally represented in the training/development set.

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Answer selected by polm
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / textcat Feature: Text Classifier feat / config Feature: Training config
2 participants