Skip to content
Discussion options

You must be logged in to vote

If you want to use a custom component in addition to the pretrained pipelines, I would recommend you train your component in isolation, and then source the components you want from the pretrained pipeline afterwards, whether in code or using spacy assemble.

The one downside of this approach is you'll need two copies of the Transformer (/tok2vec), which will take up more disk and memory. But the alternative is training your model with a frozen Transformer, which will limit the accuracy you can achieve. (Also freezing Transformers isn't straightforward at the moment - you can't use frozen_components, you have to set grad_factor = 0.)

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@hnalla
Comment options

@hnalla
Comment options

@polm
Comment options

Answer selected by hnalla
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / textcat Feature: Text Classifier feat / transformer Feature: Transformer
2 participants