
In spaCy, pretrained vectors are used as input features when training a tok2vec model, and the output of that tok2vec model is in turn the input to other pipeline components like the tagger, parser, and NER. The pretrained vectors are also exposed in the final pipeline, so you can use them directly, for example in similarity calculations.
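To make the similarity part concrete, here is a minimal sketch in plain Python of what a vector-based similarity score computes: average the word vectors of each text, then take the cosine of the two averages. As far as I know this mirrors the default behaviour of `Doc.similarity`, but the table `TOY_VECTORS`, its values, and the helper names are all made up for illustration and are not spaCy's implementation.

```python
import math

# Toy 3-dimensional "static vectors". The values are invented for illustration.
TOY_VECTORS = {
    "cat": [0.9, 0.1, 0.0],
    "dog": [0.8, 0.2, 0.1],
    "car": [0.0, 0.1, 0.9],
}


def mean_vector(words):
    """Average the vectors of the words that have an entry in the table."""
    rows = [TOY_VECTORS[w] for w in words if w in TOY_VECTORS]
    if not rows:
        # No known words: fall back to a zero vector of the table's width.
        width = len(next(iter(TOY_VECTORS.values())))
        return [0.0] * width
    return [sum(column) / len(rows) for column in zip(*rows)]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


sim_cat_dog = cosine(mean_vector(["cat"]), mean_vector(["dog"]))
sim_cat_car = cosine(mean_vector(["cat"]), mean_vector(["car"]))
print(f"cat vs dog: {sim_cat_dog:.3f}")
print(f"cat vs car: {sim_cat_car:.3f}")
```

With these toy values, "cat" comes out closer to "dog" than to "car", which is the kind of signal the static vectors provide.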

Among the pretrained pipelines, the small models do not contain static vectors, while the medium and large ones do. All of them have a trained tok2vec layer.
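If you want to check which case a given pipeline falls into, one option is to look at the shape of its vectors table. The sketch below assumes a spaCy v3 `nlp` object, where `nlp.vocab.vectors.shape` is a `(rows, width)` tuple; the helper name `has_static_vectors` is hypothetical.

```python
def has_static_vectors(nlp):
    """Return True if the pipeline's vectors table has at least one row.

    Assumes `nlp` is a loaded spaCy pipeline, so that
    `nlp.vocab.vectors.shape` is a (rows, width) tuple.
    """
    n_rows, width = nlp.vocab.vectors.shape
    return n_rows > 0 and width > 0
```

With spaCy and the relevant packages installed, you would call it as `has_static_vectors(spacy.load("en_core_web_md"))`. Going by the description above, you would expect `False` for a small model and `True` for a medium or large one.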

You can add existing vectors to a model, but you need to convert them first with spacy init vectors. You can read more about how to specify custom vectors for training models here, or how the static vectors are used here.
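As a rough sketch of the conversion step: `spacy init vectors` reads a text file whose first line gives the table dimensions, followed by one space-separated word2vec-style row per word. The snippet below writes such a file from a toy table. The words, values, file name, and output directory are all made up for illustration, and the CLI call in the comments assumes spaCy v3 is installed.

```python
from pathlib import Path

# Toy vectors, invented for illustration only.
toy_vectors = {
    "cat": [0.9, 0.1, 0.0],
    "dog": [0.8, 0.2, 0.1],
    "car": [0.0, 0.1, 0.9],
}


def write_word2vec_text(vectors, path):
    """Write vectors as text: a 'rows width' header, then 'word v1 v2 ...' rows."""
    n_rows = len(vectors)
    width = len(next(iter(vectors.values())))
    lines = [f"{n_rows} {width}"]
    for word, values in vectors.items():
        lines.append(" ".join([word] + [f"{v:.4f}" for v in values]))
    Path(path).write_text("\n".join(lines) + "\n", encoding="utf8")


write_word2vec_text(toy_vectors, "toy_vectors.txt")

# Assumed next step, run in a shell (requires spaCy v3):
#   python -m spacy init vectors en toy_vectors.txt ./toy_vectors_model
# The output directory should then be loadable with
#   spacy.load("./toy_vectors_model")
```

The resulting directory is a blank pipeline that only carries the vectors, so it is a starting point for training rather than a replacement for a trained model.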

And …

Answer selected by polm
Labels
feat / vectors Feature: Word vectors and similarity