Additional vectors usage in training process to increase model accuracy #7776
-
Hi, I'm looking for potential ways to increase trained model accuracy and was thinking that the increasing amount of vectors might help. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
The way vectors are used in spaCy is that pretrained vectors are used as input when training a tok2vec model, which is then used as the input to other pipeline components like the tagger, parser, and NER. The pretrained vectors are also exposed in the final pipeline and can be used in similarity calculations, for example. For the pretrained models, the small models do not contain static vectors, but medium and large ones do. All models have a trained tok2vec layer. You can add existing vectors to a model, but you need to convert them first with spacy init vectors. You can read more about how to specify custom vectors for training models here, or how the static vectors are used here.
Generally I would suggest using the quickstart with the "accuracy" option to establish a baseline, and then clarify what you specifically want to improve the accuracy of. |
Beta Was this translation helpful? Give feedback.
The way vectors are used in spaCy is that pretrained vectors are used as input when training a tok2vec model, which is then used as the input to other pipeline components like the tagger, parser, and NER. The pretrained vectors are also exposed in the final pipeline and can be used in similarity calculations, for example.
For the pretrained models, the small models do not contain static vectors, but medium and large ones do. All models have a trained tok2vec layer.
You can add existing vectors to a model, but you need to convert them first with spacy init vectors. You can read more about how to specify custom vectors for training models here, or how the static vectors are used here.