Skip to content
Discussion options

You must be logged in to vote

The explanations get a bit fuzzy here because we can define what the thing is "conceptually", but, also, pipelines are allowed to write data to these attributes, and they might choose to use them with different semantics from how we really expect.

We use the doc.tensor attribute to store the contextual token-to-vector encodings computed by the Tok2Vec component. These encodings might be used as features by other components, if they have a Tok2VecListener layer inside their model. The doc.tensor values may or may not be useful to you outside of those modelling decisions, these are learned parameters and all bets are off, really.

The token.vector attribute is usually drawn out of the static…

Replies: 1 comment 9 replies

Comment options

You must be logged in to vote
9 replies
@koaning
Comment options

@thiippal
Comment options

@stas-sl
Comment options

@stas-sl
Comment options

@polm
Comment options

Answer selected by koaning
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
usage General spaCy usage feat / vectors Feature: Word vectors and similarity v3.0 Related to v3.0
5 participants