Skip to content
Discussion options

You must be logged in to vote

You should not be doing this - the vectors that this pipeline generates are trained to return representations that are a) highly conditioned on their context, and b) specific to some downstream task; it's not going to be very meaningful to run similarity tests on them.

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@iwkkk
Comment options

@vinbo8
Comment options

Answer selected by vinbo8
@iwkkk
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / tok2vec Feature: Token-to-vector layer and pretraining
2 participants