Skip to content
Discussion options

You must be logged in to vote

Hi @info2000 ,

What is the best way to vectorize new words?

To do this you need to train the vectors. However at this stage, a 3% out-of-vocab training data is still low. You might want to check first why there are many whitespace tokens in your dataset (note that the top 3 common "words" are all whitespaces). Perhaps clean the data a bit more and see how it works :)

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by info2000
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
training Training and updating models feat / vectors Feature: Word vectors and similarity
2 participants