RFE: vectors in different languages should be aligned #5888
Replies: 3 comments
-
Agreed that this would be nice, but it would mean training the various models jointly, which may hurt the accuracy of individual models (especially those for smaller languages). Setting up an evaluation of the alignment is also a big task. If someone wants to take this on as a research project and gets convincing results, we'd consider adopting it. But it seems like quite an ambitious project that is unlikely to yield compelling results in practice. If your specific project requires this, you'll want to train your own vector models instead.
-
Right, I'll look for alignment algorithms or for how to produce the transformation matrices to align two models.
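For reference, one common way to get such a transformation matrix is orthogonal Procrustes: given a seed dictionary of translation pairs, find the orthogonal matrix that best maps the source vectors onto the target vectors. Below is a minimal sketch with NumPy, not a definitive implementation. The function name `procrustes_alignment` is hypothetical, the data is synthetic toy data rather than vectors from any real model, and it assumes both vector spaces have the same dimensionality.

```python
import numpy as np


def procrustes_alignment(src, tgt):
    """Return the orthogonal matrix W minimizing ||src @ W - tgt||_F.

    src, tgt: arrays of shape (n_pairs, dim). Row i of each array holds the
    vectors of one translation pair (assumed to come from a seed dictionary).
    """
    # Closed-form solution: take the SVD of the cross-covariance matrix
    # and drop the singular values.
    u, _, vt = np.linalg.svd(src.T @ tgt)
    return u @ vt


# Toy data (assumption): the "source" space is just a rotated copy of the
# "target" space, so a perfect alignment exists.
rng = np.random.default_rng(0)
tgt = rng.normal(size=(50, 4))
rotation, _ = np.linalg.qr(rng.normal(size=(4, 4)))  # random orthogonal matrix
src = tgt @ rotation.T  # by construction, src @ rotation == tgt

W = procrustes_alignment(src, tgt)
aligned = src @ W  # source vectors mapped into the target space
```

With real models the mapping will only be approximate, so the quality of the seed dictionary matters. Because `W` is orthogonal, distances and cosine similarities inside the source space are unchanged by the mapping.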
-
I have found the MUSE project from Facebook Research: https://github.com/facebookresearch/MUSE
-
Feature description
When working with several languages, it would be very useful to compare word vectors across languages as a way to compare the words' "meaning".
Yet different models seem to use vector spaces that are not aligned, so vectors for words with the same meaning are normally not similar.
It would be nice if: