Commit 179659c
Create gloveExpectedTop10.csv
- This data is used to test the results of the DML script for GloVe word embeddings.
- This file contains the top 10 most similar words for each word in the GloVe word embedding, based on the Mittens implementation (https://github.com/roamanalytics/mittens/tree/master).
- The test dataset is provided under test/resources in '20news/20news_subset_untokenized.csv'.

1 parent 87fc8a1 commit 179659c
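
For illustration only, here is a minimal sketch of how a "top 10 most similar words" list could be derived from an embedding matrix. It assumes cosine similarity as the metric, which the commit message does not state. The function name `top_k_similar` and the toy vocabulary are hypothetical and are not taken from the DML script or from Mittens.

```python
import numpy as np

def top_k_similar(words, vectors, k=10):
    """Map each word to its k most similar other words.

    Assumption: similarity is cosine similarity; the commit does not
    say which metric produced gloveExpectedTop10.csv.
    """
    # Normalize rows so that a dot product equals cosine similarity.
    norms = np.linalg.norm(vectors, axis=1, keepdims=True)
    unit = vectors / np.where(norms == 0, 1.0, norms)
    sims = unit @ unit.T
    # Exclude each word from its own neighbor list.
    np.fill_diagonal(sims, -np.inf)
    k = min(k, len(words) - 1)
    # Indices of the k highest similarities per row, best first.
    order = np.argsort(-sims, axis=1, kind="stable")[:, :k]
    return {w: [words[j] for j in order[i]] for i, w in enumerate(words)}

# Toy data (hypothetical), not the 20news subset.
words = ["king", "queen", "apple"]
vectors = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]])
result = top_k_similar(words, vectors, k=2)
```

A test could then compare such lists against the expected CSV, ideally with some tolerance for ties or near-ties in similarity.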
1 file changed, 478 additions and 0 deletions