|
1 | | -# 词向量资源 |
| 1 | +# 四、词向量资源&代码&文献 |
2 | 2 |
|
| 3 | +## 4.1词嵌入模型资源 |
3 | 4 | 使用 cntext2.x 训练得到的相关词向量资源,汇总如下 |
4 | 5 |
|
5 | | - |
6 | 6 | | 数据集介绍 | 词向量 | 下载链接 | |
7 | 7 | | --- | --- | --- | |
8 | | -| 更新中 | 更新中| 更新中 | |
| 8 | +| [留言板](https://textdata.cn/blog/2023-12-22-renmin-gov-leader-comment-board/) | ***留言板-Word2Vec.200.15.bin***| https://pan.baidu.com/s/1n7vwCOBnrye1CYrt_IBqZA?pwd=9m42 | |
| 9 | +| [A股年报](https://textdata.cn/blog/2023-03-23-china-a-share-market-dataset-mda-from-01-to-21/) | ***mda01-23-GloVe.200.15.bin***| https://pan.baidu.com/s/1vXvbomHjOaFBeEz7GV0R6A?pwd=y6hd | |
| 10 | +| [A股年报](https://textdata.cn/blog/2023-03-23-china-a-share-market-dataset-mda-from-01-to-21/) | ***mda01-23-Word2Vec.200.15.bin***| https://pan.baidu.com/s/11V1RyqH_cKE9eju0Mm-1TQ?pwd=kcwx | |
| 11 | +|[港股年报](https://textdata.cn/blog/2024-01-21-hk-stock-market-anual-report/)| ***英文港股年报-Word2Vec.200.15.bin***| https://pan.baidu.com/s/1ISGAoZnA_1Ben6M2DCliOQ?pwd=nagx | |
| 12 | +|[港股年报](https://textdata.cn/blog/2024-01-21-hk-stock-market-anual-report/)| ***中文港股年报-Word2Vec.200.15.bin***| hhttps://pan.baidu.com/s/1smMcrPtIP8g635YABCodig?pwd=sjdj | |
| 13 | +| [黑猫消费者投诉](https://textdata.cn/blog/2025-03-05-consumer-complaint-dataset/) | ***消费者黑猫投诉-Word2Vec.200.15.bin***| https://pan.baidu.com/s/1FOI2BIVRojOswdKfqaNbsw?pwd=catc | |
| 14 | +| [豆瓣影评](2024-04-16-douban-movie-1000w-ratings-comments-dataset) | ***douban-movie-1000w-Word2Vec.200.15.bin***| https://pan.baidu.com/s/1uq6Ti7HbEWyT4CgktKrMng?pwd=63jg | |
| 15 | +| [B站](2023-11-12-using-100m-bilibili-user-sign-data-to-training-word2vec) | ***B站签名-Word2Vec.200.15.bin***| https://pan.baidu.com/s/1OtBU9BzitcNxkmPzhzH6FQ?pwd=m3iv | |
| 16 | +| [人民日报](https://textdata.cn/blog/2023-12-14-daily-news-dataset/)|[年份Word2Vec](https://textdata.cn/blog/2023-12-28-visualize-the-culture-change-using-people-daily-dataset/)|https://pan.baidu.com/s/1Ru_wxu9egsmhM7lATjSlgQ?pwd=bcea | |
| 17 | +| [人民日报](https://textdata.cn/blog/2023-12-14-daily-news-dataset/)|[对齐模型Aligned_Word2Vec](https://textdata.cn/blog/2023-12-28-visualize-the-culture-change-using-people-daily-dataset/)|https://pan.baidu.com/s/1IVgP0MyQpez0hpoJyEyFdA?pwd=7qsu| |
| 18 | +| [专利申请](https://textdata.cn/blog/2023-04-13-3571w-patent-dataset-in-china-mainland/) | ***专利摘要-Word2Vec.200.15.bin***| https://pan.baidu.com/s/1FHI_J7wU9eQGRckD12QB5g?pwd=6rr2 | |
| 19 | +| [专利申请](https://textdata.cn/blog/2023-11-20-word2vec-by-year-by-province/) | ***province_w2vs分省份训练词向量***| https://pan.baidu.com/s/1eBFTIZcv2DWssLiaRnCqZQ?pwd=ikpu | |
| 20 | +| [专利申请](https://textdata.cn/blog/2023-11-20-word2vec-by-year-by-province/) | ***year_w2vs分年份训练词向量***| https://pan.baidu.com/s/1lrVkML92cVJdHQa1HQyAwA?pwd=4gqa | |
| 21 | + |
| 22 | +<br><br> |
| 23 | + |
| 24 | + |
| 25 | + |
| 26 | + |
| 27 | +## 4.2 相关代码 |
| 28 | +- [实验 | 使用 Stanford Glove 代码训练中文语料的 GloVe 模型](https://textdata.cn/blog/2025-03-28-train_a_glove_model_on_chinese_corpus_using_stanfordnlp/) |
| 29 | +- [词向量 | 使用**人民网领导留言板**语料训练Word2Vec模型](https://textdata.cn/blog/2023-12-28-train-word2vec-using-renmin-gov-leader-board-dataset/) |
| 30 | +- [可视化 | 人民日报语料反映七十年文化演变](https://textdata.cn/blog/2023-12-28-visualize-the-culture-change-using-people-daily-dataset/) |
| 31 | +- [使用 5000w 专利申请数据集按年份(按省份)训练词向量](https://textdata.cn/blog/2023-11-20-word2vec-by-year-by-province/) |
| 32 | + |
| 33 | +<br><br> |
| 34 | + |
| 35 | +## 4.3 相关文献 |
| 36 | +- [大数据时代下社会科学研究方法的拓展——基于词嵌入技术的文本分析的应用](https://textdata.cn/blog/2022-04-07-word-embeddings-in-social-science/) |
| 37 | +- [OS2022 | 概念空间 | 词嵌入模型如何为组织科学中的测量和理论提供信息](https://textdata.cn/blog/2023-11-03-organization-science-with-word-embeddings/) |
| 38 | +- [词嵌入技术在社会科学领域进行数据挖掘常见39个FAQ汇总](https://textdata.cn/blog/2023-03-15-39faq-about-word-embeddings-for-social-science/) |
| 39 | + |
| 40 | + |
| 41 | + |
0 commit comments