@@ -9,8 +9,8 @@ PaddleNLP为用户提供了常用的 ``BERT``、``ERNIE``、``ALBERT``、``RoBER
9
9
Transformer预训练模型汇总
10
10
------------------------------------
11
11
12
- 下表汇总了介绍了目前PaddleNLP支持的各类预训练模型以及对应预训练权重。我们目前提供了 **68 ** 种预训练的参数权重供用户使用,
13
- 其中包含了 **33 ** 种中文语言模型的预训练权重。
12
+ 下表汇总了介绍了目前PaddleNLP支持的各类预训练模型以及对应预训练权重。我们目前提供了 **70 ** 种预训练的参数权重供用户使用,
13
+ 其中包含了 **34 ** 种中文语言模型的预训练权重。
14
14
15
15
+--------------------+-------------------------------------+--------------+-----------------------------------------+
16
16
| Model | Pretrained Weight | Language | Details of the model |
@@ -171,6 +171,14 @@ Transformer预训练模型汇总
171
171
| | | | 16-heads, 336M parameters. |
172
172
| | | | Trained on lower-cased English text. |
173
173
+--------------------+-------------------------------------+--------------+-----------------------------------------+
174
+ | ERNIE-DOC _ |``ernie-doc-base-zh`` | Chinese | 12-layer, 768-hidden, |
175
+ | | | | 12-heads, 108M parameters. |
176
+ | | | | Trained on Chinese text. |
177
+ | +-------------------------------------+--------------+-----------------------------------------+
178
+ | |``ernie-doc-base-en`` | English | 12-layer, 768-hidden, |
179
+ | | | | 12-heads, 103M parameters. |
180
+ | | | | Trained on lower-cased English text. |
181
+ +--------------------+-------------------------------------+--------------+-----------------------------------------+
174
182
| ERNIE-GEN _ |``ernie-gen-base-en`` | English | 12-layer, 768-hidden, |
175
183
| | | | 12-heads, 108M parameters. |
176
184
| | | | Trained on lower-cased English text. |
@@ -332,6 +340,8 @@ Transformer预训练模型适用任务汇总
332
340
+--------------------+-------------------------+----------------------+--------------------+-----------------+
333
341
| ERNIE_ | ✅ | ✅ | ✅ | ❌ |
334
342
+--------------------+-------------------------+----------------------+--------------------+-----------------+
343
+ | ERNIE-DOC _ | ✅ | ✅ | ✅ | ❌ |
344
+ +--------------------+-------------------------+----------------------+--------------------+-----------------+
335
345
| ERNIE-GEN _ | ❌ | ❌ | ❌ | ✅ |
336
346
+--------------------+-------------------------+----------------------+--------------------+-----------------+
337
347
| ERNIE-GRAM _ | ✅ | ✅ | ✅ | ❌ |
@@ -357,6 +367,7 @@ Transformer预训练模型适用任务汇总
357
367
.. _DistilBert : https://arxiv.org/abs/1910.01108
358
368
.. _ELECTRA : https://arxiv.org/abs/2003.10555
359
369
.. _ERNIE : https://arxiv.org/abs/1904.09223
370
+ .. _ERNIE-DOC : https://arxiv.org/abs/2012.15688
360
371
.. _ERNIE-GEN : https://arxiv.org/abs/2001.11314
361
372
.. _ERNIE-GRAM : https://arxiv.org/abs/2010.12148
362
373
.. _GPT : https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf
0 commit comments