PaddlePaddle
diff --git a/‎examples/few_shot/README.md‎
Lines changed: 3 additions & 2 deletions b/‎examples/few_shot/README.md‎
Lines changed: 3 additions & 2 deletions
diff --git a/‎examples/few_shot/pet/README.md‎
Lines changed: 83 additions & 0 deletions b/‎examples/few_shot/pet/README.md‎
Lines changed: 83 additions & 0 deletions
@@ -12,12 +12,13 @@ Few-Shot Learning 旨在研究如何从少量有监督的训练样本中学习
 | ------------ | ------------ | ------------ | ------------ | ------------ | ------------ | ------------ | ------------ | ------------ |------------ | ------------ | ---------- |
 | P-tuning  | ERNIE1.0  | 55.70 | 83.28  | 63.43  | 35.36  | 60.54  | 50.02  | 54.51  | 50.14 | 54.93 | 41.16 |
 | EFL       | ERNIE1.0  | 54.47 | 84.10  | 60.10  | 35.12  | 56.61  | 56.57  | 53.59  | 46.37 | 61.21 | 36.56 |
-
+| PET       | ERNIE1.0  | 56.38 | 86.88  | 61.90  | 36.90  | 61.10  | 56.51  | 55.02  | 50.31 | 59.72 | 39.11 |
 ## 策略库
 - [P-tuning](./p-tuning)
 - [EFL](./efl)
-- PET(Todo)
+- [PET](./pet)
 
 ## References
 [1]X. Liu et al., “GPT Understands, Too,” arXiv:2103.10385 [cs], Mar. 2021, Accessed: Mar. 22, 2021. [Online]. Available: http://arxiv.org/abs/2103.10385
 [2] Wang, Sinong, Han Fang, Madian Khabsa, Hanzi Mao, and Hao Ma. “Entailment as Few-Shot Learner.” ArXiv:2104.14690 [Cs], April 29, 2021. http://arxiv.org/abs/2104.14690.
+[3] Wang, S., Fang, H., Khabsa, M., Mao, H., and Ma, H., “Entailment as Few-Shot Learner”, ArXiv:2001.07676 [Cs], 2021. https://arxiv.org/abs/2001.07676
@@ -0,0 +1,83 @@
+# [PET](https://arxiv.org/abs/2001.07676)
+
+[PET](https://arxiv.org/abs/2001.07676) (Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference)  提出将输入示例转换为完形填空式短语，以帮助语言模型理解给定的任务
+
+## 代码结构及说明
+```
+|—— pet.py # PET 策略的训练、评估主脚本
+|—— dataset.py # PET 策略针对 FewCLUE 9 个数据集的任务转换逻辑，以及明文 -> 训练数据的转换
+|—— model.py # PET 的网络结构
+|—— evaluate.py # 针对 FewCLUE 9 个数据集的评估函数
+|—— predict.py # 针对 FewCLUE 9 个数据集进行预测
+```
+
+
+## 基于 FewCLUE 进行 PET 实验
+PaddleNLP 内置了 FewCLUE 数据集，可以直接用来进行 PET 策略训练、评估、预测，并生成 FewCLUE 榜单的提交结果，参与 FewCLUE 竞赛。
+
+###  数据准备
+基于 FewCLUE 数据集进行实验只需要  1 行代码，这部分代码在 `pet.py` 脚本中
+
+```
+from paddlenlp.datasets import load_dataset
+
+# 通过指定 "fewclue" 和数据集名字 name="tnews" 即可一键加载 FewCLUE 中的 tnews 数据集
+train_ds, dev_ds, public_test_ds = load_dataset("fewclue", name="tnews", splits=("train_0", "dev_0", "test_public"))
+````
+### 模型训练&评估
+通过如下命令，指定 GPU 0 卡,  在 FewCLUE 的 `tnews` 数据集上进行训练&评估
+```
+#task_name="iflytek"
+task_name="tnews"
+#task_name="eprstmt"
+#task_name="bustm"
+#task_name="ocnli"
+#task_name="csl"
+#task_name="csldcp"
+#task_name="cluewsc"
+#task_name="chid"
+python -u -m paddle.distributed.launch --gpus "0" \
+    pet.py \
+	--task_name ${task_name} \
+	--device gpu \
+    --pattern_id 0 \
+	--save_dir ./${task_name} \
+	--index 0 \
+	--batch_size 16 \
+	--learning_rate 1E-4 \
+	--epochs 10 \
+	--max_seq_length 512 \
+	--language_model "ernie-1.0" \
+```
+参数含义说明
+- `task_name`: FewCLUE 中的数据集名字
+- `device`: 使用 cpu/gpu 进行训练
+- `pattern_id` 完形填空的模式
+- `save_dir`: 模型存储路径
+- `max_seq_length`: 文本的最大截断长度
+
+模型每训练 1 个 epoch,  会在验证集上进行评估
+
+### 模型预测
+通过如下命令，指定 GPU 0 卡， 在 `FewCLUE` 的 `iflytek` 数据集上进行预测
+```
+#task_name="iflytek"
+task_name="tnews"
+#task_name="eprstmt"
+#task_name="bustm"
+#task_name="ocnli"
+#task_name="csl"
+#task_name="csldcp"
+#task_name="cluewsc"
+#task_name="chid"
+python -u -m paddle.distributed.launch --gpus "0" predict.py \
+        --task_name ${task_name} \
+        --device gpu \
+        --init_from_ckpt "./${task_name}/model_120/model_state.pdparams" \
+        --output_dir "./${task_name}/output" \
+        --batch_size 32 \
+        --max_seq_length 512
+```
+
+## References
+[1] Wang, S., Fang, H., Khabsa, M., Mao, H., and Ma, H., “Entailment as Few-Shot Learner”, ArXiv:2001.07676 [Cs], 2021. https://arxiv.org/abs/2001.07676