PaddlePaddle
diff --git a/‎doc/pre_train_model.md‎
Lines changed: 20 additions & 2 deletions b/‎doc/pre_train_model.md‎
Lines changed: 20 additions & 2 deletions
diff --git a/‎doc/yaml.md‎
Lines changed: 2 additions & 0 deletions b/‎doc/yaml.md‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎models/contentunderstanding/readme.md‎
Lines changed: 6 additions & 6 deletions b/‎models/contentunderstanding/readme.md‎
Lines changed: 6 additions & 6 deletions
diff --git a/‎models/contentunderstanding/classification/__init__.py‎ renamed to ‎models/contentunderstanding/textcnn/__init__.py‎ b/‎models/contentunderstanding/classification/__init__.py‎ renamed to ‎models/contentunderstanding/textcnn/__init__.py‎
diff --git a/‎models/contentunderstanding/classification/config.yaml‎ renamed to ‎models/contentunderstanding/textcnn/config.yaml‎
Lines changed: 1 addition & 1 deletion b/‎models/contentunderstanding/classification/config.yaml‎ renamed to ‎models/contentunderstanding/textcnn/config.yaml‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎models/contentunderstanding/classification/data/preprocess.py‎ renamed to ‎models/contentunderstanding/textcnn/data/preprocess.py‎ b/‎models/contentunderstanding/classification/data/preprocess.py‎ renamed to ‎models/contentunderstanding/textcnn/data/preprocess.py‎
diff --git a/‎models/contentunderstanding/classification/data/test/test.txt‎ renamed to ‎models/contentunderstanding/textcnn/data/test/test.txt‎ b/‎models/contentunderstanding/classification/data/test/test.txt‎ renamed to ‎models/contentunderstanding/textcnn/data/test/test.txt‎
diff --git a/‎models/contentunderstanding/classification/data/train/train.txt‎ renamed to ‎models/contentunderstanding/textcnn/data/train/train.txt‎ b/‎models/contentunderstanding/classification/data/train/train.txt‎ renamed to ‎models/contentunderstanding/textcnn/data/train/train.txt‎
diff --git a/‎models/contentunderstanding/classification/model.py‎ renamed to ‎models/contentunderstanding/textcnn/model.py‎ b/‎models/contentunderstanding/classification/model.py‎ renamed to ‎models/contentunderstanding/textcnn/model.py‎
diff --git a/‎models/contentunderstanding/classification/reader.py‎ renamed to ‎models/contentunderstanding/textcnn/reader.py‎ b/‎models/contentunderstanding/classification/reader.py‎ renamed to ‎models/contentunderstanding/textcnn/reader.py‎
@@ -7,9 +7,27 @@ PaddleRec基于业务实践，使用真实数据，产出了推荐领域算法
 ### 获取地址
 
 ```bash
-wget xxx.tar.gz
+wget https://paddlerec.bj.bcebos.com/textcnn_pretrain%2Fpretrain_model.tar.gz
 ```
 
 ### 使用方法
 
-解压后，得到的是一个paddle的模型文件夹，使用`PaddleRec/models/contentunderstanding/classification_finetue`模型进行加载
+解压后，得到的是一个paddle的模型文件夹，使用`PaddleRec/models/contentunderstanding/textcnn`模型进行加载  
+您可以在PaddleRec/models/contentunderstanding/textcnn_pretrain中找到finetune_startup.py文件，在config.yaml中配置startup_class_path和init_pretraining_model_path两个参数。  
+在参数startup_class_path中配置finetune_startup.py文件的地址，在init_pretraining_model_path参数中配置您要加载的参数文件。  
+以textcnn_pretrain为例，配置完的runner如下：
+```
+runner:
+- name: train_runner
+  class: train
+  epochs: 6
+  device: cpu
+  save_checkpoint_interval: 1
+  save_checkpoint_path: "increment"
+  init_model_path: "" 
+  print_interval: 10
+  startup_class_path: "{workspace}/finetune_startup.py"
+  init_pretraining_model_path: "{workspace}/pretrain_model/pretrain_model_params"
+  phases: phase_train
+```
+具体使用方法请参照textcnn[使用预训练模型进行finetune](https://github.com/PaddlePaddle/PaddleRec/tree/master/models/contentunderstanding/textcnn_pretrain)
@@ -37,6 +37,8 @@
 |      startup_class_path       |    string    |                           路径                            |    否    |                     自定义startup流程实现的地址                      |
 |       runner_class_path       |    string    |                           路径                            |    否    |                      自定义runner流程实现的地址                      |
 |      terminal_class_path      |    string    |                           路径                            |    否    |                     自定义terminal流程实现的地址                     |
+|  init_pretraining_model_path  |    string    |                           路径                            |    否    |自定义的startup流程中需要传入这个参数，finetune中需要加载的参数的地址 |
+
 
 
 
 
@@ -1,7 +1,7 @@
 # 内容理解模型库
 
 ## 简介
-我们提供了常见的内容理解任务中使用的模型算法的PaddleRec实现, 单机训练&预测效果指标以及分布式训练&预测性能指标等。实现的内容理解模型包括 [Tagspace](tagspace)、[文本分类](classification)等。
+我们提供了常见的内容理解任务中使用的模型算法的PaddleRec实现, 单机训练&预测效果指标以及分布式训练&预测性能指标等。实现的内容理解模型包括 [Tagspace](tagspace)、[文本分类](textcnn)、[基于textcnn的预训练模型](textcnn_pretrain)等。
 
 模型算法库在持续添加中，欢迎关注。
 
@@ -23,7 +23,7 @@
 |       模型        |       简介        |       论文        |
 | :------------------: | :--------------------: | :---------: |
 | TagSpace | 标签推荐 | [EMNLP 2014][TagSpace: Semantic Embeddings from Hashtags](https://www.aclweb.org/anthology/D14-1194.pdf) |
-| Classification | 文本分类 | [EMNLP 2014][Convolutional neural networks for sentence classication](https://www.aclweb.org/anthology/D14-1181.pdf) |
+| textcnn | 文本分类 | [EMNLP 2014][Convolutional neural networks for sentence classication](https://www.aclweb.org/anthology/D14-1181.pdf) |
 
 下面是每个模型的简介（注：图片引用自链接中的论文）
 
@@ -32,7 +32,7 @@
 <img align="center" src="../../doc/imgs/tagspace.png">
 <p>
 
-[文本分类CNN模型](https://www.aclweb.org/anthology/D14-1181.pdf)
+[textCNN模型](https://www.aclweb.org/anthology/D14-1181.pdf)
 <p align="center">
 <img align="center" src="../../doc/imgs/cnn-ckim2014.png">
 <p>
@@ -42,7 +42,7 @@
 git clone https://github.com/PaddlePaddle/PaddleRec.git paddle-rec
 cd PaddleRec
 python -m paddlerec.run -m models/contentunderstanding/tagspace/config.yaml
-python -m paddlerec.run -m models/contentunderstanding/classification/config.yaml
+python -m paddlerec.run -m models/contentunderstanding/textcnn/config.yaml
 ```
 
 ## 使用教程（复现论文）
@@ -134,7 +134,7 @@ batch: 13, acc: [0.928], loss: [0.01736144]
 batch: 14, acc: [0.93], loss: [0.01911209]
 ```
 
-**（2）Classification**
+**（2）textcnn**
 
 ### 数据处理
 情感倾向分析（Sentiment Classification，简称Senta）针对带有主观描述的中文文本，可自动判断该文本的情感极性类别并给出相应的置信度。情感类型分为积极、消极。情感倾向分析能够帮助企业理解用户消费习惯、分析热点话题和危机舆情监控，为企业提供有利的决策支持。  
@@ -206,4 +206,4 @@ batch: 3, acc: [0.90234375], loss: [0.27907994]
 |       数据集        |       模型       |       loss         |       acc         |
 | :------------------: | :--------------------: | :---------: |:---------: | 
 |       ag news dataset        |       TagSpace       |       0.0198        |       0.9177          | 
-|       ChnSentiCorp        |       Classification       |       0.2282        |        0.9127         | 
+|       ChnSentiCorp        |       textcnn       |       0.2282        |        0.9127         | 
@@ -12,7 +12,7 @@
 # See the License for the specific language governing permissions and
 # limitations under the License.
 
-workspace: "models/contentunderstanding/classification"
+workspace: "models/contentunderstanding/textcnn"
 
 dataset:
 - name: data1