Commit a16b934

Merge pull request #410 from yinhaofeng/inference_python
add inference python
2 parents 779f512 + 2b1ea65 commit a16b934

File tree: 11 files changed, +531 −3 lines
doc/inference.md

Lines changed: 54 additions & 0 deletions
@@ -0,0 +1,54 @@

# How to use Paddle Inference

PaddleRec currently supports saving a model with the save_inference_model interface during static-graph training, and deploying the saved model for server-side prediction with the Paddle Inference library. This tutorial uses the wide_deep model as an example to show how to use both features.

## Saving a model with the save_inference_model interface

Server-side deployment with Python requires the model to be saved with the save_inference_model interface first.

1. Add the use_inference parameter to the model's yaml configuration and set it to True. use_inference controls whether the model is saved with the save_inference_model interface; it defaults to False. A model saved this way can be used for prediction with Paddle Inference, but can no longer be loaded directly by PaddleRec's native prediction method.

2. Decide which model variables you need as prediction inputs and outputs, and put their names, as strings, into the save_inference_feed_varnames and save_inference_fetch_varnames lists.

Taking the wide_deep model as an example, the structure below can be found in its config.yaml. The training and test data are the Criteo dataset used in the [Display Advertising Challenge](https://www.kaggle.com/c/criteo-display-ad-challenge/). The dataset has two parts: the training set contains a portion of Criteo's traffic over a period of time, and the test set contains the ad-click traffic of the day following the training data. Among the feed names, ```<label>``` indicates whether the ad was clicked (1 for clicked, 0 for not), ```<integer feature>``` denotes the numeric features (the 13 continuous dense_input features), and ```<categorical feature>``` denotes the categorical features (the 26 sparse features C1–C26). The fetch name is the auc output; concretely, it is the cast operator of the statement in def net() of static_model.py that converts auc to float32.
```yaml
runner:
  # common settings omitted
  ...
  # use inference save model
  use_inference: True # save an inference model during static-graph training
  save_inference_feed_varnames: ["label","C1","C2","C3","C4","C5","C6","C7","C8","C9","C10","C11","C12","C13","C14","C15","C16","C17","C18","C19","C20","C21","C22","C23","C24","C25","C26","dense_input"] # feed variable names of the inference model
  save_inference_fetch_varnames: ["cast_0.tmp_0"] # fetch variable names of the inference model
```
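The 28 feed names above follow a fixed pattern — the click label, the 26 categorical features C1..C26, then the dense input — so the list can be generated rather than typed by hand. A minimal sketch:

```python
# Reproduce the save_inference_feed_varnames list for wide_deep:
# the click label, 26 sparse categorical features C1..C26, and the
# single dense_input variable carrying the 13 continuous features.
feed_varnames = ["label"] + ["C%d" % i for i in range(1, 27)] + ["dense_input"]
fetch_varnames = ["cast_0.tmp_0"]  # the float32 cast of auc in static_model.py

print(len(feed_varnames))  # 28 names in total
```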
3. Launch static-graph training

```bash
# enter the model directory
# cd models/rank/wide_deep # can be run from any directory
# static-graph training
python -u ../../../tools/static_trainer.py -m config.yaml # run config_bigdata.yaml for the full dataset
```
## Deploying the saved model on a server with the Paddle Inference library

PaddleRec provides the tools/paddle_infer.py script so that you can conveniently and efficiently run prediction on the model with the Paddle Inference library.

Packages to install:

```bash
pip install pynvml
pip install psutil
pip install GPUtil
```
1. Arguments of the paddle_infer.py script:

| Name | Type | Value | Required | Description |
| :-----------------: | :-------: | :--------------------------: | :-----: | :------------------------------------------------------------------: |
| --model_file | string | any path || path of the model file (used when loading a Combined model from disk) |
| --params_file | string | any path || path of the parameters file (used when loading a Combined model from disk) |
| --model_dir | string | any path || path of the model directory (used when loading a non-Combined model from disk) |
| --use_gpu | bool | True/False || whether to use the GPU |
| --data_dir | string | any path || directory of the test data |
| --reader_file | string | any path || path of the python file containing the Reader() used at test time |
| --batchsize | int | >= 1 || number of samples per batch at prediction time |
| --model_name | str | any name || name of the output model |
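The flags in the table can be wired up with argparse. The sketch below is a simplified stand-in for the real tools/paddle_infer.py command line — the defaults and the bool parsing here are assumptions for illustration, not the script's actual code:

```python
import argparse

def parse_args(argv=None):
    # Simplified mirror of the paddle_infer.py flags (hypothetical defaults).
    parser = argparse.ArgumentParser(description="Paddle Inference demo runner")
    parser.add_argument("--model_file", type=str, default=None)   # Combined model: model file
    parser.add_argument("--params_file", type=str, default=None)  # Combined model: params file
    parser.add_argument("--model_dir", type=str, default=None)    # non-Combined model directory
    parser.add_argument("--use_gpu", type=lambda s: s.lower() == "true", default=False)
    parser.add_argument("--data_dir", type=str, default=None)
    parser.add_argument("--reader_file", type=str, default=None)
    parser.add_argument("--batchsize", type=int, default=1)
    parser.add_argument("--model_name", type=str, default="rec_model")
    return parser.parse_args(argv)

args = parse_args(["--model_file", "rec_inference.pdmodel",
                   "--params_file", "rec_inference.pdiparams",
                   "--use_gpu", "False", "--batchsize", "5"])
```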
2. Using the wide_deep model's demo data as an example, launch prediction:

```bash
# enter the model directory
# cd models/rank/wide_deep # can be run from any directory
python -u ../../../tools/paddle_infer.py --model_file=output_model_wide_deep/2/rec_inference.pdmodel --params_file=output_model_wide_deep/2/rec_inference.pdiparams --use_gpu=False --data_dir=data/sample_data/train --reader_file=criteo_reader.py --batchsize=5
```
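Whichever way the predictor is launched, each feed name in the config has to be paired with a batch of data. The sketch below shows what one wide_deep input batch could look like; the per-feature shapes (one label, 26 sparse ids, 13 dense floats per sample) are inferred from the dataset description above, not taken from paddle_infer.py:

```python
def build_feed_batch(samples):
    # samples: list of (label, sparse_ids, dense_values) tuples, where
    # sparse_ids holds the 26 C1..C26 ids and dense_values the 13 floats.
    feed = {"label": [], "dense_input": []}
    for i in range(1, 27):
        feed["C%d" % i] = []
    for label, sparse_ids, dense_values in samples:
        assert len(sparse_ids) == 26 and len(dense_values) == 13
        feed["label"].append([label])
        for i, sid in enumerate(sparse_ids, start=1):
            feed["C%d" % i].append([sid])
        feed["dense_input"].append(dense_values)
    return feed

batch = build_feed_batch([(1, list(range(26)), [0.1] * 13),
                          (0, list(range(26)), [0.2] * 13)])
```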

doc/yaml.md

Lines changed: 3 additions & 0 deletions
@@ -21,6 +21,9 @@
 | print_interval | int | >= 1 || interval, in batches, between printouts of training metrics |
 | use_auc | bool | True/False || reset the value of the auc metric at the start of each epoch |
 | use_visual | bool | True/False || enable visualization of model training; visualDL must be installed when enabled |
+| use_inference | bool | True/False || whether to save the model with the save_inference_model interface |
+| save_inference_feed_varnames | list[string] | names of designated Variables in the network || input variable names of the inference model |
+| save_inference_fetch_varnames | list[string] | names of designated Variables in the network || output variable names of the inference model |

 ## hyper_parameters settings
Lines changed: 112 additions & 0 deletions
@@ -0,0 +1,112 @@

```python
# Copyright (c) 2021 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
"""
feed_var {
  name: "movieid"
  alias_name: "movieid"
  is_lod_tensor: true
  feed_type: 0
  shape: -1
}
feed_var {
  name: "title"
  alias_name: "title"
  is_lod_tensor: true
  feed_type: 0
  shape: -1
}
feed_var {
  name: "genres"
  alias_name: "genres"
  is_lod_tensor: true
  feed_type: 0
  shape: -1
}
fetch_var {
  name: "save_infer_model/scale_0.tmp_0"
  alias_name: "save_infer_model/scale_0.tmp_0"
  is_lod_tensor: false
  fetch_type: 1
  shape: 32
}
"""

import codecs

import numpy as np
from paddle_serving_app.local_predict import LocalPredictor


class Movie(object):
    def __init__(self):
        self.movie_id, self.title, self.genres = "", "", ""


def hash2(a):
    # Bucket any raw feature value into the 600000-slot sparse vocabulary.
    return hash(a) % 600000


ctr_client = LocalPredictor()
ctr_client.load_model_config("serving_server")
with codecs.open("movies.dat", "r", encoding='utf-8', errors='ignore') as f:
    lines = f.readlines()

ff = open("movie_vectors.txt", 'w')

for line in lines:
    if len(line.strip()) == 0:
        continue
    # Each line of movies.dat looks like: movie_id::title::genre1|genre2|...
    tmp = line.strip().split("::")
    movie_id = tmp[0]
    title = tmp[1]
    genre_group = tmp[2]

    genre = genre_group.strip().split("|")
    movie = Movie()
    item_infos = []
    if isinstance(genre, list):
        movie.genres = genre
    else:
        movie.genres = [genre]
    movie.movie_id, movie.title = movie_id, title
    item_infos.append(movie)

    dic = {"movieid": [], "title": [], "genres": []}
    for i, item_info in enumerate(item_infos):
        dic["movieid"].append(hash2(item_info.movie_id))
        dic["title"].append(hash2(item_info.title))
        dic["genres"].extend([hash2(x) for x in item_info.genres])

    # Pad the title to exactly 4 ids and the genres to exactly 3 ids;
    # truncate unconditionally so the fixed-width reshapes below cannot fail.
    if len(dic["title"]) <= 4:
        for i in range(4 - len(dic["title"])):
            dic["title"].append("0")
    dic["title"] = dic["title"][:4]
    if len(dic["genres"]) <= 3:
        for i in range(3 - len(dic["genres"])):
            dic["genres"].append("0")
    dic["genres"] = dic["genres"][:3]

    dic["movieid"] = np.array(dic["movieid"]).astype(np.int64).reshape(-1, 1)
    dic["title"] = np.array(dic["title"]).astype(np.int64).reshape(-1, 4)
    dic["genres"] = np.array(dic["genres"]).astype(np.int64).reshape(-1, 3)

    fetch_map = ctr_client.predict(
        feed=dic, fetch=["save_infer_model/scale_0.tmp_0"], batch=True)
    ff.write("{}:{}\n".format(movie_id,
                              str(fetch_map["save_infer_model/scale_0.tmp_0"]
                                  .tolist()[0])))
ff.close()
```
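The script above hashes every raw feature into a 600000-slot vocabulary and pads/truncates title to 4 ids and genres to 3. That preprocessing can be isolated and checked on its own; hash2 is copied from the script, while pad_or_truncate is a simplified stand-in for the padding loops (the script pads with "0" strings and casts later, which yields the same int64 values). Note that Python's built-in hash of a string varies between interpreter runs unless PYTHONHASHSEED is fixed, so the ids are only stable within one process:

```python
def hash2(a):
    # Same bucketing as the script: 600000 matches sparse_feature_number.
    return hash(a) % 600000

def pad_or_truncate(ids, size, pad_id=0):
    # Pad with pad_id up to `size` entries, then cut off any excess.
    return (ids + [pad_id] * size)[:size]

title_ids = pad_or_truncate([hash2(w) for w in ["Toy", "Story"]], 4)
genre_ids = pad_or_truncate([hash2(g) for g in ["Animation", "Children's", "Comedy", "Drama"]], 3)
```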
Lines changed: 49 additions & 0 deletions
@@ -0,0 +1,49 @@

```yaml
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

runner:
  train_data_dir: "../data/train"
  train_reader_path: "reader" # importlib format
  train_batch_size: 1
  model_save_path: "movie_model"

  use_gpu: True
  epochs: 5
  print_interval: 20

  test_data_dir: "../data/test"
  infer_reader_path: "reader" # importlib format
  infer_batch_size: 1
  infer_load_path: "movie_model"
  infer_start_epoch: 4
  infer_end_epoch: 5

  runner_result_dump_path: "recall_infer_result"

  # use inference save model
  use_inference: True
  save_inference_feed_varnames: ["movieid", "title", "genres"]
  save_inference_fetch_varnames: ["linear_15.tmp_1"]

# hyper parameters of user-defined network
hyper_parameters:
  # optimizer config
  optimizer:
    class: Adam
    learning_rate: 0.001
  # user-defined <key, value> pairs
  sparse_feature_number: 600000
  sparse_feature_dim: 9
  dense_input_dim: 13
  fc_sizes: [512, 256, 128, 32]
```
Lines changed: 49 additions & 0 deletions
@@ -0,0 +1,49 @@

```yaml
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

runner:
  train_data_dir: "../data/train"
  train_reader_path: "reader" # importlib format
  train_batch_size: 1
  model_save_path: "user_model"

  use_gpu: True
  epochs: 5
  print_interval: 20

  test_data_dir: "../data/test"
  infer_reader_path: "reader" # importlib format
  infer_batch_size: 1
  infer_load_path: "user_model"
  infer_start_epoch: 4
  infer_end_epoch: 5

  runner_result_dump_path: "recall_infer_result"

  # use inference save model
  use_inference: True
  save_inference_feed_varnames: ["userid", "gender", "age", "occupation"]
  save_inference_fetch_varnames: ["linear_11.tmp_1"]

# hyper parameters of user-defined network
hyper_parameters:
  # optimizer config
  optimizer:
    class: Adam
    learning_rate: 0.001
  # user-defined <key, value> pairs
  sparse_feature_number: 600000
  sparse_feature_dim: 9
  dense_input_dim: 13
  fc_sizes: [512, 256, 128, 32]
```

models/rank/wide_deep/README.md

Lines changed: 1 addition & 1 deletion
@@ -88,7 +88,7 @@ wide&deep designs an approach that fuses a shallow (wide) model and a deep (deep) model

 | Model | auc | batch_size | thread_num | epoch_num | Time of each epoch |
 | :------| :------ | :------| :------ | :------| :------ |
-| wide_deep | 0.82 | 512 | 1 | 4 | about 2 hours |
+| wide_deep | 0.79 | 512 | 1 | 4 | about 2 hours |

 1. Make sure your current directory is PaddleRec/models/rank/wide_deep
 2. Go into the paddlerec/datasets/criteo directory and run the script there; it downloads our preprocessed full Criteo dataset from a domestic mirror and unpacks it into the target folder.

models/rank/wide_deep/config.yaml

Lines changed: 5 additions & 1 deletion
@@ -28,8 +28,12 @@ runner:
   infer_reader_path: "criteo_reader" # importlib format
   infer_batch_size: 5
   infer_load_path: "output_model_wide_deep"
-  infer_start_epoch: 0
+  infer_start_epoch: 2
   infer_end_epoch: 3
+  # use inference save model
+  use_inference: False
+  save_inference_feed_varnames: ["label","C1","C2","C3","C4","C5","C6","C7","C8","C9","C10","C11","C12","C13","C14","C15","C16","C17","C18","C19","C20","C21","C22","C23","C24","C25","C26","dense_input"]
+  save_inference_fetch_varnames: ["cast_0.tmp_0"]

 # hyper parameters of user-defined network
 hyper_parameters:

models/rank/wide_deep/static_model.py

Lines changed: 1 addition & 0 deletions
@@ -84,6 +84,7 @@ def net(self, input, is_infer=False):
             label=self.label_input,
             num_thresholds=2**12,
             slide_steps=20)
+        auc = paddle.cast(auc, "float32")
         self.inference_target_var = auc
         if is_infer:
             fetch_dict = {'auc': auc}
