|
| 1 | +# PaddlePaddle Custom Device Implementaion for Custom CPU |
| 2 | + |
| 3 | +English | [简体中文](./README_cn.md) |
| 4 | + |
| 5 | +Please refer to the following steps to compile, install and verify the custom device implementaion for Habana HPU. |
| 6 | + |
| 7 | +## Get Sources |
| 8 | + |
| 9 | +```bash |
| 10 | +# clone source |
| 11 | +git clone --recursive https://github.com/PaddlePaddle/PaddleCustomDevice |
| 12 | +cd PaddleCustomDevice |
| 13 | + |
| 14 | +# get the latest submodule source code |
| 15 | +git submodule sync |
| 16 | +git submodule update --remote --init --recursive |
| 17 | +``` |
| 18 | + |
| 19 | +## Compile and Install |
| 20 | + |
| 21 | +```bash |
| 22 | +# navigate to implementaion for Habana HPU |
| 23 | +cd backends/intel_hpu |
| 24 | + |
| 25 | +# before compiling, ensure that Paddle is installed, you can run the following command |
| 26 | +pip install paddlepaddle==0.0.0 -f https://www.paddlepaddle.org.cn/whl/linux/cpu-mkl/develop.html |
| 27 | + |
| 28 | +# create the build directory and navigate in |
| 29 | +mkdir build && cd build |
| 30 | + |
| 31 | +cmake .. |
| 32 | +make -j8 |
| 33 | + |
| 34 | +# using pip to install the output |
| 35 | +pip install dist/paddle_intel_hpu*.whl |
| 36 | +``` |
| 37 | + |
| 38 | +## Verification |
| 39 | + |
| 40 | +```bash |
| 41 | +# list available hardware backends |
| 42 | +python -c "import paddle; print(paddle.device.get_all_custom_device_type())" |
| 43 | + |
| 44 | +# expected output |
| 45 | +['intel_hpu'] |
| 46 | + |
| 47 | +# run a simple model |
| 48 | +python ../tests/test_MNIST_model.py |
| 49 | + |
| 50 | +# expected similar output |
| 51 | +... ... |
| 52 | +Epoch 0 step 0, Loss = [2.2956038], Accuracy = 0.15625 |
| 53 | +Epoch 0 step 100, Loss = [2.1552896], Accuracy = 0.3125 |
| 54 | +Epoch 0 step 200, Loss = [2.1177733], Accuracy = 0.4375 |
| 55 | +Epoch 0 step 300, Loss = [2.0089214], Accuracy = 0.53125 |
| 56 | +Epoch 0 step 400, Loss = [2.0845466], Accuracy = 0.421875 |
| 57 | +Epoch 0 step 500, Loss = [2.0473], Accuracy = 0.453125 |
| 58 | +Epoch 0 step 600, Loss = [1.8561764], Accuracy = 0.71875 |
| 59 | +Epoch 0 step 700, Loss = [1.9915285], Accuracy = 0.53125 |
| 60 | +Epoch 0 step 800, Loss = [1.8925955], Accuracy = 0.640625 |
| 61 | +Epoch 0 step 900, Loss = [1.8199624], Accuracy = 0.734375 |
| 62 | +``` |
| 63 | + |
| 64 | +## Using PaddleInference |
| 65 | + |
| 66 | +Re-compile plugin |
| 67 | + |
| 68 | +```bash |
| 69 | +# Compile PaddleInference |
| 70 | +git clone https://github.com/PaddlePaddle/Paddle.git |
| 71 | +git clone https://github.com/ronny1996/Paddle-Inference-Demo.git |
| 72 | + |
| 73 | +mkdir -p Paddle/build |
| 74 | +pushd Paddle/build |
| 75 | + |
| 76 | +cmake .. -DPY_VERSION=3.7 -DWITH_GPU=OFF -DWITH_TESTING=ON -DCMAKE_BUILD_TYPE=Release -DON_INFER=ON -DWITH_MKL=ON -DWITH_CUSTOM_DEVICE=ON |
| 77 | + |
| 78 | +make -j8 |
| 79 | + |
| 80 | +popd |
| 81 | +cp -R Paddle/build/paddle_inference_install_dir Paddle-Inference-Demo/c++/lib/paddle_inference |
| 82 | +export PADDLE_INFERENCE_LIB_DIR=$(realpath Paddle-Inference-Demo/c++/lib/paddle_inference/paddle/lib) |
| 83 | + |
| 84 | +# Compile the plug-in |
| 85 | +mkdir -p PaddleCustomDevice/backends/intel_hpu/build |
| 86 | +pushd PaddleCustomDevice/backends/intel_hpu/build |
| 87 | + |
| 88 | +cmake .. -DON_INFER=ON -DPADDLE_INFERENCE_LIB_DIR=${PADDLE_INFERENCE_LIB_DIR} |
| 89 | +make -j8 |
| 90 | + |
| 91 | +# Specify the plug-in directory |
| 92 | +export CUSTOM_DEVICE_ROOT=$PWD |
| 93 | +popd |
| 94 | +``` |
| 95 | + |
| 96 | +Using PaddleInference |
| 97 | + |
| 98 | +```bash |
| 99 | +pushd Paddle-Inference-Demo/c++/resnet50 |
| 100 | + |
| 101 | +# Modify resnet50_test.cc, use config.EnableCustomDevice("intel_hpu", 0) to replace config.EnableUseGpu(100, 0) |
| 102 | + |
| 103 | +bash run.sh |
| 104 | +``` |
| 105 | + |
| 106 | +Expected similar output |
| 107 | + |
| 108 | +```bash |
| 109 | +I0713 09:02:38.808723 24792 resnet50_test.cc:74] run avg time is 297.75 ms |
| 110 | +I0713 09:02:38.808859 24792 resnet50_test.cc:89] 0 : 8.76192e-29 |
| 111 | +I0713 09:02:38.808894 24792 resnet50_test.cc:89] 100 : 8.76192e-29 |
| 112 | +I0713 09:02:38.808904 24792 resnet50_test.cc:89] 200 : 8.76192e-29 |
| 113 | +I0713 09:02:38.808912 24792 resnet50_test.cc:89] 300 : 8.76192e-29 |
| 114 | +I0713 09:02:38.808920 24792 resnet50_test.cc:89] 400 : 8.76192e-29 |
| 115 | +I0713 09:02:38.808928 24792 resnet50_test.cc:89] 500 : 8.76192e-29 |
| 116 | +I0713 09:02:38.808936 24792 resnet50_test.cc:89] 600 : 1.05766e-19 |
| 117 | +I0713 09:02:38.808945 24792 resnet50_test.cc:89] 700 : 2.04093e-23 |
| 118 | +I0713 09:02:38.808954 24792 resnet50_test.cc:89] 800 : 3.85255e-25 |
| 119 | +I0713 09:02:38.808961 24792 resnet50_test.cc:89] 900 : 8.76192e-29 |
| 120 | +``` |
0 commit comments