# Create a new op in C++, compile it to a .so library and load it in Python

Some operators in the PyTorch library are not supported in TRTorch. To support
these ops, users can register converters for the missing ops. For example, if we
try to compile a graph with a build of TRTorch that doesn't support the
[ELU](https://pytorch.org/docs/stable/generated/torch.nn.ELU.html) operation,
we will get the following error:

> Unable to convert node: %result.2 : Tensor = aten::elu(%x.1, %2, %3, %3) # /home/bowa/.local/lib/python3.6/site-packages/torch/nn/functional.py:1227:17 (conversion.AddLayer)
Schema: aten::elu(Tensor self, Scalar alpha=1, Scalar scale=1, Scalar input_scale=1) -> (Tensor)
Converter for aten::elu requested, but no such converter was found.
If you need a converter for this operator, you can try implementing one yourself
or request a converter: https://www.github.com/NVIDIA/TRTorch/issues

## Writing a Converter in C++
We can register a converter for this operator in our application. You can find more
information on the details of writing converters in the contributors documentation
([Writing Converters](https://nvidia.github.io/TRTorch/contributors/writing_converters.html)).
Once we are familiar with these rules and patterns, we can create a separate C++ source file like the following:

```c++
#include "core/conversion/converters/converters.h"
#include "core/util/prelude.h"

namespace trtorch {
namespace core {
namespace conversion {
namespace converters {
namespace impl {
namespace {

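// Register a conversion pattern keyed on the aten::elu schema. The lambda below
// is invoked for every matching node while TRTorch converts the TorchScript graph.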
auto acthardtanh TRTORCH_UNUSED = RegisterNodeConversionPatterns().pattern(
    {"aten::elu(Tensor self, Scalar alpha=1, Scalar scale=1, Scalar input_scale=1) -> (Tensor)",
     [](ConversionCtx* ctx, const torch::jit::Node* n, args& args) -> bool {
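       // args holds the node inputs: args[0] is the input tensor (frozen to an
       // ITensor if it is a constant) and args[1] is the alpha scalar.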
       auto in = args[0].ITensorOrFreeze(ctx);
       auto alpha = args[1].unwrapToDouble();

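       // Map aten::elu onto TensorRT's built-in ELU activation layer.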
       auto new_layer = ctx->net->addActivation(*in, nvinfer1::ActivationType::kELU);
       TRTORCH_CHECK(new_layer, "Unable to create layer for aten::elu");

       new_layer->setAlpha(alpha);
       new_layer->setName(util::node_info(n).c_str());
       auto out_tensor = ctx->AssociateValueAndTensor(n->outputs()[0], new_layer->getOutput(0));

       LOG_DEBUG("Output shape: " << out_tensor->getDimensions());
       return true;
     }});

} // namespace
} // namespace impl
} // namespace converters
} // namespace conversion
} // namespace core
} // namespace trtorch
```

## Generate the `.so` library
To use this converter in Python, it is recommended to build it with PyTorch's
[C++/CUDA Extension](https://pytorch.org/tutorials/advanced/cpp_extension.html#custom-c-and-cuda-extensions) tooling.
The example below shows how to wrap the converter into a `.so`
library that you can load and use in a Python application.
```python
import os
from setuptools import setup, Extension
from torch.utils import cpp_extension

dir_path = os.path.dirname(os.path.realpath(__file__))

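# Build the converter as a CUDA extension, compiling it against the TRTorch
# headers and linking it to the prebuilt libtrtorch.so from the bazel build.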
ext_modules = [
    cpp_extension.CUDAExtension('elu_converter', ['elu_converter.cpp'],
                                library_dirs=[(
                                    dir_path + "/../../bazel-bin/cpp/api/lib/"
                                )],
                                libraries=["trtorch"],
                                include_dirs=[dir_path + "/../../"])
]

setup(
    name='elu_converter',
    ext_modules=ext_modules,
    cmdclass={'build_ext': cpp_extension.BuildExtension},
)
```
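If you need extra compiler or linker flags, `cpp_extension` forwards `extra_compile_args`
and `extra_link_args` to the build. Below is a minimal sketch of the same extension with
illustrative flags; the specific flag values are assumptions, not requirements of TRTorch.
```python
import os
from setuptools import setup
from torch.utils import cpp_extension

dir_path = os.path.dirname(os.path.realpath(__file__))

ext_modules = [
    cpp_extension.CUDAExtension(
        'elu_converter', ['elu_converter.cpp'],
        library_dirs=[dir_path + "/../../bazel-bin/cpp/api/lib/"],
        libraries=["trtorch"],
        include_dirs=[dir_path + "/../../"],
        # Illustrative flags only; use whatever your toolchain actually needs.
        extra_compile_args={"cxx": ["-std=c++14"],   # host compiler flags
                            "nvcc": ["-O2"]},        # nvcc flags
        extra_link_args=["-Wl,--no-as-needed"])      # linker flags
]

setup(
    name='elu_converter',
    ext_modules=ext_modules,
    cmdclass={'build_ext': cpp_extension.BuildExtension},
)
```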
Make sure to include the path to the header files in `include_dirs` and the path
to the dependent libraries in `library_dirs`. You can also pass other compilation
flags through `cpp_extension` if you need to, as sketched above. Then, run the setup script:
```shell
python3 setup.py install --user
```
After running `python3 setup.py install --user`, you should see output similar to that shown in the
[C++/CUDA extension tutorial](https://pytorch.org/tutorials/advanced/cpp_extension.html#custom-c-and-cuda-extensions),
and a couple of new folders will be generated by the command. In the `build` folder you can find
the generated `.so` library, which can be loaded in our Python application.
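Note that the exact folder and file names under `build/` depend on your platform and
Python version (for example `lib.linux-x86_64-3.6`). If you would rather not hardcode
the path, here is a small sketch, using only the standard library, that locates the
compiled converter:
```python
import glob
import os

# The setuptools build directory (e.g. build/lib.linux-x86_64-3.6) and the
# extension suffix vary with platform and Python version, so search for the
# compiled library instead of hardcoding its path.
matches = glob.glob(os.path.join("build", "lib.*", "elu_converter*.so"))
if not matches:
    raise FileNotFoundError("No compiled converter found; run `python3 setup.py install --user` first")
converter_lib = os.path.abspath(matches[0])
print("Found converter library:", converter_lib)
```
The resulting path can then be passed to `torch.ops.load_library` as shown in the next section.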
## Load the `.so` in a Python Application
With this newly generated library, TRTorch now supports the newly developed converter.
We use `torch.ops.load_library` to load the `.so`. For example, we can load the ELU
converter and use it in our application:
```python
import torch
import trtorch

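# Loading the shared library registers the new converter with TRTorch as a side
# effect, so aten::elu can now be converted.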
torch.ops.load_library('./build/lib.linux-x86_64-3.6/elu_converter.cpython-36m-x86_64-linux-gnu.so')

class Elu(torch.nn.Module):
    def __init__(self):
        super(Elu, self).__init__()
        self.elu = torch.nn.ELU()

    def forward(self, x):
        return self.elu(x)

def main():
    data = torch.randn((1, 1, 2, 2)).to("cuda")
    model = Elu().eval()  #.cuda()

    scripted_model = torch.jit.script(model)
    print(scripted_model.graph)
    compile_settings = {
        "input_shapes": [{
            "min": [1024, 1, 32, 32],
            "opt": [1024, 1, 33, 33],
            "max": [1024, 1, 34, 34],
        }],
        "op_precision": torch.half  # Run with FP16
    }
    trt_ts_module = trtorch.compile(scripted_model, compile_settings)
    input_data = torch.randn((1024, 1, 32, 32))
    print(input_data[0, :, :, 0])
    input_data = input_data.half().to("cuda")
    result = trt_ts_module(input_data)
    print(result[0, :, :, 0])

if __name__ == "__main__":
    main()
```
Running this script prints the tensor before and after the ELU operator.
### Example Output
```bash
graph(%self : __torch__.Elu,
      %x.1 : Tensor):
  %2 : __torch__.torch.nn.modules.activation.ELU = prim::GetAttr[name="elu"](%self)
  %4 : Tensor = prim::CallMethod[name="forward"](%2, %x.1) # elu_converter_test.py:13:15
  return (%4)

tensor([[ 1.3482,  1.9848, -1.0818, -1.3252,  0.2470,  0.7011,  0.3174, -1.8349,
          0.3024, -0.0453, -0.0681, -1.7377,  1.5909,  0.2549, -0.3029,  0.2583,
          0.0242,  2.0748, -0.5454,  0.7137,  1.6688,  0.7108, -0.8681,  0.2486,
         -1.3981,  1.0241,  1.2413,  0.2725,  1.4265,  0.9329,  0.4020, -2.6813]])
tensor([[ 1.3486,  1.9844, -0.6611, -0.7344,  0.2471,  0.7012,  0.3174, -0.8403,
          0.3025, -0.0443, -0.0659, -0.8242,  1.5908,  0.2549, -0.2615,  0.2583,
          0.0242,  2.0742, -0.4204,  0.7139,  1.6689,  0.7109, -0.5801,  0.2485,
         -0.7529,  1.0244,  1.2412,  0.2725,  1.4268,  0.9331,  0.4021, -0.9316]],
       device='cuda:0', dtype=torch.float16)
```