Megvii-BaseDetection
diff --git a/README.md b/README.md
Lines changed: 89 additions & 0 deletions
diff --git a/demo/introduce.png b/demo/introduce.png
640 KB
diff --git a/furnace/__init__.py b/furnace/__init__.py
diff --git a/furnace/base_model/__init__.py b/furnace/base_model/__init__.py
Lines changed: 2 additions & 0 deletions
diff --git a/furnace/base_model/resnet.py b/furnace/base_model/resnet.py
Lines changed: 279 additions & 0 deletions
@@ -0,0 +1,89 @@
+# TreeFilter-Torch
+This project provides a CUDA implementation of "[Learnable Tree Filter for Structure-preserving
+Feature Transform](https://megvii-my.sharepoint.cn/:b:/g/personal/songlin_megvii_com/EfbrITIdvqBCu-SaW9gZOHQBFIkcIisB6-FyO9SzzrZyPQ?e=YI06YP)" for PyTorch. Several semantic segmentation experiments on PASCAL VOC 2012 and Cityscapes are reproduced to verify the effectiveness of the tree filtering module. Because the experiments in the paper were conducted with an internal framework, this project reimplements them in PyTorch and reports detailed comparisons below. In addition, many thanks to [TorchSeg](https://github.com/ycszen/TorchSeg).
+
+![introduce image](demo/introduce.png)
+
+## Prerequisites
+- PyTorch 1.2
+  - `sudo pip3 install torch torchvision`
+- Easydict
+  - `sudo pip3 install easydict`
+- Apex
+  - `https://nvidia.github.io/apex/index.html`
+- Ninja
+  - `sudo apt-get install ninja-build`
+- tqdm
+  - `sudo pip3 install tqdm`
+- Boost (optional, for the Prim and Kruskal algorithms)
+  - `sudo apt-get install libboost-dev`
+
+## Installation
+### Building from source
+- `git clone https://github.com/StevenGrove/TreeFilter-Seg`
+- `cd TreeFilter-Seg/furnace/kernels/lib_tree_filter`
+- `sudo python3 setup.py build develop`
+
+This project implements three well-known minimum spanning tree algorithms: Boruvka, Kruskal and Prim. The default is *Boruvka* because of its linear computational complexity on planar graphs. The algorithm can be changed in the source file "lib_tree_filter/src/mst/mst.cu".
+
+## Pretrained Model
+- ResNet-50 [GoogleDrive](https://drive.google.com/open?id=1tRO4SUL0rdjXbKcyp1CQ6SefkL9QtX1b)
+- ResNet-101 [GoogleDrive](https://drive.google.com/open?id=11t0f0FcLOPj7KvHYdIAGANNbbWU_fJ1d)
+
+## Performance and Benchmarks
+### Notes
+FCN-32d: FCN with decoder whose maximum stride is 32;  
+Extra: Global average pooling + ResBlock;  
+TF: Learnable tree filtering module;  
+SS: Single-scale;  
+MSF: Multi-scale + Flip.
+
+### PASCAL VOC 2012 *val* set
+ Methods | Backbone | mIoU (ss) | Acc (ss) | mIoU (msf) | Acc (msf) | Model 
+:--:|:--:|:--:|:--:|:--:|:--:|:--:
+ FCN-32d | R50_v1c | 71.82% | 93.62% | 73.96% | 94.14% |  [GoogleDrive](https://drive.google.com/open?id=1Wzdhfa1mh_JFcqvLKPs7dXWgkOCqTnoH)
+ FCN-32d+TF  | R50_v1c | 76.31%  | 94.57%  | 77.80%  | 94.96%  |  [GoogleDrive](https://drive.google.com/open?id=19wwP7KW8aCWjyd21zGLhrMz2g3lW9o9Z)
+ FCN-32d  | R101_v1c | 74.53% |  94.29% | 76.08% | 94.63% |  [GoogleDrive](https://drive.google.com/open?id=19HQYK5JMS2bw2CbkTmG0VypnYfT3p-NN)
+ FCN-32d+TF  | R101_v1c |  77.82% |  94.92% | 79.22%  | 95.22%  |  [GoogleDrive](https://drive.google.com/open?id=1HywWQn-sHR9iddHTiLyYHNsH3TClvQMo)
+ FCN-32d+Extra | R101_v1c | 78.04%  | 95.01%  | 79.69%  | 95.41%  |  [GoogleDrive](https://drive.google.com/open?id=1dzag3GVcY9k-6ExOb1B4zQqfmtt4PbBy)
+ FCN-32d+Extra+TF | R101_v1c | 79.81% |  95.38% | 80.97%  | 95.67%  |  [GoogleDrive](https://drive.google.com/open?id=1sfZyuL2pikmhWLRw9-XbrJpZJEbjawh6)
+  FCN-32d+Extra+TF<sup>*</sup> | R101_v1c | 80.32% | 95.66%  | 82.28%  | 96.01%  |  [GoogleDrive](https://drive.google.com/open?id=19FpTs6NtfJLsLwN_03U4A2zTpPSIAnfS)
+  
+ <sup>*</sup> further finetuned on the original train set
+
+### Cityscapes *val* set
+ Methods | Backbone | mIoU (ss) | Acc (ss) | mIoU (msf) | Acc (msf) | Model 
+:--:|:--:|:--:|:--:|:--:|:--:|:--:
+ FCN-32d+Extra | R101_v1c | 78.29%  | 96.09%  | 79.40% | 96.27%  |  [GoogleDrive](https://drive.google.com/open?id=1MT4-ZzuCTNgfpRHGG6fT4TkUVtFuuJO5)
+ FCN-32d+Extra+TF | R101_v1c | 79.58%  | 96.31%  | 80.85%  | 96.46%  |  [GoogleDrive](https://drive.google.com/open?id=1yXPEUrIZ1CfFk7-1YHgMlDNz81Fhp1kz)
+
+## Usage
+As in the original TorchSeg, distributed training is recommended on both a single machine and multiple machines.  
+For detailed usage, please refer to the [Training](https://github.com/ycszen/TorchSeg#Training) and [Inference](https://github.com/ycszen/TorchSeg#Inference) sections in TorchSeg.
+
+## To do
+- [ ] Experiments on ADE20K
+- [ ] Visualization of tree filter
+- [ ] Additional tasks
+  - [ ] Object detection
+  - [ ] Instance segmentation
+  - [ ] Optical flow
+
+## Citation
+
+Please cite the learnable tree filter in your publications if it helps your research. 
+
+```
+The preprint has been submitted to arXiv and is awaiting public release.
+```
+
+Please cite this project in your publications if it helps your research. 
+```
+@misc{treefilter-torch,
+  author =       {Song, Lin},
+  title =        {TreeFilter-Torch},
+  howpublished = {\url{https://github.com/StevenGrove/TreeFilter-Torch}},
+  year =         {2019}
+}
+```
+
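Note on the README's Installation section: it names Boruvka, Kruskal and Prim as the available minimum spanning tree algorithms. As a rough illustration of what such a step computes, the sketch below runs Kruskal's algorithm on a 4-connected pixel grid in plain Python. It is not the project's CUDA code in `lib_tree_filter`; the helper names `grid_edges` and `kruskal_mst`, the absolute-difference edge weight, and the toy 3x3 feature map are all assumptions made for illustration.

```python
def grid_edges(feat):
    """Build 4-connected grid edges as (weight, u, v) tuples.

    The weight is the absolute feature difference between neighbouring
    pixels (an illustrative choice, not necessarily the project's metric).
    """
    h, w = len(feat), len(feat[0])
    edges = []
    for r in range(h):
        for c in range(w):
            u = r * w + c
            if c + 1 < w:  # edge to the right neighbour
                edges.append((abs(feat[r][c] - feat[r][c + 1]), u, u + 1))
            if r + 1 < h:  # edge to the neighbour below
                edges.append((abs(feat[r][c] - feat[r + 1][c]), u, u + w))
    return edges


def kruskal_mst(num_nodes, edges):
    """Return the MST edges of a connected graph using union-find."""
    parent = list(range(num_nodes))

    def find(x):
        # Path halving keeps the union-find trees shallow.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    tree = []
    for weight, u, v in sorted(edges):
        root_u, root_v = find(u), find(v)
        if root_u != root_v:  # the edge joins two components, keep it
            parent[root_u] = root_v
            tree.append((weight, u, v))
    return tree


# Toy single-channel feature map with two visually distinct regions.
feat = [[0, 1, 9],
        [1, 1, 9],
        [8, 8, 9]]
tree = kruskal_mst(9, grid_edges(feat))
total_weight = sum(weight for weight, _, _ in tree)
```

On this toy input the tree keeps the cheap edges inside each region and crosses the region boundary only once, which is the structure-preserving property the tree filter relies on.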
@@ -0,0 +1,2 @@
+from .resnet import ResNet, resnet18, resnet34, resnet50, resnet101, resnet152
+from .xception import Xception, xception39
@@ -0,0 +1,279 @@
+import functools
+import torch.nn as nn
+import torch.nn.functional as F
+
+from utils.pyt_utils import load_model
+
+
+__all__ = ['ResNet', 'resnet18', 'resnet34', 'resnet50', 'resnet101',
+           'resnet152']
+
+
+def conv3x3(in_planes, out_planes, stride=1):
+    """3x3 convolution with padding"""
+    return nn.Conv2d(in_planes, out_planes, kernel_size=3, stride=stride,
+                     padding=1, bias=False)
+
+
+class BasicBlock(nn.Module):
+    """Residual block with two 3x3 convolutions (ResNet-18/34)."""
+    expansion = 1
+
+    def __init__(self, inplanes, planes, stride=1, norm_layer=None,
+                 bn_eps=1e-5, bn_momentum=0.1, downsample=None, inplace=True,
+                 has_relu=True):
+        super(BasicBlock, self).__init__()
+        self.conv1 = conv3x3(inplanes, planes, stride)
+        self.bn1 = norm_layer(planes, eps=bn_eps, momentum=bn_momentum)
+        self.relu = nn.ReLU(inplace=inplace)
+        self.relu_inplace = nn.ReLU(inplace=True)
+        self.conv2 = conv3x3(planes, planes)
+        self.bn2 = norm_layer(planes, eps=bn_eps, momentum=bn_momentum)
+        self.downsample = downsample
+        # Project the shortcut with a 1x1 conv when the channel count changes.
+        if downsample is None and inplanes != planes:
+            self.downsample = nn.Sequential(
+                nn.Conv2d(inplanes, planes,
+                          kernel_size=1, stride=stride, bias=False),
+                norm_layer(planes, eps=bn_eps, momentum=bn_momentum))
+        self.stride = stride
+        self.inplace = inplace
+        self.has_relu = has_relu
+
+    def forward(self, x):
+        residual = x
+
+        out = self.conv1(x)
+        out = self.bn1(out)
+        out = self.relu(out)
+
+        out = self.conv2(out)
+        out = self.bn2(out)
+
+        if self.downsample is not None:
+            residual = self.downsample(x)
+
+        if self.inplace:
+            out += residual
+        else:
+            out = out + residual
+
+        if self.has_relu:
+            out = self.relu_inplace(out)
+
+        return out
+
+class ResBlock(nn.Module):
+    """Bottleneck-style block whose middle width is out_channels // expansion."""
+    def __init__(self, in_channels, out_channels, stride=1,
+                 expansion=2, norm_layer=None, bn_eps=1e-5,
+                 bn_momentum=0.1, has_relu=True, has_bias=False):
+        super(ResBlock, self).__init__()
+        self.has_relu = has_relu
+        mid_channels = out_channels // expansion
+        self.conv1 = nn.Conv2d(in_channels, mid_channels, kernel_size=1, bias=has_bias)
+        self.bn1 = norm_layer(mid_channels, eps=bn_eps, momentum=bn_momentum)
+        self.conv2 = nn.Conv2d(mid_channels, mid_channels, kernel_size=3, stride=stride,
+                               padding=1, bias=has_bias)
+        self.bn2 = norm_layer(mid_channels, eps=bn_eps, momentum=bn_momentum)
+        self.conv3 = nn.Conv2d(mid_channels, out_channels, kernel_size=1, bias=has_bias)
+        self.bn3 = norm_layer(out_channels, eps=bn_eps, momentum=bn_momentum)
+        if in_channels != out_channels:
+            self.down_sampler = nn.Sequential(
+                nn.Conv2d(in_channels, out_channels, kernel_size=1, stride=stride, bias=False),
+                norm_layer(out_channels, eps=bn_eps, momentum=bn_momentum))
+        else:
+            self.down_sampler = None
+
+    def forward(self, x):
+        residual = x
+
+        out = self.conv1(x)
+        out = self.bn1(out)
+        out = F.relu(out)
+
+        out = self.conv2(out)
+        out = self.bn2(out)
+        out = F.relu(out)
+
+        out = self.conv3(out)
+        out = self.bn3(out)
+
+        if self.down_sampler is not None:
+            residual = self.down_sampler(x)
+
+        out += residual
+        if self.has_relu:
+            out = F.relu(out, inplace=True)
+
+        return out
+
+class Bottleneck(nn.Module):
+    """1x1 -> 3x3 -> 1x1 bottleneck residual block (ResNet-50/101/152)."""
+    expansion = 4
+
+    def __init__(self, inplanes, planes, stride=1,
+                 norm_layer=None, bn_eps=1e-5, bn_momentum=0.1,
+                 downsample=None, inplace=True, has_relu=True):
+        super(Bottleneck, self).__init__()
+        self.conv1 = nn.Conv2d(inplanes, planes, kernel_size=1, bias=False)
+        self.bn1 = norm_layer(planes, eps=bn_eps, momentum=bn_momentum)
+        self.conv2 = nn.Conv2d(planes, planes, kernel_size=3, stride=stride,
+                               padding=1, bias=False)
+        self.bn2 = norm_layer(planes, eps=bn_eps, momentum=bn_momentum)
+        self.conv3 = nn.Conv2d(planes, planes * self.expansion, kernel_size=1,
+                               bias=False)
+        self.bn3 = norm_layer(planes * self.expansion, eps=bn_eps,
+                              momentum=bn_momentum)
+        self.has_relu = has_relu
+        self.relu = nn.ReLU(inplace=inplace)
+        self.relu_inplace = nn.ReLU(inplace=True)
+        self.downsample = downsample
+        self.stride = stride
+        self.inplace = inplace
+
+    def forward(self, x):
+        residual = x
+
+        out = self.conv1(x)
+        out = self.bn1(out)
+        out = self.relu(out)
+
+        out = self.conv2(out)
+        out = self.bn2(out)
+        out = self.relu(out)
+
+        out = self.conv3(out)
+        out = self.bn3(out)
+
+        if self.downsample is not None:
+            residual = self.downsample(x)
+
+        if self.inplace:
+            out += residual
+        else:
+            out = out + residual
+        if self.has_relu:
+            out = self.relu_inplace(out)
+
+        return out
+
+
+class ResNet(nn.Module):
+    """ResNet backbone whose forward pass returns the four stage outputs."""
+
+    def __init__(self, block, layers, norm_layer=nn.BatchNorm2d, bn_eps=1e-5,
+                 bn_momentum=0.1, deep_stem=False, stem_width=32, inplace=True):
+        super(ResNet, self).__init__()
+        self.inplanes = stem_width * 2 if deep_stem else 64
+        if deep_stem:
+            self.conv1 = nn.Sequential(
+                nn.Conv2d(3, stem_width, kernel_size=3, stride=2, padding=1,
+                          bias=False),
+                norm_layer(stem_width, eps=bn_eps, momentum=bn_momentum),
+                nn.ReLU(inplace=inplace),
+                nn.Conv2d(stem_width, stem_width, kernel_size=3, stride=1,
+                          padding=1,
+                          bias=False),
+                norm_layer(stem_width, eps=bn_eps, momentum=bn_momentum),
+                nn.ReLU(inplace=inplace),
+                nn.Conv2d(stem_width, stem_width * 2, kernel_size=3, stride=1,
+                          padding=1,
+                          bias=False),
+            )
+        else:
+            self.conv1 = nn.Conv2d(3, 64, kernel_size=7, stride=2, padding=3,
+                                   bias=False)
+
+        self.bn1 = norm_layer(stem_width * 2 if deep_stem else 64, eps=bn_eps,
+                              momentum=bn_momentum)
+        self.relu = nn.ReLU(inplace=inplace)
+        self.maxpool = nn.MaxPool2d(kernel_size=3, stride=2, padding=1)
+        self.layer1 = self._make_layer(block, norm_layer, 64, layers[0],
+                                       inplace,
+                                       bn_eps=bn_eps, bn_momentum=bn_momentum)
+        self.layer2 = self._make_layer(block, norm_layer, 128, layers[1],
+                                       inplace, stride=2,
+                                       bn_eps=bn_eps, bn_momentum=bn_momentum)
+        self.layer3 = self._make_layer(block, norm_layer, 256, layers[2],
+                                       inplace, stride=2,
+                                       bn_eps=bn_eps, bn_momentum=bn_momentum)
+        self.layer4 = self._make_layer(block, norm_layer, 512, layers[3],
+                                       inplace, stride=2,
+                                       bn_eps=bn_eps, bn_momentum=bn_momentum)
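+        # Output channels of layer1..layer4 depend on the block's expansion.
+        self.layer_channel_nums = tuple(
+            planes * block.expansion for planes in (64, 128, 256, 512))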
+
+    def _make_layer(self, block, norm_layer, planes, blocks, inplace=True,
+                    stride=1, bn_eps=1e-5, bn_momentum=0.1):
+        downsample = None
+        if stride != 1 or self.inplanes != planes * block.expansion:
+            downsample = nn.Sequential(
+                nn.Conv2d(self.inplanes, planes * block.expansion,
+                          kernel_size=1, stride=stride, bias=False),
+                norm_layer(planes * block.expansion, eps=bn_eps,
+                           momentum=bn_momentum),
+            )
+
+        layers = []
+        layers.append(block(self.inplanes, planes, stride, norm_layer, bn_eps,
+                            bn_momentum, downsample, inplace))
+        self.inplanes = planes * block.expansion
+        for i in range(1, blocks):
+            layers.append(block(self.inplanes, planes,
+                                norm_layer=norm_layer, bn_eps=bn_eps,
+                                bn_momentum=bn_momentum, inplace=inplace))
+
+        return nn.Sequential(*layers)
+
+    def forward(self, x):
+        x = self.conv1(x)
+        x = self.bn1(x)
+        x = self.relu(x)
+        x = self.maxpool(x)
+
+        blocks = []
+        x = self.layer1(x)
+        blocks.append(x)
+        x = self.layer2(x)
+        blocks.append(x)
+        x = self.layer3(x)
+        blocks.append(x)
+        x = self.layer4(x)
+        blocks.append(x)
+
+        return blocks
+
+
+def resnet18(pretrained_model=None, **kwargs):
+    model = ResNet(BasicBlock, [2, 2, 2, 2], **kwargs)
+
+    if pretrained_model is not None:
+        model = load_model(model, pretrained_model)
+    return model
+
+
+def resnet34(pretrained_model=None, **kwargs):
+    model = ResNet(BasicBlock, [3, 4, 6, 3], **kwargs)
+
+    if pretrained_model is not None:
+        model = load_model(model, pretrained_model)
+    return model
+
+
+def resnet50(pretrained_model=None, **kwargs):
+    model = ResNet(Bottleneck, [3, 4, 6, 3], **kwargs)
+
+    if pretrained_model is not None:
+        model = load_model(model, pretrained_model)
+    return model
+
+
+def resnet101(pretrained_model=None, **kwargs):
+    model = ResNet(Bottleneck, [3, 4, 23, 3], **kwargs)
+
+    if pretrained_model is not None:
+        model = load_model(model, pretrained_model)
+    return model
+
+
+def resnet152(pretrained_model=None, **kwargs):
+    model = ResNet(Bottleneck, [3, 8, 36, 3], **kwargs)
+
+    if pretrained_model is not None:
+        model = load_model(model, pretrained_model)
+    return model
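Note on `ResNet.forward` above: it returns the outputs of `layer1` to `layer4`, so downstream heads need the channel count and spatial size of each stage. The plain-Python sketch below redoes that arithmetic from the kernel sizes, strides and paddings visible in the diff, using the standard convolution output-size formula. The helper names `conv_out` and `stage_shapes` are hypothetical and not part of the repository, and square inputs are assumed.

```python
def conv_out(size, kernel, stride, padding):
    """Output length of a conv/pool layer along one spatial dimension."""
    return (size + 2 * padding - kernel) // stride + 1


def stage_shapes(input_size, expansion, deep_stem=False):
    """Return [(channels, spatial_size), ...] for layer1..layer4.

    `expansion` is 1 for BasicBlock and 4 for Bottleneck, matching the
    `expansion` class attributes in the diff.
    """
    # Stem: one stride-2 conv (3x3/pad 1 in the deep stem, else 7x7/pad 3).
    # The remaining deep-stem convs use stride 1 and keep the size.
    if deep_stem:
        size = conv_out(input_size, kernel=3, stride=2, padding=1)
    else:
        size = conv_out(input_size, kernel=7, stride=2, padding=3)
    # Max pooling: 3x3, stride 2, padding 1.
    size = conv_out(size, kernel=3, stride=2, padding=1)

    shapes = []
    for planes, stride in ((64, 1), (128, 2), (256, 2), (512, 2)):
        # Only the first block of a stage is strided, in a 3x3/pad-1 conv.
        size = conv_out(size, kernel=3, stride=stride, padding=1)
        shapes.append((planes * expansion, size))
    return shapes


bottleneck_shapes = stage_shapes(224, expansion=4)
basic_shapes = stage_shapes(224, expansion=1)
```

For a 224x224 input the last stage has stride 32 in both cases, which matches the "FCN-32d" naming used in the README notes.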