HF spaces demos for mask-rcnn, faster-rcnn, yolov4, DUC, and FCN (#508)

AK391 · web-flow · commit 19e5ebdd5293 · 2022-03-27T20:10:50.000-07:00
* add Squeezenet HF space

Signed-off-by: AK391 <ahsen@huggingface.co>

* efficientnet v4 hf spaces link

Signed-off-by: AK391 <ahsen@huggingface.co>

* add Resnet HF spaces link

Signed-off-by: AK391 <ahsen@huggingface.co>

* add VGG HF spaces link

Signed-off-by: AK391 <ahsen@huggingface.co>

* add GoogleNet and AlexNet HF spaces links

Signed-off-by: AK391 <ahsen@huggingface.co>

* add inceptionv1 HF spaces link

Signed-off-by: AK391 <ahsen@huggingface.co>

* add CaffeNet HF spaces link

Signed-off-by: AK391 <ahsen@huggingface.co>

* ZFNet 512 HF spaces link

Signed-off-by: AK391 <ahsen@huggingface.co>

* add DenseNet HF space link

Signed-off-by: AK391 <ahsen@huggingface.co>

* add arcface hf space link

Signed-off-by: AK391 <ahsen@huggingface.co>

* add column header for HF

Signed-off-by: AK391 <ahsen@huggingface.co>

* add Ultraface HF spaces link

Signed-off-by: AK391 <ahsen@huggingface.co>

* add sub_pixel_cnn_2016 HF spaces link

Signed-off-by: AK391 <ahsen@huggingface.co>

* add column for HF spaces

Signed-off-by: AK391 <ahsen@huggingface.co>

* add T5 demo

Signed-off-by: AK391 <ahsen@huggingface.co>

* add BERT demo

Signed-off-by: AK391 <ahsen@huggingface.co>

* add RoBERTa demo

Signed-off-by: AK391 <ahsen@huggingface.co>

* add GPT-2 demo

Signed-off-by: AK391 <ahsen@huggingface.co>

* add BiDAF demo

Signed-off-by: AK391 <ahsen@huggingface.co>

* add DUC demo

Signed-off-by: AK391 <ahsen@huggingface.co>

* add Yolov4 demo

Signed-off-by: AK391 <ahsen@huggingface.co>

* add FCN demo

Signed-off-by: AK391 <ahsen@huggingface.co>

* add mask-rcnn

Signed-off-by: AK391 <ahsen@huggingface.co>

* add faster rcnn

Signed-off-by: AK391 <ahsen@huggingface.co>
diff --git a/README.md b/README.md
@@ -61,20 +61,20 @@ This subset of models classify images for specific domains and datasets.
 ### Object Detection & Image Segmentation <a name="object_detection"/>
 Object detection models detect the presence of multiple objects in an image and segment out areas of the image where the objects are detected. Semantic segmentation models partition an input image by labeling each pixel into a set of pre-defined categories.
 
-|Model Class |Reference |Description |
-|-|-|-|
+|Model Class |Reference |Description |Hugging Face Spaces |
+|-|-|-|-|
 |<b>[Tiny YOLOv2](vision/object_detection_segmentation/tiny-yolov2)</b>|[Redmon et al.](https://arxiv.org/pdf/1612.08242.pdf)|A real-time CNN for object detection that detects 20 different classes. A smaller version of the more complex full YOLOv2 network.|
 |<b>[SSD](vision/object_detection_segmentation/ssd)</b>|[Liu et al.](https://arxiv.org/abs/1512.02325)|Single Stage Detector: real-time CNN for object detection that detects 80 different classes.|
 |<b>[SSD-MobileNetV1](vision/object_detection_segmentation/ssd-mobilenetv1)</b>|[Howard et al.](https://arxiv.org/abs/1704.04861)|A variant of MobileNet that uses the Single Shot Detector (SSD) model framework. The model detects 80 different object classes and locates up to 10 objects in an image.|
-|<b>[Faster-RCNN](vision/object_detection_segmentation/faster-rcnn)</b>|[Ren et al.](https://arxiv.org/abs/1506.01497)|Increases efficiency from R-CNN by connecting a RPN with a CNN to create a single, unified network for object detection that detects 80 different classes.|
-|<b>[Mask-RCNN](vision/object_detection_segmentation/mask-rcnn)</b>|[He et al.](https://arxiv.org/abs/1703.06870)|A real-time neural network for object instance segmentation that detects 80 different classes. Extends Faster R-CNN as each of the 300 elected ROIs go through 3 parallel branches of the network: label prediction, bounding box prediction and mask prediction.|
+|<b>[Faster-RCNN](vision/object_detection_segmentation/faster-rcnn)</b>|[Ren et al.](https://arxiv.org/abs/1506.01497)|Increases efficiency from R-CNN by connecting a RPN with a CNN to create a single, unified network for object detection that detects 80 different classes.| [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/onnx/faster-rcnn) |
+|<b>[Mask-RCNN](vision/object_detection_segmentation/mask-rcnn)</b>|[He et al.](https://arxiv.org/abs/1703.06870)|A real-time neural network for object instance segmentation that detects 80 different classes. Extends Faster R-CNN as each of the 300 elected ROIs go through 3 parallel branches of the network: label prediction, bounding box prediction and mask prediction.| [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/onnx/mask-rcnn) |
 |<b>[RetinaNet](vision/object_detection_segmentation/retinanet)</b>|[Lin et al.](https://arxiv.org/abs/1708.02002)|A real-time dense detector network for object detection that addresses class imbalance through Focal Loss. RetinaNet is able to match the speed of previous one-stage detectors and defines the state-of-the-art in two-stage detectors (surpassing R-CNN).|
 |<b>[YOLO v2-coco](vision/object_detection_segmentation/yolov2-coco)</b>|[Redmon et al.](https://arxiv.org/abs/1612.08242)|A CNN model for real-time object detection system that can detect over 9000 object categories. It uses a single network evaluation, enabling it to be more than 1000x faster than R-CNN and 100x faster than Faster R-CNN. This model is trained with COCO dataset and contains 80 classes.
 |<b>[YOLO v3](vision/object_detection_segmentation/yolov3)</b>|[Redmon et al.](https://arxiv.org/pdf/1804.02767.pdf)|A deep CNN model for real-time object detection that detects 80 different classes. A little bigger than YOLOv2 but still very fast. As accurate as SSD but 3 times faster.|
 |<b>[Tiny YOLOv3](vision/object_detection_segmentation/tiny-yolov3)</b>|[Redmon et al.](https://arxiv.org/pdf/1804.02767.pdf)| A smaller version of YOLOv3 model. |
-|<b>[YOLOv4](vision/object_detection_segmentation/yolov4)</b>|[Bochkovskiy et al.](https://arxiv.org/abs/2004.10934)|Optimizes the speed and accuracy of object detection. Two times faster than EfficientDet. It improves YOLOv3's AP and FPS by 10% and 12%, respectively, with mAP50 of 52.32 on the COCO 2017 dataset and FPS of 41.7 on a Tesla V100.|
-|<b>[DUC](vision/object_detection_segmentation/duc)</b>|[Wang et al.](https://arxiv.org/abs/1702.08502)|Deep CNN based pixel-wise semantic segmentation model with >80% [mIOU](/models/semantic_segmentation/DUC/README.md/#metric) (mean Intersection Over Union). Trained on cityscapes dataset, which can be effectively implemented in self driving vehicle systems.|
-|<b>[FCN](vision/object_detection_segmentation/fcn)|[Long et al.](https://people.eecs.berkeley.edu/~jonlong/long_shelhamer_fcn.pdf)|Deep CNN based segmentation model trained end-to-end, pixel-to-pixel that produces efficient inference and learning. Built off of AlexNet, VGG net, GoogLeNet classification methods. <br>[contribute](contribute.md)|
+|<b>[YOLOv4](vision/object_detection_segmentation/yolov4)</b>|[Bochkovskiy et al.](https://arxiv.org/abs/2004.10934)|Optimizes the speed and accuracy of object detection. Two times faster than EfficientDet. It improves YOLOv3's AP and FPS by 10% and 12%, respectively, with mAP50 of 52.32 on the COCO 2017 dataset and FPS of 41.7 on a Tesla V100.| [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/onnx/yolov4) |
+|<b>[DUC](vision/object_detection_segmentation/duc)</b>|[Wang et al.](https://arxiv.org/abs/1702.08502)|Deep CNN based pixel-wise semantic segmentation model with >80% [mIOU](/models/semantic_segmentation/DUC/README.md/#metric) (mean Intersection Over Union). Trained on cityscapes dataset, which can be effectively implemented in self driving vehicle systems.| [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/onnx/DUC) |
+|<b>[FCN](vision/object_detection_segmentation/fcn)|[Long et al.](https://people.eecs.berkeley.edu/~jonlong/long_shelhamer_fcn.pdf)|Deep CNN based segmentation model trained end-to-end, pixel-to-pixel that produces efficient inference and learning. Built off of AlexNet, VGG net, GoogLeNet classification methods. <br>[contribute](contribute.md)| [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/onnx/FCN) |
 <hr>
 
 ### Body, Face & Gesture Analysis <a name="body_analysis"/>