|
1 | 1 | # TF-Vision Model Garden
|
2 | 2 |
|
| 3 | +⚠️ Disclaimer: All datasets hyperlinked from this page are not owned or |
| 4 | +distributed by Google. The dataset is made available by third parties. |
| 5 | +Please review the terms and conditions made available by the third parties |
| 6 | +before using the data. |
| 7 | + |
3 | 8 | ## Introduction
|
4 | 9 |
|
5 | 10 | TF-Vision modeling library for computer vision provides a collection of
|
@@ -59,11 +64,12 @@ depth, label smoothing and dropout.
|
59 | 64 | * [RetinaNet](https://arxiv.org/abs/1708.02002) and [RetinaNet-RS](https://arxiv.org/abs/2107.00057)
|
60 | 65 | * [Mask R-CNN](https://arxiv.org/abs/1703.06870)
|
61 | 66 | * [Cascade RCNN](https://arxiv.org/abs/1712.00726) and [Cascade RCNN-RS](https://arxiv.org/abs/2107.00057)
|
62 |
| - |
63 |
| -* Models are all trained on COCO train2017 and evaluated on COCO val2017. |
| 67 | +* Models are all trained on [COCO](https://cocodataset.org/) train2017 and |
| 68 | +evaluated on [COCO](https://cocodataset.org/) val2017. |
64 | 69 | * Training details:
|
65 |
| - * Models finetuned from ImageNet pretrained checkpoints adopt the 12 or 36 |
66 |
| - epochs schedule. Models trained from scratch adopt the 350 epochs schedule. |
| 70 | + * Models finetuned from [ImageNet](https://www.image-net.org/) pretrained |
| 71 | + checkpoints adopt the 12 or 36 epochs schedule. Models trained from scratch |
| 72 | + adopt the 350 epochs schedule. |
67 | 73 | * The default training data augmentation implements horizontal flipping and
|
68 | 74 | scale jittering with a random scale between [0.5, 2.0].
|
69 | 75 | * Unless noted, all models are trained with l2 weight regularization and ReLU
|
@@ -106,18 +112,18 @@ depth, label smoothing and dropout.
|
106 | 112 |
|
107 | 113 | | Backbone | Resolution | Epochs | FLOPs (B) | Params (M) | Box AP | Mask AP | Download |
|
108 | 114 | | ------------ |:-------------:| -------:|-----------:|-----------:|-------:|--------:|---------:|
|
109 |
| -ResNet50-FPN | 640x640 | 350 | 227.7 | 46.3 | 42.3 | 37.6 | [config](https://github.com/tensorflow/models/blob/master/official/vision/beta/configs/experiments/maskrcnn/r50fpn_640_coco_scratch_tpu4x4.yaml) | |
| 115 | +| ResNet50-FPN | 640x640 | 350 | 227.7 | 46.3 | 42.3 | 37.6 | [config](https://github.com/tensorflow/models/blob/master/official/vision/beta/configs/experiments/maskrcnn/r50fpn_640_coco_scratch_tpu4x4.yaml) | |
110 | 116 | | SpineNet-49 | 640x640 | 350 | 215.7 | 40.8 | 42.6 | 37.9 | [config](https://github.com/tensorflow/models/blob/master/official/vision/beta/configs/experiments/maskrcnn/coco_spinenet49_mrcnn_tpu.yaml) |
|
111 |
| -SpineNet-96 | 1024x1024 | 500 | 315.0 | 55.2 | 48.1 | 42.4 | [config](https://github.com/tensorflow/models/blob/master/official/vision/beta/configs/experiments/maskrcnn/coco_spinenet96_mrcnn_tpu.yaml) | |
112 |
| -SpineNet-143 | 1280x1280 | 500 | 498.8 | 79.2 | 49.3 | 43.4 | [config](https://github.com/tensorflow/models/blob/master/official/vision/beta/configs/experiments/maskrcnn/coco_spinenet143_mrcnn_tpu.yaml) | |
| 117 | +| SpineNet-96 | 1024x1024 | 500 | 315.0 | 55.2 | 48.1 | 42.4 | [config](https://github.com/tensorflow/models/blob/master/official/vision/beta/configs/experiments/maskrcnn/coco_spinenet96_mrcnn_tpu.yaml) | |
| 118 | +| SpineNet-143 | 1280x1280 | 500 | 498.8 | 79.2 | 49.3 | 43.4 | [config](https://github.com/tensorflow/models/blob/master/official/vision/beta/configs/experiments/maskrcnn/coco_spinenet143_mrcnn_tpu.yaml) | |
113 | 119 |
|
114 | 120 |
|
115 | 121 | #### Cascade RCNN-RS (Trained from scratch)
|
116 | 122 |
|
117 |
| -backbone | resolution | epochs | params (M) | box AP | mask AP | download |
| 123 | +| Backbone | Resolution | Epochs | Params (M) | Box AP | Mask AP | Download |
118 | 124 | ------------ | :--------: | -----: | ---------: | -----: | ------: | -------:
|
119 |
| -SpineNet-49 | 640x640 | 500 | 56.4 | 46.4 | 40.0 | [config](https://github.com/tensorflow/models/blob/master/official/vision/beta/configs/experiments/maskrcnn/coco_spinenet49_cascadercnn_tpu.yaml)| |
120 |
| -SpineNet-143 | 1280x1280 | 500 | 94.9 | 51.9 | 45.0 | [config](https://github.com/tensorflow/models/blob/master/official/vision/beta/configs/experiments/maskrcnn/coco_spinenet143_cascadercnn_tpu.yaml)| |
| 125 | +| SpineNet-49 | 640x640 | 500 | 56.4 | 46.4 | 40.0 | [config](https://github.com/tensorflow/models/blob/master/official/vision/beta/configs/experiments/maskrcnn/coco_spinenet49_cascadercnn_tpu.yaml)| |
| 126 | +| SpineNet-143 | 1280x1280 | 500 | 94.9 | 51.9 | 45.0 | [config](https://github.com/tensorflow/models/blob/master/official/vision/beta/configs/experiments/maskrcnn/coco_spinenet143_cascadercnn_tpu.yaml)| |
121 | 127 |
|
122 | 128 | ## Semantic Segmentation
|
123 | 129 |
|
|
0 commit comments