@@ -27,8 +27,8 @@ class KeypointRCNN(FasterRCNN):
27
27
During training, the model expects both the input tensors, as well as a targets (list of dictionary),
28
28
containing:
29
29
30
- - boxes (FloatTensor[N, 4]): the ground-truth boxes in [x1, y1, x2, y2] format, with values of x
31
- between 0 and W and values of y between 0 and H
30
+ - boxes (`` FloatTensor[N, 4]`` ): the ground-truth boxes in `` [x1, y1, x2, y2]`` format, with
31
+ ``0 <= x1 < x2 <= W`` and ``0 <= y1 < y2 <= H``.
32
32
- labels (Int64Tensor[N]): the class label for each ground-truth box
33
33
- keypoints (FloatTensor[N, K, 3]): the K keypoints location for each of the N instances, in the
34
34
format [x, y, visibility], where visibility=0 means that the keypoint is not visible.
@@ -40,8 +40,8 @@ class KeypointRCNN(FasterRCNN):
40
40
predictions as a List[Dict[Tensor]], one for each input image. The fields of the Dict are as
41
41
follows:
42
42
43
- - boxes (FloatTensor[N, 4]): the predicted boxes in [x1, y1, x2, y2] format, with values of x
44
- between 0 and W and values of y between 0 and H
43
+ - boxes (`` FloatTensor[N, 4]`` ): the predicted boxes in `` [x1, y1, x2, y2]`` format, with
44
+ ``0 <= x1 < x2 <= W`` and ``0 <= y1 < y2 <= H``.
45
45
- labels (Int64Tensor[N]): the predicted labels for each image
46
46
- scores (Tensor[N]): the scores or each prediction
47
47
- keypoints (FloatTensor[N, K, 3]): the locations of the predicted keypoints, in [x, y, v] format.
@@ -286,8 +286,8 @@ def keypointrcnn_resnet50_fpn(pretrained=False, progress=True,
286
286
During training, the model expects both the input tensors, as well as a targets (list of dictionary),
287
287
containing:
288
288
289
- - boxes (``FloatTensor[N, 4]``): the ground-truth boxes in ``[x1, y1, x2, y2]`` format, with values of ``x``
290
- between ``0`` and ``W`` and values of ``y`` between ``0`` and `` H``
289
+ - boxes (``FloatTensor[N, 4]``): the ground-truth boxes in ``[x1, y1, x2, y2]`` format, with
290
+ ``0 <= x1 < x2 <= W`` and ``0 <= y1 < y2 <= H``.
291
291
- labels (``Int64Tensor[N]``): the class label for each ground-truth box
292
292
- keypoints (``FloatTensor[N, K, 3]``): the ``K`` keypoints location for each of the ``N`` instances, in the
293
293
format ``[x, y, visibility]``, where ``visibility=0`` means that the keypoint is not visible.
@@ -299,8 +299,8 @@ def keypointrcnn_resnet50_fpn(pretrained=False, progress=True,
299
299
predictions as a ``List[Dict[Tensor]]``, one for each input image. The fields of the ``Dict`` are as
300
300
follows:
301
301
302
- - boxes (``FloatTensor[N, 4]``): the predicted boxes in ``[x1, y1, x2, y2]`` format, with values of ``x``
303
- between ``0`` and ``W`` and values of ``y`` between ``0`` and `` H``
302
+ - boxes (``FloatTensor[N, 4]``): the predicted boxes in ``[x1, y1, x2, y2]`` format, with
303
+ ``0 <= x1 < x2 <= W`` and ``0 <= y1 < y2 <= H``.
304
304
- labels (``Int64Tensor[N]``): the predicted labels for each image
305
305
- scores (``Tensor[N]``): the scores or each prediction
306
306
- keypoints (``FloatTensor[N, K, 3]``): the locations of the predicted keypoints, in ``[x, y, v]`` format.
0 commit comments