"benchmarking/switchback/make_plot_with_jsonl.py" did not exist on "7f87ba83eeae47bfb6d092007dd6ca42fff6c57a"
Unverified Commit 819b2e9f authored by Robylyon93's avatar Robylyon93 Committed by GitHub
Browse files

docs for faster+mask rcnn coords is clearer (#1868)


Co-authored-by: default avatarrvirgolireply <51229032+rvirgolireply@users.noreply.github.com>
parent 1e27b533
...@@ -32,8 +32,8 @@ class FasterRCNN(GeneralizedRCNN): ...@@ -32,8 +32,8 @@ class FasterRCNN(GeneralizedRCNN):
During training, the model expects both the input tensors, as well as a targets (list of dictionary), During training, the model expects both the input tensors, as well as a targets (list of dictionary),
containing: containing:
- boxes (FloatTensor[N, 4]): the ground-truth boxes in [x1, y1, x2, y2] format, with values - boxes (FloatTensor[N, 4]): the ground-truth boxes in [x1, y1, x2, y2] format, with values of x
between 0 and H and 0 and W between 0 and W and values of y between 0 and H
- labels (Int64Tensor[N]): the class label for each ground-truth box - labels (Int64Tensor[N]): the class label for each ground-truth box
The model returns a Dict[Tensor] during training, containing the classification and regression The model returns a Dict[Tensor] during training, containing the classification and regression
...@@ -42,8 +42,8 @@ class FasterRCNN(GeneralizedRCNN): ...@@ -42,8 +42,8 @@ class FasterRCNN(GeneralizedRCNN):
During inference, the model requires only the input tensors, and returns the post-processed During inference, the model requires only the input tensors, and returns the post-processed
predictions as a List[Dict[Tensor]], one for each input image. The fields of the Dict are as predictions as a List[Dict[Tensor]], one for each input image. The fields of the Dict are as
follows: follows:
- boxes (FloatTensor[N, 4]): the predicted boxes in [x1, y1, x2, y2] format, with values between - boxes (FloatTensor[N, 4]): the predicted boxes in [x1, y1, x2, y2] format, with values of x
0 and H and 0 and W between 0 and W and values of y between 0 and H
- labels (Int64Tensor[N]): the predicted labels for each image - labels (Int64Tensor[N]): the predicted labels for each image
- scores (Tensor[N]): the scores or each prediction - scores (Tensor[N]): the scores or each prediction
...@@ -300,8 +300,8 @@ def fasterrcnn_resnet50_fpn(pretrained=False, progress=True, ...@@ -300,8 +300,8 @@ def fasterrcnn_resnet50_fpn(pretrained=False, progress=True,
During training, the model expects both the input tensors, as well as a targets (list of dictionary), During training, the model expects both the input tensors, as well as a targets (list of dictionary),
containing: containing:
- boxes (``FloatTensor[N, 4]``): the ground-truth boxes in ``[x1, y1, x2, y2]`` format, with values - boxes (``FloatTensor[N, 4]``): the ground-truth boxes in ``[x1, y1, x2, y2]`` format, with values of ``x``
between ``0`` and ``H`` and ``0`` and ``W`` between ``0`` and ``W`` and values of ``y`` between ``0`` and ``H``
- labels (``Int64Tensor[N]``): the class label for each ground-truth box - labels (``Int64Tensor[N]``): the class label for each ground-truth box
The model returns a ``Dict[Tensor]`` during training, containing the classification and regression The model returns a ``Dict[Tensor]`` during training, containing the classification and regression
...@@ -310,8 +310,8 @@ def fasterrcnn_resnet50_fpn(pretrained=False, progress=True, ...@@ -310,8 +310,8 @@ def fasterrcnn_resnet50_fpn(pretrained=False, progress=True,
During inference, the model requires only the input tensors, and returns the post-processed During inference, the model requires only the input tensors, and returns the post-processed
predictions as a ``List[Dict[Tensor]]``, one for each input image. The fields of the ``Dict`` are as predictions as a ``List[Dict[Tensor]]``, one for each input image. The fields of the ``Dict`` are as
follows: follows:
- boxes (``FloatTensor[N, 4]``): the predicted boxes in ``[x1, y1, x2, y2]`` format, with values between - boxes (``FloatTensor[N, 4]``): the predicted boxes in ``[x1, y1, x2, y2]`` format, with values of ``x``
``0`` and ``H`` and ``0`` and ``W`` between ``0`` and ``W`` and values of ``y`` between ``0`` and ``H``
- labels (``Int64Tensor[N]``): the predicted labels for each image - labels (``Int64Tensor[N]``): the predicted labels for each image
- scores (``Tensor[N]``): the scores or each prediction - scores (``Tensor[N]``): the scores or each prediction
......
...@@ -28,8 +28,8 @@ class MaskRCNN(FasterRCNN): ...@@ -28,8 +28,8 @@ class MaskRCNN(FasterRCNN):
During training, the model expects both the input tensors, as well as a targets (list of dictionary), During training, the model expects both the input tensors, as well as a targets (list of dictionary),
containing: containing:
- boxes (FloatTensor[N, 4]): the ground-truth boxes in [x1, y1, x2, y2] format, with values - boxes (FloatTensor[N, 4]): the ground-truth boxes in [x1, y1, x2, y2] format, with values of x
between 0 and H and 0 and W between 0 and W and values of y between 0 and H
- labels (Int64Tensor[N]): the class label for each ground-truth box - labels (Int64Tensor[N]): the class label for each ground-truth box
- masks (UInt8Tensor[N, H, W]): the segmentation binary masks for each instance - masks (UInt8Tensor[N, H, W]): the segmentation binary masks for each instance
...@@ -39,8 +39,8 @@ class MaskRCNN(FasterRCNN): ...@@ -39,8 +39,8 @@ class MaskRCNN(FasterRCNN):
During inference, the model requires only the input tensors, and returns the post-processed During inference, the model requires only the input tensors, and returns the post-processed
predictions as a List[Dict[Tensor]], one for each input image. The fields of the Dict are as predictions as a List[Dict[Tensor]], one for each input image. The fields of the Dict are as
follows: follows:
- boxes (FloatTensor[N, 4]): the predicted boxes in [x1, y1, x2, y2] format, with values between - boxes (FloatTensor[N, 4]): the predicted boxes in [x1, y1, x2, y2] format, with values of x
0 and H and 0 and W between 0 and W and values of y between 0 and H
- labels (Int64Tensor[N]): the predicted labels for each image - labels (Int64Tensor[N]): the predicted labels for each image
- scores (Tensor[N]): the scores or each prediction - scores (Tensor[N]): the scores or each prediction
- masks (UInt8Tensor[N, 1, H, W]): the predicted masks for each instance, in 0-1 range. In order to - masks (UInt8Tensor[N, 1, H, W]): the predicted masks for each instance, in 0-1 range. In order to
...@@ -276,8 +276,8 @@ def maskrcnn_resnet50_fpn(pretrained=False, progress=True, ...@@ -276,8 +276,8 @@ def maskrcnn_resnet50_fpn(pretrained=False, progress=True,
During training, the model expects both the input tensors, as well as a targets (list of dictionary), During training, the model expects both the input tensors, as well as a targets (list of dictionary),
containing: containing:
- boxes (``FloatTensor[N, 4]``): the ground-truth boxes in ``[x1, y1, x2, y2]`` format, with values - boxes (``FloatTensor[N, 4]``): the ground-truth boxes in ``[x1, y1, x2, y2]`` format, with values of ``x``
between ``0`` and ``H`` and ``0`` and ``W`` between ``0`` and ``W`` and values of ``y`` between ``0`` and ``H``
- labels (``Int64Tensor[N]``): the class label for each ground-truth box - labels (``Int64Tensor[N]``): the class label for each ground-truth box
- masks (``UInt8Tensor[N, H, W]``): the segmentation binary masks for each instance - masks (``UInt8Tensor[N, H, W]``): the segmentation binary masks for each instance
...@@ -287,8 +287,8 @@ def maskrcnn_resnet50_fpn(pretrained=False, progress=True, ...@@ -287,8 +287,8 @@ def maskrcnn_resnet50_fpn(pretrained=False, progress=True,
During inference, the model requires only the input tensors, and returns the post-processed During inference, the model requires only the input tensors, and returns the post-processed
predictions as a ``List[Dict[Tensor]]``, one for each input image. The fields of the ``Dict`` are as predictions as a ``List[Dict[Tensor]]``, one for each input image. The fields of the ``Dict`` are as
follows: follows:
- boxes (``FloatTensor[N, 4]``): the predicted boxes in ``[x1, y1, x2, y2]`` format, with values between - boxes (``FloatTensor[N, 4]``): the predicted boxes in ``[x1, y1, x2, y2]`` format, with values of ``x``
``0`` and ``H`` and ``0`` and ``W`` between ``0`` and ``W`` and values of ``y`` between ``0`` and ``H``
- labels (``Int64Tensor[N]``): the predicted labels for each image - labels (``Int64Tensor[N]``): the predicted labels for each image
- scores (``Tensor[N]``): the scores or each prediction - scores (``Tensor[N]``): the scores or each prediction
- masks (``UInt8Tensor[N, 1, H, W]``): the predicted masks for each instance, in ``0-1`` range. In order to - masks (``UInt8Tensor[N, 1, H, W]``): the predicted masks for each instance, in ``0-1`` range. In order to
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment