@@ -7,28 +7,35 @@ NVIDIA-accelerated, deep learned model support for object detection including De
77
88## Overview
99
10- [ Isaac ROS Object Detection] ( https://github.com/NVIDIA-ISAAC-ROS/isaac_ros_object_detection ) contains an ROS 2 package to perform object
11- detection. ` isaac_ros_detectnet ` provides a method for spatial
10+ [ Isaac ROS Object Detection] ( https://github.com/NVIDIA-ISAAC-ROS/isaac_ros_object_detection ) contains ROS 2 packages to perform object
11+ detection.
12+ ` isaac_ros_rtdetr ` , ` isaac_ros_detectnet ` , and ` isaac_ros_yolov8 ` each provide a method for spatial
1213classification using bounding boxes with an input image. Classification
13- is performed by a GPU-accelerated
14- [ DetectNet] ( https://catalog.ngc.nvidia.com/orgs/nvidia/teams/tao/models/pretrained_detectnet_v2 )
15- model. The output prediction can be used by perception functions to
14+ is performed by a GPU-accelerated model of the appropriate architecture:
15+
16+ - ` isaac_ros_rtdetr ` : [ RT-DETR models] ( https://nvidia-isaac-ros.github.io/concepts/object_detection/rtdetr/index.html )
17+ - ` isaac_ros_detectnet ` : [ DetectNet models] ( https://nvidia-isaac-ros.github.io/concepts/object_detection/detectnet/index.html )
18+ - ` isaac_ros_yolov8 ` : [ YOLOv8 models] ( https://nvidia-isaac-ros.github.io/concepts/object_detection/yolov8/index.html )
19+
20+ The output prediction can be used by perception functions to
1621understand the presence and spatial location of an object in an image.
1722
1823<div align =" center " ><a class =" reference internal image-reference " href =" https://media.githubusercontent.com/media/NVIDIA-ISAAC-ROS/.github/main/resources/isaac_ros_docs/repositories_and_packages/isaac_ros_object_detection/isaac_ros_object_detection_nodegraph.png/ " ><img alt =" image " src =" https://media.githubusercontent.com/media/NVIDIA-ISAAC-ROS/.github/main/resources/isaac_ros_docs/repositories_and_packages/isaac_ros_object_detection/isaac_ros_object_detection_nodegraph.png/ " width =" 800px " /></a ></div >
1924
20- ` isaac_ros_detectnet ` is used in a graph of nodes to provide a
25+ Each Isaac ROS Object Detection package is used in a graph of nodes to provide a
2126bounding box detection array with object classes from an input image. A
22- [ DetectNet ] ( https://catalog.ngc.nvidia.com/orgs/nvidia/teams/tao/models/pretrained_detectnet_v2 )
23- model is required to produce the detection array. Input images may need
24- to be cropped and resized to maintain the aspect ratio and match the
25- input resolution of DetectNet ; image resolution may be reduced to
27+ trained model of the appropriate architecture is required to produce the detection array.
28+
29+ Input images may need to be cropped and resized to maintain the aspect ratio and match the
30+ input resolution of the specific object detection model ; image resolution may be reduced to
2631improve DNN inference performance, which typically scales directly with
2732the number of pixels in the image. ` isaac_ros_dnn_image_encoder `
28- provides a DNN encoder to process the input image into Tensors for the
29- DetectNet model. Prediction results are clustered in the DNN decoder to
30- group multiple detections on the same object. Output is provided as a
31- detection array with object classes.
33+ provides DNN encoder utilities to process the input image into Tensors for the
34+ object detection models.
35+ Prediction results are decoded in model-specific ways,
36+ often involving clustering and thresholding to group multiple detections
37+ on the same object and reduce spurious detections.
38+ Output is provided as a detection array with object classes.
3239
3340DNNs have a minimum number of pixels that need to be visible on the
3441object to provide a classification prediction. If a person cannot see
@@ -41,10 +48,6 @@ image, which provides 1K pixels for the person, and fail to detect the
4148same person in the downscaled resolution, which only provides 0.25K
4249pixels for the person.
4350
44- > [ !Note]
45- > DetectNet is similar to other popular object detection
46- > models such as YOLOV3, FasterRCNN, and SSD, while being efficient at
47- > detecting multiple object classes in large images.
4851<div align =" center " ><a class =" reference internal image-reference " href =" https://media.githubusercontent.com/media/NVIDIA-ISAAC-ROS/.github/main/resources/isaac_ros_docs/repositories_and_packages/isaac_ros_object_detection/isaac_ros_object_detection_example_bboxseg.png/ " ><img alt =" image " src =" https://media.githubusercontent.com/media/NVIDIA-ISAAC-ROS/.github/main/resources/isaac_ros_docs/repositories_and_packages/isaac_ros_object_detection/isaac_ros_object_detection_example_bboxseg.png/ " width =" 800px " /></a ></div >
4952
5053Object detection classifies a rectangle of pixels as containing an
0 commit comments