
Commit 5cfd862

Author: Matevz Morato
Commit message: Add a porting guide and update README.md [no ci]
1 parent 39f2a7d · commit 5cfd862

File tree: 3 files changed, +222 −126 lines


README.md

Lines changed: 2 additions & 1 deletion
@@ -8,6 +8,7 @@ DepthAI library for interfacing with Luxonis DepthAI hardware.

 > ⚠️ This is a `v3.x.y` version of the library which is still in active development without a stable API yet.

+> ℹ️ For porting code from the `v2` version of the library, we recommend using the [porting guide](./V2V3PortinGuide.md)

 ## Documentation
 Documentation is available over at [Luxonis DepthAI API](https://stg.docs.luxonis.com/software/v3/)
@@ -16,7 +17,7 @@ Documentation is available over at [Luxonis DepthAI API](https://stg.docs.luxoni
 DepthAI library doesn't yet provide API stability guarantees. While we take care to properly deprecate old functions, some changes might still be breaking.

 ## Examples
-Examples for both C++ and Python are available in the `examples` folder. To get started with them see [README.md](./examples/README.md) for more information.
+Examples for both C++ and Python are available in the `examples` folder. To see how to build and run them, see [README.md](./examples/README.md) for more information.

 ## Dependencies
 - CMake >= 3.14 and <4.0

V2V3PortinGuide.md

Lines changed: 208 additions & 0 deletions
@@ -0,0 +1,208 @@
# DepthAI v2 → v3 Porting Guide

This document describes the changes between the v2 and v3 APIs of DepthAI and how to migrate existing code.

## What's new in the v3 API

* No more **explicit** XLink nodes – the XLink “bridges” are created automatically.
* Host nodes – nodes that run on the host machine now work cleanly with device-side nodes.
* Custom host nodes – users can create custom nodes that run on the host machine (see the sketch after this list).
  * Both `ThreadedHostNode` and `HostNode` are supported.
  * `ThreadedHostNode` works similarly to `ScriptNode`; the user specifies a `run` function that executes in a separate thread.
  * `HostNode` exposes an input map `inputs` whose entries are implicitly synced.
  * Available in both Python and C++.
* Record-and-replay nodes.
* `Pipeline` now has a live device that can be queried during pipeline creation.
* Support for the new **Model Zoo**.
* `ImageManip` has a refreshed API with better-defined behaviour.
* `ColorCamera` and `MonoCamera` are deprecated in favour of the new `Camera` node.
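
As a quick illustration of a custom host node, here is a minimal `ThreadedHostNode` sketch. It assumes the Python bindings let you subclass `dai.node.ThreadedHostNode`, create an input with `createInput()`, and override `run()` as described above; the `FrameLogger` name and its print-based logging are purely illustrative.

```python
import depthai as dai

class FrameLogger(dai.node.ThreadedHostNode):
    """Illustrative host node that logs every frame it receives."""

    def __init__(self):
        super().__init__()
        self.input = self.createInput()  # link any ImgFrame output here

    def run(self):
        # Runs on the host in its own thread, much like a Script node on-device.
        while self.isRunning():
            frame = self.input.get()  # blocking read
            print("Frame", frame.getSequenceNum())
```

Such a node would then be created with `pipeline.create(FrameLogger)` and linked like any built-in node.
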
---

## Minimal changes required

* Remove the explicit creation of `dai.Device` (unless you intentionally pass a live device handle via the pipeline constructor – a rare edge case).
* Remove explicit XLink nodes.
* Replace `dai.Device(pipeline)` with `pipeline.start()`.
* Replace any `.getOutputQueue()` calls with `output.createOutputQueue()`.
* Replace any `.getInputQueue()` calls with `input.createInputQueue()` (see the sketch after this list).
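
Input queues follow the same pattern as output queues. A minimal sketch (it assumes `manip` is an `ImageManip` created earlier in the pipeline; `inputConfig` is that node's config input):

```python
# Host-side queue feeding a node input.
configQueue = manip.inputConfig.createInputQueue()
pipeline.start()

cfg = dai.ImageManipConfig()
configQueue.send(cfg)  # push a message from the host into the pipeline
```
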
---

## Quick port: simple RGB stream example

Below, the old v2 code is commented with `# ORIG` and the new code with `# NEW`.

```python
#!/usr/bin/env python3

import cv2
import depthai as dai

# Create pipeline
pipeline = dai.Pipeline()

# Define source and output
camRgb = pipeline.create(dai.node.ColorCamera)

# ORIG – explicit XLink removed in v3
# xoutVideo = pipeline.create(dai.node.XLinkOut)
# xoutVideo.setStreamName("video")

# Properties
camRgb.setBoardSocket(dai.CameraBoardSocket.CAM_A)
camRgb.setResolution(dai.ColorCameraProperties.SensorResolution.THE_1080_P)
camRgb.setVideoSize(1920, 1080)

# Linking
# ORIG
# camRgb.video.link(xoutVideo.input)
# NEW – output queue straight from the node
videoQueue = camRgb.video.createOutputQueue()

# ORIG – entire `with dai.Device` block removed
# with dai.Device(pipeline) as device:
#     video = device.getOutputQueue(name="video", maxSize=1, blocking=False)
#     while True:
# NEW – start the pipeline
pipeline.start()
while pipeline.isRunning():
    videoIn = videoQueue.get()  # blocking
    cv2.imshow("video", videoIn.getCvFrame())
    if cv2.waitKey(1) == ord('q'):
        break
```

This runs on RVC2 devices. Note that `ColorCamera`/`MonoCamera` nodes are deprecated on RVC4; see the next section for using `Camera` instead.

---

## Porting `ColorCamera` / `MonoCamera` usage to `Camera`

The new `Camera` node can expose as many outputs as you request.

```python
camRgb = pipeline.create(dai.node.ColorCamera)
camRgb.setPreviewSize(300, 300)
camRgb.setInterleaved(False)
camRgb.setColorOrder(dai.ColorCameraProperties.ColorOrder.RGB)
outputQueue = camRgb.preview.createOutputQueue()
```

turns into

```python
camRgb = pipeline.create(dai.node.Camera).build()  # don’t forget .build()
cameraOutput = camRgb.requestOutput((300, 300), type=dai.ImgFrame.Type.RGB888p)  # replaces .preview
outputQueue = cameraOutput.createOutputQueue()
```

Request multiple outputs simply by calling `requestOutput` again, as sketched below. For full-resolution use-cases that previously used `.isp`, call `requestFullResolutionOutput()` instead.
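
For instance, a single `Camera` node could serve two consumers at different resolutions. A sketch (the 1920×1080 NV12 request is an assumed example mirroring the calls above):

```python
camRgb = pipeline.create(dai.node.Camera).build()
# Each requestOutput call adds an independent output with its own size and type.
previewOut = camRgb.requestOutput((300, 300), type=dai.ImgFrame.Type.RGB888p)
videoOut = camRgb.requestOutput((1920, 1080), type=dai.ImgFrame.Type.NV12)  # assumed type
previewQueue = previewOut.createOutputQueue()
videoQueue = videoOut.createOutputQueue()
```
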
For former `MonoCamera` pipelines, replace the `.out` output with `requestOutput`, e.g.

```python
mono = pipeline.create(dai.node.Camera).build()
monoOut = mono.requestOutput((1280, 720), type=dai.ImgFrame.Type.GRAY8)
```

---

## Porting the old `ImageManip` to the new API

The new API applies transformations in the order they are added and lets you state explicitly *how* the final image is resized; a short sketch follows.
See the [official documentation](https://docs.luxonis.com/software/v3/depthai-components/nodes/image_manip/) for full details.
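
As a sketch of the sequential style (assuming `setOutputSize` is the resize call; consult the linked documentation for the exact method names):

```python
manip = pipeline.create(dai.node.ImageManip)
manip.initialConfig.addCrop(0, 0, 500, 500)  # applied first
manip.initialConfig.setOutputSize(256, 256)  # the cropped result is then resized
```
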
### v2 example

```python
#!/usr/bin/env python3

import cv2
import depthai as dai

# Create pipeline
pipeline = dai.Pipeline()

camRgb = pipeline.create(dai.node.ColorCamera)
camRgb.setPreviewSize(1000, 500)
camRgb.setInterleaved(False)
maxFrameSize = camRgb.getPreviewHeight() * camRgb.getPreviewWidth() * 3

# In this example we use 2 imageManips for splitting the original 1000x500
# preview frame into 2 500x500 frames
manip1 = pipeline.create(dai.node.ImageManip)
manip1.initialConfig.setCropRect(0, 0, 0.5, 1)
manip1.setMaxOutputFrameSize(maxFrameSize)
camRgb.preview.link(manip1.inputImage)

manip2 = pipeline.create(dai.node.ImageManip)
manip2.initialConfig.setCropRect(0.5, 0, 1, 1)
manip2.setMaxOutputFrameSize(maxFrameSize)
camRgb.preview.link(manip2.inputImage)

xout1 = pipeline.create(dai.node.XLinkOut)
xout1.setStreamName('out1')
manip1.out.link(xout1.input)

xout2 = pipeline.create(dai.node.XLinkOut)
xout2.setStreamName('out2')
manip2.out.link(xout2.input)

# Connect to device and start pipeline
with dai.Device(pipeline) as device:
    # Output queues will be used to get the rgb frames from the outputs defined above
    q1 = device.getOutputQueue(name="out1", maxSize=4, blocking=False)
    q2 = device.getOutputQueue(name="out2", maxSize=4, blocking=False)

    while True:
        if q1.has():
            cv2.imshow("Tile 1", q1.get().getCvFrame())

        if q2.has():
            cv2.imshow("Tile 2", q2.get().getCvFrame())

        if cv2.waitKey(1) == ord('q'):
            break
```

### v3 equivalent

```python
#!/usr/bin/env python3

import cv2
import depthai as dai

# Create pipeline
pipeline = dai.Pipeline()

camRgb = pipeline.create(dai.node.Camera).build()
preview = camRgb.requestOutput((1000, 500), type=dai.ImgFrame.Type.RGB888p)

# In this example we use 2 imageManips for splitting the original 1000x500
# preview frame into 2 500x500 frames
manip1 = pipeline.create(dai.node.ImageManip)
manip1.initialConfig.addCrop(0, 0, 500, 500)
preview.link(manip1.inputImage)

manip2 = pipeline.create(dai.node.ImageManip)
manip2.initialConfig.addCrop(500, 0, 500, 500)
preview.link(manip2.inputImage)

q1 = manip1.out.createOutputQueue()
q2 = manip2.out.createOutputQueue()

pipeline.start()
with pipeline:
    while pipeline.isRunning():
        if q1.has():
            cv2.imshow("Tile 1", q1.get().getCvFrame())

        if q2.has():
            cv2.imshow("Tile 2", q2.get().getCvFrame())

        if cv2.waitKey(1) == ord('q'):
            break
```

examples/README.md

Lines changed: 12 additions & 125 deletions
@@ -1,137 +1,24 @@
-# Examples for the Depthai V3 API
-
-The examples in this directory show the existing functionality of the Depthai V3 API.
-
-The examples range from the ones that were just minimally ported from the V2 API, to showcase that porting the existing code is straightforward,
-to the ones that are specifically designed to show the new features of the V3 API.
-
-The StereoDepth/stereo_autocreate.py example is a good example of the new features of the V3 API which showcases the ability to automatically create the stereo inputs
-as well as the ability to create a custom node that can be used in the pipeline.
-
-```python
-# Create pipeline
-with dai.Pipeline() as pipeline:
-    # Allow stereo inputs to be created automatically
-    stereo = pipeline.create(dai.node.StereoDepth).build(autoCreateCameras=True)
-    # This can be alternatively written as:
-    # stereo = dai.node.StereoDepth(autoCreateCameras=True)
-    visualizer = pipeline.create(StereoVisualizer).build(stereo.disparity)
-    pipeline.start()
-    while pipeline.isRunning():
-        time.sleep(0.1)
-```
+# DepthAI Examples

+For more information about the examples, please refer to the [DepthAI documentation](https://stg.docs.luxonis.com/software/v3/examples/).
 ## Supported platforms
 The examples are now split into three categories:
-* In the root directory of examples, there are the examples that are supported on all platforms
-* In RVC2/RVC4 directories there are the examples that are supported only on the RVC2/RVC4 platform
-
-In the future we plan to make the examples more platform agnostic and we'll be slowly moving as many examples as possible to the root directory.
-
-## Supported languages
-The examples are currently in Python and C++. The C++ examples are in the `cpp` directory and the Python examples are in the `python` directory.
-
-Currently there are more python examples than C++ examples, but we plan to match the examples in both languages in the future and keep them 1:1.
+* The root directory contains the examples that are supported on both RVC2 and RVC4 platforms
+* The RVC2/RVC4 directories contain the examples that are supported only on the corresponding platform

-## Syntax differences compared to the V2 API
+## Python

-In Python, currently, there exist two syntaxes for creating nodes. Before we decide, we first want to gather feedback from you.
-
-One syntax uses the `Pipeline.create(...)` and `Node.build(...)` methods like this `pipeline.create(dai.node.StereoDepth).build(autoCreateCameras=True)`.
-
-The other syntax uses the pipeline from the `with` statement and does all the work in the constructor -- e.g. `dai.node.StereoDepth(autoCreateCameras=True)`.
-
-In Python, both syntaxes also accept keyword arguments that mirror setter methods -- e.g. `dai.node.VideoEncoder(bitrate=2*24)`. Currently, not all parameters are available.
-
-We'd love to hear your opinion on which one you prefer. Currently, this syntax is available only for `Camera`, `DetectionNetwork`, `StereoDepth` and `VideoEncoder`. Other nodes still have to be created with `pipeline.create`, parameters set with setter methods and inputs linked with `.link`.
-
-Examples of this can be found in the `VideoEncoder` example.
-
-## Python installation
-
-To get the examples running, install the requirements with:
+To get the Python examples running, install the requirements with:

 ```
 python3 depthai-core/examples/python/install_requirements.py
 ```
-
-NOTE: Right now wheels for Windows are missing, but wheels for MacOS and Linux, both x86_64 and arm64, are available.
-
-## What's new in the V3 API
-* No more explicit XLink nodes - the XLink "bridges" are created automatically
-* Host nodes - nodes that run on the host machine now cleanly interoperate with the device nodes
-* Custom host nodes - the user can create custom nodes that run on the host machine
-  * Both `ThreadedHostNode` and `HostNode` are supported.
-  * `ThreadedHostNode` works in a very similar fashion to the `ScriptNode` where the user specifies a `run` function which is then executed in a separate thread.
-  * `HostNode` has an input map `inputs` where all the inputs are implicitly synced
-  * Available both in Python and C++
-* Record and replay nodes
-  * Holistic record and replay is WIP
-* Support for both RVC2&RVC3 with initial support for RVC4
-* Device is now available at node construction, so we will be able to create smart defaults
-  * Not used extensively yet, will be added gradually to more and more nodes.
-* Support for NNArchive for the existing NN nodes
-* `build(params)` functions for nodes where they can autocreate their inputs
-  * Not used extensively yet, will be added gradually to more and more nodes.
-
-
-## How to port an example from V2 to V3
-The process of porting an example from V2 to V3 should be straightforward.
-
-The minimal needed changes:
-* Remove the explicit creation of the device **or** pass the device in the pipeline constructor
-* Remove the explicit XLink nodes
-* Replace any `.getOutputQueue()` calls with `output.createOutputQueue()` calls
-
-
-### Quick porting example
-Let's take the simplest `rgb_video.py` example and port it to the V3 API.
-
-The old code is commented with #ORIG and the new code is commented with #NEW:
-```python
-#!/usr/bin/env python3
-
-import cv2
-import depthai as dai
-
-# Create pipeline
-# ORIG
-# pipeline = dai.Pipeline()
-with dai.Pipeline() as pipeline:
-    # Define source and output
-    camRgb = pipeline.create(dai.node.ColorCamera)
-
-    # ORIG
-    # xoutVideo = pipeline.create(dai.node.XLinkOut)
-    # xoutVideo.setStreamName("video")
-
-    # Properties
-    camRgb.setBoardSocket(dai.CameraBoardSocket.CAM_A)
-    camRgb.setResolution(dai.ColorCameraProperties.SensorResolution.THE_1080_P)
-    camRgb.setVideoSize(1920, 1080)
-
-    # Linking
-    # ORIG
-    # camRgb.video.link(xoutVideo.input)
-    # NEW
-    videoQueue = camRgb.video.createOutputQueue()
-
-    # ORIG
-    # with dai.Device(pipeline) as device:
-    #     video = device.getOutputQueue(name="video", maxSize=1, blocking=False)
-    #     while True:
-    # NEW
-    while pipeline.isRunning():
-        videoIn = videoQueue.get()
-        # Get BGR frame from NV12 encoded video frame to show with opencv
-        # Visualizing the frame on slower hosts might have overhead
-        cv2.imshow("video", videoIn.getCvFrame())
-
-        if cv2.waitKey(1) == ord('q'):
-            break
+and run the example with:
+```
+python3 depthai-core/examples/python/Camera/camera_output.py
 ```

-
-## Running examples
+## C++

 To build the examples, configure with the following option added from the root of the repository:
 ```
@@ -142,7 +29,7 @@ cmake --build build
 Then navigate to the `build/examples` folder and run a preferred example
 ```
 cd build/examples
-./MobileNet/rgb_mobilenet
+./cpp/Camera/camera_output
 ```

 ## VSLAM
@@ -163,4 +50,4 @@ python3 depthai-core/examples/python/install_requirements.py --install_rerun
 You can also install it separately; installation instructions can be found [here](https://rerun.io/docs/getting-started/installing-viewer). If you use Numpy v2.0 you might need to downgrade it for Rerun.
 **NOTE** Currently, Rerun does not work with Numpy 2.0; you need to downgrade it (for example to 1.24.4) to properly view images.

-> ℹ️ Multi-Config generators (like Visual Studio on Windows) will have the examples built in `build/examples/MobileNet/[Debug/Release/...]/rgb_mobilenet`
+> ℹ️ Multi-Config generators (like Visual Studio on Windows) will have the examples built in `build/examples/Camera/[Debug/Release/...]/camera_output`
