interfaces for decoder fallback, encoder avoids cuda and vaapi hard coding, improved flush #5

berndpfrommer · 2025-07-15T07:48:46Z

This PR covers several changes that aim at making the encoder and decoder more generally useful.

The encoder avoids explicit references to VAAPI and cuda to prevent spurious error messages when e.g. cuda is not available
Improved flushing of the encoder: only the frame_id needs to be given, not the full header (which was not used before)
Added support for discovery of decoders
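
To illustrate the decoder fallback idea from the PR title, here is a minimal sketch. It assumes the decoders arrive as a comma-separated list (as a later commit in this PR mentions) and that some callback can report whether a given decoder opened successfully. The names `splitDecoderList`, `TryOpen`, and `openFirstWorkingDecoder` are hypothetical and not taken from this package's API.

```cpp
#include <functional>
#include <optional>
#include <sstream>
#include <string>
#include <vector>

// Split a comma-separated decoder list, dropping empty entries.
std::vector<std::string> splitDecoderList(const std::string & list)
{
  std::vector<std::string> names;
  std::stringstream ss(list);
  std::string name;
  while (std::getline(ss, name, ',')) {
    if (!name.empty()) {
      names.push_back(name);
    }
  }
  return names;
}

// Hypothetical callback: returns true if the named decoder could be opened.
using TryOpen = std::function<bool(const std::string &)>;

// Walk the list in order and return the first decoder that opens,
// or std::nullopt if every candidate fails.
std::optional<std::string> openFirstWorkingDecoder(
  const std::string & list, const TryOpen & tryOpen)
{
  for (const auto & name : splitDecoderList(list)) {
    if (tryOpen(name)) {
      return name;
    }
  }
  return std::nullopt;
}
```

A caller would pass something like "hevc_cuvid,hevc" together with a callback that wraps the real decoder initialization, so a machine without cuda falls through to the software decoder.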

hidmic

First pass. Mostly stylistic. Functionally looks correct.

I'll test it locally.

README.md

include/ffmpeg_encoder_decoder/decoder.hpp

src/decoder.cpp

src/encoder.cpp

src/utils.cpp

berndpfrommer · 2025-07-16T05:37:07Z

Thanks for looking over the PR!
I have been making a number of other changes since then. I will piggyback your feedback onto the next commit.

hidmic · 2025-07-18T14:07:32Z

src/encoder.cpp

+  frames_ctx->format = utils::find_hw_config(&usesHardwareFrames_, hwDevType, codec);

  if (usesHardwareFrames_) {
    const auto fmts = utils::get_hwframe_transfer_formats(hw_frames_ref);


@berndpfrommer I finally got around to trying this on a machine with an RTX 4090. It failed with "cannot find valid sw pixel format!" when trying to encode images using hevc_nvenc.

After some digging, I think the problem may be in how we look for supported pixel formats. utils::get_hwframe_transfer_formats() hits av_hwframe_transfer_get_formats(). When frames_ctx->format is cuda, there are no transfer formats available (see https://ffmpeg.org/doxygen/trunk/hwcontext__cuda_8c_source.html#l00216). IIUC what goes in goes out (see https://ffmpeg.org/pipermail/libav-user/2018-June/011179.html).

I believe we should be using av_hwdevice_get_hwframe_constraints() to fetch the set of supported pixel formats, but don't quote me on that.

berndpfrommer · 2025-07-19

I am able to replicate the issue.

The old code only used hardware frames for vaapi, not for cuda, and it worked. Although nvenc is hardware accelerated, it somehow neither uses hardware frames nor opens a hardware device. Apparently, no call to av_hwdevice_ctx_create() is necessary for nvenc.

What I don't know is when to use hw frames and when not to, and I want to avoid hardcoding "vaapi" as it was before. I'm looking for any flags or other indicators that would tell when to use hw frames.

berndpfrommer · 2025-07-21T15:02:31Z

@hidmic I have found a way to tell hevc_nvenc apart from hevc_vaapi by looking at the hardware formats to transfer TO (rather than FROM). In the case of nvenc one does not even have to open a hardware device. Apparently this is handled internally by the nvenc library.
I could encode/decode using hevc (nvenc/cuvid and vaapi), and av1 (librav1e/libaom-av1). I assume h264 should also work. Note that the ROS parameter names have changed now, creating backwards incompatibility. I did not want to carry forward the "tune" etc parameters. They are now all settable via the av_options parameter.
Your help in testing is very much appreciated. Please let me know if things are still broken for you.

berndpfrommer · 2025-07-24T05:57:14Z

I have more changes coming. The subject of image formats is really tricky.
Here's the issue: no encoder supports bayer images natively, and few even support single-channel formats like mono8. This means bayer images first need to be converted to e.g. rgb8 color before encoding, which inflates the data and can actually increase the bit rate when configuring for lossless encoding (I had this happen with hevc_nvenc).
Single-channel image encoding can be achieved by converting e.g. mono8 -> nv12 (and setting the color channels to zero), and then converting back to the single-channel encoding after decoding. But on the decoder side there is no way to tell what the original image format was. I will embed the image format in the encoding field of the message, separated with a slash: hevc/bayer_rggb8. This way the decoder knows that a decoded nv12 formatted image actually once was a bayer image, and can reformat accordingly.
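
The mono8 -> nv12 round trip described above can be sketched as follows. This is only an illustration on raw byte buffers, not the code in this PR: it assumes a tightly packed image with even width and height, and it fills the chroma plane with 128, which is the neutral ("zero color") value for 8-bit YUV. The function names are hypothetical, and a lossy or limited-range encode in between would not reproduce the luma bytes exactly.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Pack a tightly packed single-channel image into an NV12 buffer:
// a full-resolution Y plane followed by an interleaved, half-height UV plane.
std::vector<uint8_t> mono8ToNv12(
  const std::vector<uint8_t> & mono, size_t width, size_t height)
{
  if (width % 2 != 0 || height % 2 != 0) {
    throw std::invalid_argument("NV12 needs even width and height");
  }
  const size_t ySize = width * height;
  if (mono.size() != ySize) {
    throw std::invalid_argument("buffer size does not match width * height");
  }
  // Start with everything at neutral chroma (128), then overwrite the Y plane.
  std::vector<uint8_t> nv12(ySize + ySize / 2, 128);
  std::copy(mono.begin(), mono.end(), nv12.begin());
  return nv12;
}

// Recover the single-channel image by keeping only the Y plane.
std::vector<uint8_t> nv12ToMono8(
  const std::vector<uint8_t> & nv12, size_t width, size_t height)
{
  const size_t ySize = width * height;
  if (nv12.size() < ySize) {
    throw std::invalid_argument("NV12 buffer too small");
  }
  return std::vector<uint8_t>(
    nv12.begin(), nv12.begin() + static_cast<std::ptrdiff_t>(ySize));
}
```

The same packing applies to a bayer mosaic treated as one channel, which is why the decoder needs the original encoding to know how to interpret the recovered plane.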

hidmic · 2025-07-24T16:20:01Z

> I have found a way to tell hevc_nvenc apart from hevc_vaapi by looking at the hardware formats to transfer TO (rather than FROM).

Wonderful. I'll give it a shot.

> Note that the ROS parameter names have changed now, creating backwards incompatibility. I did not want to carry forward the "tune" etc parameters. They are now all settable via the av_options parameter.

Understood, but won't this be a blocker for releasing this package against existing LTS distributions?

> I will embed the image format in the encoding field of the message, separated with a slash: hevc/bayer_rggb8

That sounds reasonable, but do note that sensor_msgs/msg/CompressedImage already has a format pattern that should cover this use case. Packet != image and formats need not match, but it'd be nice to minimize the variance.

berndpfrommer · 2025-07-26T07:10:09Z

> > Note that the ROS parameter names have changed now, creating backwards incompatibility. I did not want to carry forward the "tune" etc parameters. They are now all settable via the av_options parameter.

> Understood, but won't this be a blocker for releasing this package against existing LTS distributions?

I really don't know whether the policies apply only to core ROS packages (like rclcpp) or to all packages. For my packages I usually just bump the major version and release them. The user base is small, and the packages are not mature. If I stuck to the official core policy, my humble/jazzy packages would be hopelessly outdated. So far nobody has screamed at me for modifying my APIs on LTS distros.
About message formats, that's another matter, so I'm glad you pointed that one out (see below). It's very tedious to change data that has already been collected, so I definitely want to get that right.

> > I will embed the image format in the encoding field of the message, separated with a slash: hevc/bayer_rggb8

> That sounds reasonable, but do note that sensor_msgs/msg/CompressedImage already has a format pattern that should cover this use case. Packet != image and formats need not match, but it'd be nice to minimize the variance.

Thanks for pointing this out. I will change from / to a semicolon and organize it similarly to compressed_image_transport.
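
As a rough illustration of carrying the original image format alongside the codec name, the sketch below composes and parses such a string. The thread does not spell out the final field order or layout, so the "codec;original_encoding" arrangement, the `PacketEncoding` struct, and both function names are assumptions made for this example only.

```cpp
#include <string>

// Hypothetical pair: the codec used for the packet and the encoding
// of the image before it was converted for the encoder.
struct PacketEncoding
{
  std::string codec;
  std::string originalEncoding;
};

// Join the two fields with a separator; a packet without a recorded
// original encoding keeps the plain codec name.
std::string composeEncoding(const PacketEncoding & e, char sep = ';')
{
  return e.originalEncoding.empty() ? e.codec : e.codec + sep + e.originalEncoding;
}

// Split at the first separator. A string without a separator is treated
// as a bare codec name, which keeps older messages readable.
PacketEncoding parseEncoding(const std::string & s, char sep = ';')
{
  const auto pos = s.find(sep);
  if (pos == std::string::npos) {
    return {s, ""};
  }
  return {s.substr(0, pos), s.substr(pos + 1)};
}
```

Keeping the separator a parameter makes it easy to read both the earlier slash form (hevc/bayer_rggb8) and the semicolon form with the same parser.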

hidmic · 2025-07-28T17:48:42Z

I can confirm encoding using hevc_nvenc works as intended. Decoding on republishing isn't 🤔

On what distro are you testing this @berndpfrommer ?

berndpfrommer · 2025-07-29T06:39:23Z

I'm developing on rolling.
Can you provide a console log that would shed more light on what's going wrong?
I have added an encoder/decoder test to the ffmpeg_image_transport repo. Does colcon test pass? It passes for me on humble/jazzy/rolling.
Thanks,
Bernd

berndpfrommer · 2025-08-06T20:29:14Z

I have successfully tested this PR now under vaapi and nvenc. Merging it into master and will be releasing it on Rolling.

hidmic · 2025-08-06T20:39:33Z

Ah, you beat me to it @berndpfrommer. Working through a long backlog, didn't get back here in time. Sorry for that, and thank you for pushing!

What about the ffmpeg_image_transport change? I see the branch is up to date and ready to be PR'd too.

new interfaces to support decoder fallback, encoder avoids cuda and vaapi hard coding, improved flush

c9dfc79

berndpfrommer force-pushed the support_fallback_decoder branch from 9832669 to c9dfc79 on July 15, 2025 08:28

berndpfrommer mentioned this pull request Jul 15, 2025

Fallback ffmpeg codecs support ros-misc-utilities/ffmpeg_image_transport#41

Closed

hidmic reviewed Jul 15, 2025

berndpfrommer force-pushed the support_fallback_decoder branch 2 times, most recently from 928f92a to 782d60a on July 18, 2025 05:29

decoder flush(), decoder testing, improved api docs

5eed638

berndpfrommer force-pushed the support_fallback_decoder branch from 782d60a to 5eed638 on July 18, 2025 05:38

hidmic reviewed Jul 18, 2025

berndpfrommer added 4 commits July 21, 2025 03:29

fix nvenc encoding by looking at transfer TO formats

43747b0

fix bug with non-accel pix fmt

780e5f7

filter misconfigured decoders

6c21e43

fix debug printout for AV_PIX_FMT_NONE

8f466c4

reworked tests to cover bayer

ed21fa3

fix uninitialized memory bug

3c314bd

new feature: can set AV options for decoder

81cdac0

berndpfrommer added 3 commits July 31, 2025 04:45

use comma-separated decoder list now, use semi-colon to separate encodings

c18a15c

improved documentation

61c5e65

use mutex to make thread safe

a426212

berndpfrommer merged commit 7c221cd into master Aug 6, 2025
3 checks passed

berndpfrommer mentioned this pull request Aug 6, 2025

support for falling back on other decoders if unsuccessfull ros-misc-utilities/ffmpeg_image_transport#45

Merged
