thecapn32

This is an attempt to support the H.264 compressed video format on native_sim.
The H.264 video is provided as a C array in a .h file; the sample currently reads video data from that file and sends it over TCP. Because the data is sent as fast as possible, the frame rate is wrong.

This work still needs to be done:

  • add a mechanism to pace H.264 frames so that playback maintains 30 FPS
  • a dynamic way to adjust the video format in video-sw-generator (the H.264 format is currently static)


github-actions bot commented Sep 3, 2025

Hello @thecapn32, and thank you very much for your first pull request to the Zephyr project!
Our Continuous Integration pipeline will execute a series of checks on your Pull Request commit messages and code, and you are expected to address any failures by updating the PR. Please take a look at our commit message guidelines to find out how to format your commit messages, and at our contribution workflow to understand how to update your Pull Request. If you haven't already, please make sure to review the project's Contributor Expectations and update (by amending and force-pushing the commits) your pull request if necessary.
If you are stuck or need help please join us on Discord and ask your question there. Additionally, you can escalate the review when applicable. 😊

@josuah josuah self-requested a review September 3, 2025 13:09
@josuah left a comment (Contributor)

Thanks for this improvement! This is timely, as pull request #92884 is doing the same thing with real hardware, and it will benefit every future video device.

Some quick feedback first; I will do a more in-depth review round later.

Comment on lines +510 to +513
.fmt.width = 640, \
.fmt.height = 320, \
.fmt.pitch = 0, \
.fmt.pixelformat = VIDEO_PIX_FMT_H264, \
Contributor

This was probably convenient for testing, but it is better to keep it to what it was, and instead use the video_set_format() API from the application to select H.264.

if (video_bits_per_pixel(fmt.pixelformat) > 0) {
	buffer_size = fmt.pitch * fmt.height;
} else {
	buffer_size = fmt.width * fmt.height / 10;
}
Contributor

This is probably going to need some other strategy, as the compression ratio is hard to tune.

This might as well be something the user decides, depending on what is expected. For instance, buffer_size = fmt.width * fmt.height / CONFIG_VIDEO_MIN_COMPRESSION_RATIO, or even buffer_size = CONFIG_VIDEO_COMPRESSED_BUFFER_SIZE.

Comment on lines +1566 to +1570
/**
* H264 without start code
*/
#define VIDEO_PIX_FMT_H264_MVC VIDEO_FOURCC('M', '2', '6', '4')

Contributor

Maybe we can keep 3D TV H.264 for another day and not introduce H264_MVC for now? https://en.wikipedia.org/wiki/Multiview_Video_Coding

Contributor

In order to move forward on PR #92884, I've cherry-picked this commit but without the M264 definition.

Comment on lines +227 to +229
/* Calculate copy size */
size_t remaining = h264_test_data_len - position;
copy_size = (remaining > buffer_size) ? buffer_size : remaining;
Contributor

This fills a buffer with H.264 data completely, which does sound like a valid approach.
Though this might not fit the description of V4L2_PIX_FMT_H264:

H264 Access Unit. The decoder expects one Access Unit per buffer. The encoder generates one Access Unit per buffer. If ioctl VIDIOC_ENUM_FMT reports V4L2_FMT_FLAG_CONTINUOUS_BYTESTREAM then the decoder has no requirements since it can parse all the information from the raw bytestream. -- https://www.kernel.org/doc/html/latest/userspace-api/media/v4l/pixfmt-compressed.html#compressed-formats

I need to check https://en.wikipedia.org/wiki/Network_Abstraction_Layer to be sure of what that means in detail, but if you want to use H.264 this way, you might need one buffer per source image frame.

That is, transmitting each buffer is expected to update the frame immediately on the viewer's side.

This means that every buffer will have a different size.

Author

@josuah NAL units are separated by start codes; can sending each frame at a 33 ms interval work?

Contributor

It seems like I was mistaken: there are two valid approaches for V4L2_PIX_FMT_H264.

If I understand it right, what you propose is the stateful video encoder, where the data produced is immediately usable for being transferred (i.e. over the network).

A stateful video encoder takes raw video frames in display order and encodes them into a bytestream. It generates complete chunks of the bytestream, including all metadata, headers, etc. The resulting bytestream does not require any further post-processing by the client. - https://www.kernel.org/doc/html/latest/userspace-api/media/v4l/dev-encoder.html

And it seems like it matches the description of V4L2_PIX_FMT_H264 when V4L2_FMT_FLAG_CONTINUOUS_BYTESTREAM is enabled...

All good then! No need to change, but this opens the question of how to make the distinction between "stateful" and "stateless" encoders in Zephyr.

Contributor

Clicking view this file shows something like this:

unsigned char h264_test_data[] = {
  0x00, 0x00, 0x00, 0x01, 0x67, 0x42, 0xc0, 0x1e, 0x8c, 0x68, 0x0a, 0x03,
  0xdb, 0x01, 0x01, 0xe1, 0x10, 0x8d, 0x40, 0x00, 0x00, 0x00, 0x01, 0x68,
  0xce, 0x3c, 0x80, 0x00, 0x00, 0x00, 0x01, 0x65, 0xb8, 0x00, 0x04, 0x08,
  0xf8, 0x84, 0x44, 0x44, 0x18, 0x11, 0xd5, 0x93, 0x80, 0xba, 0x90, 0x00,
  0x75, 0x5f, 0xe0, 0x01, 0xcb, 0x33, 0x44, 0x51, 0xa0, 0xd4, 0xdd, 0xfe,
...
  0xb4, 0x32, 0x5b, 0x5f, 0xf3, 0xc6, 0x2d, 0xc6, 0x1e, 0xb1, 0xfe, 0x87,
  0xbc, 0x0d, 0xdb, 0x8a, 0x58, 0x4b, 0xf9, 0xe0, 0x9d, 0x44, 0xe7, 0xaf,
  0x08, 0x5b, 0x9e, 0xa0
};
unsigned int h264_test_data_len = 315304;

Contributor

Maybe it would be interesting to include the file as a binary. I think there are some places in Zephyr where this is done; I will try to give pointers on how to do it.

Contributor

Related #42580 (comment)

I think the best approach would be to remove the blob, and add instructions in the sample doc showing how to generate the header from a video file.

Contributor

This would also reduce the amount of data in the repo.
GStreamer and FFmpeg can generate such files.

@JarmouniA Would it make sense to add these commands to CMake so that the user does not have to manually copy-paste a command in order to build? Requiring a manual step would prevent using it in e.g. CI.

It does not hurt to also document it though.
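A sketch of what such a CMake rule could look like, assuming ffmpeg and xxd are available on the host; the input file name and paths are placeholders, and xxd derives the C symbol name from the input file name:

```cmake
# Sketch only: generate the H.264 blob header at build time instead of
# committing it to the repo.
find_program(FFMPEG ffmpeg REQUIRED)
find_program(XXD xxd REQUIRED)

add_custom_command(
  OUTPUT ${CMAKE_CURRENT_BINARY_DIR}/h264_test_data.h
  # Encode the test clip into a raw Annex-B H.264 elementary stream.
  COMMAND ${FFMPEG} -y -i ${CMAKE_CURRENT_SOURCE_DIR}/test_video.mp4
          -c:v libx264 -f h264 ${CMAKE_CURRENT_BINARY_DIR}/h264_test_data
  # Turn the raw stream into a C array header.
  COMMAND ${XXD} -i h264_test_data h264_test_data.h
  WORKING_DIRECTORY ${CMAKE_CURRENT_BINARY_DIR}
  DEPENDS ${CMAKE_CURRENT_SOURCE_DIR}/test_video.mp4
)
```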

@thecapn32 thecapn32 requested a review from josuah September 3, 2025 13:30

sonarqubecloud bot commented Sep 3, 2025
