Merge pull request #1479 from yangxile/main

pareenaverma · web-flow · commit 7ff2eed376ff · 2025-01-06T09:48:40.000-05:00
Add new LP article on how to Run vvenc (H.266 encoder) on Arm servers
diff --git a/content/learning-paths/servers-and-cloud-computing/vvenc/_index.md b/content/learning-paths/servers-and-cloud-computing/vvenc/_index.md
@@ -0,0 +1,30 @@
+---
+armips:
+- Neoverse
+author_primary: Willen Yang
+layout: learningpathall
+learning_objectives:
+- Build vvenc(H.266 encoder) project on Arm server
+- Run vvenc on Arm server to encode a real 1080p video file to measure the performance
+learning_path_main_page: 'yes'
+minutes_to_complete: 10
+operatingsystems:
+- Linux
+prerequisites:
+- An [Arm based instance](/learning-paths/servers-and-cloud-computing/csp/) from an appropriate
+  cloud service provider. This Learning Path has been verified on Arm Neoverse N2 based Alibaba cloud ECS instance(g8y), running `Ubuntu Linux 22.04.`
+skilllevels: Introductory
+subjects: Libraries
+test_images:
+- ubuntu:22.04
+test_link: null
+test_maintenance: true
+test_status:
+- passed
+title: Run vvenc (H.266 encoder) on Arm servers
+tools_software_languages:
+- vvenc
+weight: 1
+who_is_this_for: This is an introductory topic for software developers who want to
+  build and run vvenc(H.266 encoder) project on Arm servers and measure the performance.
+---
diff --git a/content/learning-paths/servers-and-cloud-computing/vvenc/_next-steps.md b/content/learning-paths/servers-and-cloud-computing/vvenc/_next-steps.md
@@ -0,0 +1,43 @@
+---
+# ================================================================================
+#       Edit
+# ================================================================================
+
+next_step_guidance: >
+    You can continue learning about porting cloud applications to the Arm architecture for increased performance and cost savings. The Learning Path on MongoDB is a great next step.
+# 1-3 sentence recommendation outlining how the reader can generally keep learning about these topics, and a specific explanation of why the next step is being recommended.
+
+recommended_path: "/learning-paths/servers-and-cloud-computing/mongodb/"
+# Link to the next learning path being recommended.
+
+
+# further_reading links to references related to this path. Can be:
+    # Manuals for a tool / software mentioned   (type: documentation)
+    # Blog about related topics                 (type: blog)
+    # General online references                 (type: website) 
+
+further_reading:
+    - resource:
+        title: vvenc Documentation
+        link: https://github.com/fraunhoferhhi/vvenc/wiki/Usage
+        type: documentation
+    - resource:
+        title: Delivering the best H.265 video experience on Arm Neoverse N2 Platform
+        link: https://community.arm.com/arm-community-blogs/b/infrastructure-solutions-blog/posts/h265-video-on-neoverse-n2
+        type: blog
+    - resource:
+        title: Optimized Video Encoding with FFmpeg on AWS Graviton Processors
+        link: https://aws.amazon.com/blogs/opensource/optimized-video-encoding-with-ffmpeg-on-aws-graviton-processors/
+        type: blog
+    - resource:
+        title: OCI Ampere A1 Compute instances can significantly reduce video encoding costs versus modern CPUs
+        link: https://community.arm.com/arm-community-blogs/b/operating-systems-blog/posts/oracle-cloud-infrastructure-arm-based-a1
+        type: blog
+
+# ================================================================================
+#       FIXED, DO NOT MODIFY
+# ================================================================================
+weight: 21                  # set to always be larger than the content in this path, and one more than 'review'
+title: "Next Steps"         # Always the same
+layout: "learningpathall"   # All files under learning paths have this same wrapper
+---
diff --git a/content/learning-paths/servers-and-cloud-computing/vvenc/_review.md b/content/learning-paths/servers-and-cloud-computing/vvenc/_review.md
@@ -0,0 +1,41 @@
+---
+# ================================================================================
+#       Edit
+# ================================================================================
+
+# Always 3 questions. Should try to test the reader's knowledge, and reinforce the key points you want them to remember.
+    # question:         A one sentence question
+    # answers:          The correct answers (from 2-4 answer options only). Should be surrounded by quotes.
+    # correct_answer:   An integer indicating what answer is correct (index starts from 0)
+    # explanation:      A short (1-3 sentence) explanation of why the correct answer is correct. Can add additional context if desired
+
+
+review:
+    - questions:
+        question: >
+            Does vvenc(H.266 encoder) run on Arm servers?
+        answers:
+            - "Yes"
+            - "No"
+        correct_answer: 1                     
+        explanation: >
+            H.266 codec is fully supported on 64-bit Arm servers running Linux.
+
+    - questions:
+        question: >
+            Does varying the preset settings on the images impact the codec performance?
+        answers:
+            - "Yes"
+            - "No"
+        correct_answer: 1                   
+        explanation: >
+            You can vary the preset settings on the different resolution images and measure the impact on performance.
+               
+
+# ================================================================================
+#       FIXED, DO NOT MODIFY
+# ================================================================================
+title: "Review"                 # Always the same title
+weight: 20                      # Set to always be larger than the content in this path
+layout: "learningpathall"       # All files under learning paths have this same wrapper
+---
diff --git a/content/learning-paths/servers-and-cloud-computing/vvenc/vvenc.md b/content/learning-paths/servers-and-cloud-computing/vvenc/vvenc.md
@@ -0,0 +1,163 @@
+---
+layout: learningpathall
+title: Build and run vvenc (H.266 encoder) on Arm servers
+weight: 2
+---
+
+## Install necessary software packages
+
+`vvenc` is an open-source H.266/VVC encoder that offers very high compression efficiency and performance. There have been significant efforts ongoing to optimize the open-source implementation of the H.266 encoder on Arm Neoverse platforms which supports Neon and SVE/SVE2 instructions. The optimized code is available on [Github](https://github.com/fraunhoferhhi/vvenc)
+
+Install `Cmake` and other dependencies:
+```bash
+sudo apt install git wget cmake p7zip-full -y
+```
+Install llvm compiler to compile the C++ code:
+```bash
+wget https://apt.llvm.org/llvm.sh
+chmod +x llvm.sh
+sudo ./llvm.sh 18 all
+```
+
+## Download and build vvenc source
+
+```bash
+git clone https://github.com/fraunhoferhhi/vvenc.git
+cd vvenc
+CXX=clang++-18 CC=clang-18 cmake -S . -B build/release-static -DVVENC_ENABLE_ARM_SIMD_SVE=1 -DVVENC_ENABLE_ARM_SIMD_SVE2=1
+```
+Make sure sve/sve2 has been enabled in the Makefile:
+```output
+root@iZuf61ixurqifmpxuji4viZ:~/vvenc-1.13.0# CXX=clang++-18 CC=clang-18 cmake -S . -B build/release-static -DVVENC_ENABLE_ARM_SIMD_SVE=1 -DVVENC_ENABLE_ARM_SIMD_SVE2=1
+-- The C compiler identification is Clang 18.1.8
+-- The CXX compiler identification is Clang 18.1.8
+-- Detecting C compiler ABI info
+-- Detecting C compiler ABI info - done
+-- Check for working C compiler: /usr/bin/clang-18 - skipped
+-- Detecting C compile features
+-- Detecting C compile features - done
+-- Detecting CXX compiler ABI info
+-- Detecting CXX compiler ABI info - done
+-- Check for working CXX compiler: /usr/bin/clang++-18 - skipped
+-- Detecting CXX compile features
+-- Detecting CXX compile features - done
+-- CMAKE_MODULE_PATH: updating module path to: /root/vvenc-1.13.0/cmake/modules
+-- normalized target architecture: AARCH64
+-- Performing Test SUPPORTED_Werror_unused_command_line_argument
+-- Performing Test SUPPORTED_Werror_unused_command_line_argument - Success
+-- Performing Test SUPPORTED_march=armv8_2_a+sve
+-- Performing Test SUPPORTED_march=armv8_2_a+sve - Success
+-- Performing Test SVE_COMPILATION_C_TEST_COMPILED
+-- Performing Test SVE_COMPILATION_C_TEST_COMPILED - Success
+-- Performing Test SVE_COMPILATION_CXX_TEST_COMPILED
+-- Performing Test SVE_COMPILATION_CXX_TEST_COMPILED - Success
+-- Performing Test SVE_HEADER_C_TEST_COMPILED
+-- Performing Test SVE_HEADER_C_TEST_COMPILED - Success
+-- Performing Test SVE_HEADER_CXX_TEST_COMPILED
+-- Performing Test SVE_HEADER_CXX_TEST_COMPILED - Success
+-- Performing Test SUPPORTED_march=armv9_a+sve2
+-- Performing Test SUPPORTED_march=armv9_a+sve2 - Success
+-- Performing Test SUPPORTED_msse4_1
+-- Performing Test SUPPORTED_msse4_1 - Failed
+-- Performing Test SUPPORTED_mavx
+-- Performing Test SUPPORTED_mavx - Failed
+-- Performing Test HAVE_INTRIN_mm_storeu_si16
+-- Performing Test HAVE_INTRIN_mm_storeu_si16 - Success
+-- Performing Test HAVE_INTRIN_mm_storeu_si32
+-- Performing Test HAVE_INTRIN_mm_storeu_si32 - Success
+-- Performing Test HAVE_INTRIN_mm_storeu_si64
+-- Performing Test HAVE_INTRIN_mm_storeu_si64 - Success
+-- Performing Test HAVE_INTRIN_mm_loadu_si32
+-- Performing Test HAVE_INTRIN_mm_loadu_si32 - Success
+-- Performing Test HAVE_INTRIN_mm_loadu_si64
+-- Performing Test HAVE_INTRIN_mm_loadu_si64 - Success
+-- Performing Test HAVE_INTRIN_mm_cvtsi128_si64
+-- Performing Test HAVE_INTRIN_mm_cvtsi128_si64 - Success
+-- Performing Test HAVE_INTRIN_mm_cvtsi64_si128
+-- Performing Test HAVE_INTRIN_mm_cvtsi64_si128 - Success
+-- Performing Test HAVE_INTRIN_mm_extract_epi64
+-- Performing Test HAVE_INTRIN_mm_extract_epi64 - Success
+-- Performing Test HAVE_INTRIN_mm256_zeroupper
+-- Performing Test HAVE_INTRIN_mm256_zeroupper - Failed
+-- Performing Test HAVE_INTRIN_mm256_loadu2_m128i
+-- Performing Test HAVE_INTRIN_mm256_loadu2_m128i - Success
+-- Performing Test HAVE_INTRIN_mm256_set_m128i
+-- Performing Test HAVE_INTRIN_mm256_set_m128i - Success
+-- x86 SIMD intrinsics enabled (using SIMDE for non-x86 targets)
+-- AArch64 Neon intrinsics enabled
+-- AArch64 SVE intrinsics enabled
+-- AArch64 SVE2 intrinsics enabled
+-- Looking for pthread.h
+-- Looking for pthread.h - found
+-- Performing Test CMAKE_HAVE_LIBC_PTHREAD
+-- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success
+-- Found Threads: TRUE
+-- Performing Test SUPPORTED_mxsave
+-- Performing Test SUPPORTED_mxsave - Failed
+-- Performing Test SUPPORTED_msse4_2
+-- Performing Test SUPPORTED_msse4_2 - Failed
+-- Performing Test SUPPORTED_mavx2
+-- Performing Test SUPPORTED_mavx2 - Failed
+-- Configuring done
+-- Generating done
+-- Build files have been written to: /root/vvenc-1.13.0/build/release-static
+```
+Start the build process:
+```bash
+cmake --build build/release-static -j
+```
+
+## Download video streams to run vvenc on and measure the performance
+
+To benchmark the compression efficiency and performance of `vvenc`, you will need a set of video streams to run the codec on. 
+
+Download the `1080P` video files:
+```bash
+cd ../video
+wget http://ultravideo.cs.tut.fi/video/Bosphorus_1920x1080_120fps_420_8bit_YUV_Y4M.7z
+7z -x Bosphorus_1920x1080_120fps_420_8bit_YUV_Y4M.7z
+```
+
+## Run vvenc on the sample video files
+
+To benchmark the performance of `vvenc` over 100 frames of the `1080P` video file, run the command:
+```console
+numactl -C 0-3 bin/release-static/vvencFFapp --preset faster --BitstreamFile stream.266 --Threads 4 --InputFile ~/video/Bosphorus_1920x1080_120fps_420_8bit_YUV.y4m --InputBitDepth 8 --InputChromaFormat 420 --fps 30 --FramesToBeEncoded 100 --SourceWidth 1920 --SourceHeight 1080 --Qp 22 --IntraPeriod 256 --NumPasses 1 --InternalBitDepth 10 --stats 1 --Verbosity 3 --pools ','
+```
+
+You can vary the preset settings and measure the impact on performance.
+
+
+## View Results
+
+The encoding Frame Rate (Frames per second) for the video files is output at the end of each run.
+
+Shown below is example output from running the vvenc h.266 encoding on the 1080P sample video file:
+
+```output
+vvencFFapp: VVenC, the Fraunhofer H.266/VVC Encoder, version 1.13.0 [Linux][clang 18.1.8][64 bit][SIMD=SVE2]
+vvencFFapp [info]: started @ Wed Dec 25 16:06:03 2024
+vvenc [info]: Input File                             : /root/video/Bosphorus_1920x1080_120fps_420_8bit_YUV.y4m
+vvenc [info]: Bitstream File                         : stream.266
+vvenc [info]: Real Format                            : 1920x1080  yuv420p  30 Hz  SDR  600 frames
+vvenc [info]: Frames                                 : encode 100 frames
+vvenc [info]: Internal format                        : 1920x1080  30 Hz  SDR
+vvenc [info]: Threads                                : 4  (parallel frames: 4)
+vvenc [info]: Rate control                           : QP 22
+vvenc [info]: Perceptual optimization                : Disabled
+vvenc [info]: Intra period (keyframe)                : 256
+vvenc [info]: Decoding refresh type                  : CRA
+
+vvenc [info]: stats:  30.0% frame=  30/100 fps=   4.7 avg_fps=   4.7 bitrate=  3174.97 kbps avg_bitrate=  3174.97 kbps elapsed= 00h:00m:07s left= 00h:00mvvenc [info]: stats:  60.0% frame=  60/100 fps=   7.2 avg_fps=   5.7 bitrate=  2405.72 kbps avg_bitrate=  2790.34 kbps elapsed= 00h:00m:11s left= 00h:00mvvenc [info]: stats:  90.0% frame=  90/100 fps=   7.2 avg_fps=   6.1 bitrate=  2445.71 kbps avg_bitrate=  2675.47 kbps elapsed= 00h:00m:15s left= 00h:00mvvenc [info]: stats: 100.0% frame= 100/100 fps=   6.6 avg_fps=   6.6 bitrate=  2490.95 kbps avg_bitrate=  2490.95 kbps elapsed= 00h:00m:16s left= 00h:00m:00s
+vvenc [info]: stats summary: frame= 100/100 avg_fps= 6.6 avg_bitrate= 2490.95 kbps
+vvenc [info]: stats summary: frame I:   1, kbps: 32874.48, AvgQP: 17.00
+vvenc [info]: stats summary: frame P:   0, kbps:      nan, AvgQP: nan
+vvenc [info]: stats summary: frame B:  99, kbps:  2184.04, AvgQP: 27.42
+
+
+vvenc [info]:	Total Frames |   Bitrate     Y-PSNR    U-PSNR    V-PSNR    YUV-PSNR
+vvenc [info]:	      100    a    2490.9480   43.7378   48.7096   47.9283   44.7472
+
+vvencFFapp [info]: finished @ Wed Dec 25 16:06:19 2024
+vvencFFapp [info]: Total Time:       58.962 sec. [user]       15.209 sec. [elapsed]
+```