Updated to multimodal VA LP

pareenaverma · pareenaverma · commit 1e9a32a0a54f · 2025-10-14T14:06:42.000Z
diff --git a/content/learning-paths/mobile-graphics-and-gaming/voice-assistant/2-overview.md b/content/learning-paths/mobile-graphics-and-gaming/voice-assistant/2-overview.md
@@ -39,15 +39,15 @@ The voice assistant pipeline imports and builds a separate module to provide thi
 https://gitlab.arm.com/kleidi/kleidi-examples/speech-to-text
 ```
 
-and build for various platforms to independently benchmark STT functionality:
+You can build the pipeline for various platforms and independently benchmark the STT functionality:
 
 |Platform|Details|
 |---|---|
 |Linux|x86_64 - KleidiAI is disabled by default, aarch64 - KleidiAI is enabled by default.|
 |Android|Cross-compile for an Android device, ensure the Android NDK path is set and correct toolchain file is provided. KleidiAI enabled by default.|
-|MacOS|Native or cross-compilation for a Mac device. KleidiAI and SME kernels can be used if available on device.|
+|macOS|Native or cross-compilation for a Mac device. KleidiAI and SME kernels can be used if available on device.|
 
-Currently, this module uses [whisper.cpp](https://github.com/ggml-org/whisper.cpp) and wraps the backend library by a thin C++ layer. The module also provides JNI bindings for developers targetting Android based applications.
+Currently, this module uses [whisper.cpp](https://github.com/ggml-org/whisper.cpp) and wraps the backend library with a thin C++ layer. The module also provides JNI bindings for developers targeting Android based applications.
 
 {{% notice %}}
 You can get more information on how to build and use this module [here](https://gitlab.arm.com/kleidi/kleidi-examples/speech-to-text/-/blob/main/README.md?ref_type=heads)
@@ -67,15 +67,15 @@ The voice assistant pipeline imports and builds a separate module to provide thi
 https://gitlab.arm.com/kleidi/kleidi-examples/large-language-models
 ```
 
-and build for various platforms to independently benchmark LLM functionality:
+You can build this pipeline for various platforms and independently benchmark the LLM functionality:
 
 |Platform|Details|
 |---|---|
 |Linux|x86_64 - KleidiAI is disabled by default, aarch64 - KleidiAI is enabled by default.|
 |Android|Cross-compile for an Android device, ensure the Android NDK path is set and correct toolchain file is provided. KleidiAI enabled by default.|
-|MacOS|Native or cross-compilation for a Mac device. KleidiAI and SME kernels can be used if available on device.|
+|macOS|Native or cross-compilation for a Mac device. KleidiAI and SME kernels can be used if available on device.|
 
-Currently, this module provides a thin C++ layer as well as JNI bindings for developers targetting Android based applications, supported backends are:
+Currently, this module provides a thin C++ layer as well as JNI bindings for developers targeting Android based applications, supported backends are:
 |Framework|Dependency|Input modalities supported|Output modalities supported|Neural Network|
 |---|---|---|---|---|
 |llama.cpp|https://github.com/ggml-org/llama.cpp|`image`, `text`|`text`|phi-2,Qwen2-VL-2B-Instruct|
@@ -94,4 +94,4 @@ This part of the application pipeline uses the Android Text-to-Speech API along
 
 In synchronous mode, speech playback begins only after the full LLM response is received. By default, the application operates in asynchronous mode, where speech synthesis starts as soon as a full or partial sentence is ready. Remaining tokens are buffered and processed by the Android Text-to-Speech engine to ensure uninterrupted playback.
 
-You are now familiar with the building blocks of this application and can build these independently for various platforms. You can now build the multi-modal Voice Assistant example which runs on Android OS in the next step.
+You are now familiar with the building blocks of this application and can build these independently for various platforms. You can now build the multimodal Voice Assistant example which runs on Android OS in the next step.
diff --git a/content/learning-paths/mobile-graphics-and-gaming/voice-assistant/5-kleidiai.md b/content/learning-paths/mobile-graphics-and-gaming/voice-assistant/5-kleidiai.md
@@ -31,5 +31,5 @@ To disable KleidiAI during build:
 
 KleidiAI simplifies development by abstracting away low-level optimization: developers can write high-level code while the KleidiAI library selects the most efficient implementation at runtime based on the target hardware. This is possible thanks to its deeply optimized micro-kernels tailored for Arm architectures.
 
-As newer versions of the architecture become available, KleidiAI becomes even more powerful: simply updating the library allows applications like the multi-modal Voice Assistant to take advantage of the latest architectural improvements - such as SME2 — without requiring any code changes. This means better performance on newer devices with no additional effort from developers.
+As newer versions of the architecture become available, KleidiAI becomes even more powerful: simply updating the library allows applications like the multimodal Voice Assistant to take advantage of the latest architectural improvements such as SME2, without requiring any code changes. This means better performance on newer devices with no additional effort from developers.
 
diff --git a/content/learning-paths/mobile-graphics-and-gaming/voice-assistant/_index.md b/content/learning-paths/mobile-graphics-and-gaming/voice-assistant/_index.md
@@ -1,15 +1,15 @@
 ---
-title: Accelerate multi-modal Voice Assistant performance with KleidiAI and SME2
+title: Accelerate multimodal Voice Assistant performance with KleidiAI and SME2
 
 minutes_to_complete: 30
 
-who_is_this_for: This is an introductory topic for developers who want to see a pipeline of a multi-modal Voice Assistant application and accelerate the performance on Android devices using KleidiAI and SME2.
+who_is_this_for: This is an introductory topic for developers who want to implement a multimodal pipeline for a Voice Assistant application and accelerate the performance on Android devices using KleidiAI and SME2.
 
 learning_objectives:
-    - Learn about the multi-modal Voice Assistant pipeline and different components used.
+    - Learn about the multimodal Voice Assistant pipeline and different components used.
     - Learn about the functionality of ML components used and how these can be built and benchmarked on various platforms.
-    - Compile and run a multi-modal Voice Assistant example based on Android OS.
-    - Optimize performance of multi-modal Voice Assistant using KleidiAI and SME2.
+    - Compile and run a multimodal Voice Assistant example based on Android OS.
+    - Optimize performance of multimodal Voice Assistant using KleidiAI and SME2.
 
 prerequisites:
     - An Android phone that supports the i8mm Arm architecture feature (8-bit integer matrix multiplication). This Learning Path was tested on a Google Pixel 8 Pro.

Original file line number	Diff line number	Diff line change
`@@ -31,5 +31,5 @@ To disable KleidiAI during build:`
`31`	`31`
`32`	`32`	`KleidiAI simplifies development by abstracting away low-level optimization: developers can write high-level code while the KleidiAI library selects the most efficient implementation at runtime based on the target hardware. This is possible thanks to its deeply optimized micro-kernels tailored for Arm architectures.`
`33`	`33`
`34`		`-As newer versions of the architecture become available, KleidiAI becomes even more powerful: simply updating the library allows applications like the multi-modal Voice Assistant to take advantage of the latest architectural improvements - such as SME2 — without requiring any code changes. This means better performance on newer devices with no additional effort from developers.`
	`34`	`+As newer versions of the architecture become available, KleidiAI becomes even more powerful: simply updating the library allows applications like the multimodal Voice Assistant to take advantage of the latest architectural improvements such as SME2, without requiring any code changes. This means better performance on newer devices with no additional effort from developers.`
`35`	`35`