Commit 45d9a13: Edge AI (IoT/Client) Projects (#30)

added 4 EdgeAI projects

1 parent cb0f877 commit 45d9a13

4 files changed: +363 −0 lines
---
title: "Edge AI with NPU: always-on AI with ExecuTorch on Cortex-M55 + Ethos-U85 → Cortex-A"
description: "The vision of Edge AI compute is to embed low-power intelligent sensing, perception, and decision systems everywhere. A low-power always-on AI island continuously monitors sensory inputs to detect triggers. When a trigger is detected, it wakes a more capable processor to carry out high-value inference, interaction, or control tasks."
subjects:
- "ML"
- "Performance and Architecture"
- "Embedded Linux"
- "RTOS Fundamentals"
requires-team:
- "No"
platform:
- "IoT"
- "Embedded and Microcontrollers"
- "AI"
sw-hw:
- "Software"
- "Hardware"
support-level:
- "Self-Service"
- "Arm Ambassador Support"
publication-date: 2025-11-27
license:
status:
- "Published"
donation:
---

![educate_on_arm](../../images/Educate_on_Arm_banner.png)

## Description

**Why is this important?**

The vision of Edge AI compute is to embed intelligent, low-power sensing, perception, and decision systems everywhere (in homes, wearables, and infrastructure) so devices can react to subtle cues, adapt to context, and wake higher-power systems only when needed. Rather than sending everything to the cloud or running full-scale models continuously, this Edge AI system operates as a layered hierarchy:

- A low-power always-on AI model continuously monitors sensory inputs (audio, motion, video) to detect triggers or anomalies.
- When a trigger is detected, it wakes a more capable processor (e.g. a Cortex-A core running a rich OS such as Linux) to carry out further tasks: high-value inference, interaction, or control. This could also involve connecting to other IoT devices or to a Neoverse cloud instance.

This architecture is key to bridging the gap between battery-constrained devices and rich AI services, making systems smarter, more efficient, and more responsive without draining resources.
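The layered hierarchy described above can be sketched as a simple dispatcher. This is a minimal plain-Python illustration, not Arm SDK code: in a real system the cheap detector would run on the M55 + U85 island and the heavy handler on a Cortex-A core, and the scoring function and handler names here are hypothetical stand-ins.

```python
# Sketch of the always-on / wake-up hierarchy: a cheap detector runs on
# every input frame, and an expensive handler runs only when triggered.

def tiny_detector(frame, threshold=0.8):
    """Stand-in for the low-power always-on model: fires on a high score."""
    score = sum(frame) / len(frame)  # hypothetical trigger score
    return score >= threshold

def heavy_handler(frame):
    """Stand-in for the Cortex-A workload (LLM, cloud call, device control)."""
    return f"processed {len(frame)} samples"

def run_pipeline(frames):
    results = []
    for frame in frames:
        if tiny_detector(frame):                   # cheap path, every frame
            results.append(heavy_handler(frame))   # expensive path, rare
    return results

print(run_pipeline([[0.1, 0.2], [0.9, 0.95], [0.3, 0.1]]))
# → ['processed 2 samples']
```

The point of the split is the same as in the hardware design: the expensive path is invoked only on the rare frames that pass the cheap test.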

**Project Summary**

Using equipment such as the Alif Ensemble development kit (e.g. the E6/E8, which include Cortex-A, Cortex-M55, and Ethos-U85 cores, or the E4 (M55 + U85) plus a Raspberry Pi for the Cortex-A) and the ExecuTorch framework, build an Edge AI prototype that implements:

1. A "wake-up" path: deploy a TOSA-compliant optimized model on the Cortex-M55 + Ethos-U85 pair to continuously monitor sensory signals (audio, motion, video) for wake-words, anomalies, or triggers.
2. A subsequent workload path: when a trigger is detected, activate a Cortex-A core to perform more complex tasks, e.g. run an LLM optimised for CPU inference, connect to and manage other IoT devices, or connect to a Neoverse cloud instance for heavier inference.
3. Evaluation and documentation: measure accuracy, latency, power consumption, and robustness, and compare trade-offs between modalities (audio, video, motion). Demonstrate an end-to-end use case of your choice (e.g. smart assistant, anomaly alert system, gesture control, environment monitoring).

*Note that the Cortex-A32 included on the Alif DevKits is not suitable for LLM inference. If using the onboard core for the project, target cloud/IoT connectivity. For LLM inference, consider connecting a Raspberry Pi 5 or similar.*

Example: use a microphone input to detect "Hey Arm". After wake-up, launch an optimised LLM on a Raspberry Pi Cortex-A core to answer questions or control local devices.

You are free to mix and match sensors, modalities, and tasks, as long as the core architecture (wake on M55/U85, main task on A) is preserved.

Many of these DevKits also come with an additional Ethos-U55 NPU onboard. Feel free to be creative and distribute different tasks across the different NPUs: what use cases and applications can you achieve?

## What will you use?

You should either be familiar with, or willing to learn about, the following:

- Programming: Python, C++, Embedded C
- ExecuTorch, plus knowledge of model quantization, pruning, and conversion; use of the Vela compiler and TOSA
- Edge/embedded development: bare-metal or RTOS (e.g. Zephyr), and embedded Linux (e.g. Yocto) or Raspberry Pi OS
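Model quantization, listed above, is worth understanding concretely before reaching for the tooling. Below is a minimal sketch of symmetric per-tensor int8 quantization in plain Python; it is illustrative only, and a real deployment would use the quantization flows in ExecuTorch or LiteRT rather than hand-rolled code.

```python
# Symmetric int8 quantization: map float weights to integer codes in
# [-127, 127] using a single per-tensor scale, as NPU toolchains commonly do.

def quantize_int8(weights):
    scale = max(abs(w) for w in weights) / 127.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize_int8(q, scale):
    return [v * scale for v in q]

weights = [0.5, -1.27, 0.02, 1.0]
q, scale = quantize_int8(weights)
print(q)                            # → [50, -127, 2, 100]
print(dequantize_int8(q, scale))    # close to the original weights
```

The largest-magnitude weight pins the scale, and every other weight is rounded to the nearest representable step; the rounding error is what accuracy evaluation in step 3 of the project is meant to quantify.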

## Resources from Arm and our partners

- Arm Developer: [Edge AI](https://developer.arm.com/edge-ai)
- Learning Path: [Navigating Machine Learning with Ethos-U processors](https://learn.arm.com/learning-paths/microcontrollers/nav-mlek/)
- Repository: [AI on Arm course](https://github.com/arm-university/AI-on-Arm)
- Example Board: [Alif Ensemble DevKit E8](https://www.keil.arm.com/boards/alif-semiconductor-devkit-e8-gen-1-2558a7b/features/)
- PyTorch Blog: [ExecuTorch support for Ethos-U85](https://pytorch.org/blog/pt-executorch-ethos-u85/)

## Support Level

This project is designed to be self-serve, but comes with the opportunity for some community support from Arm Ambassadors, who are part of the Arm Developer Program. If you are not already part of our program, [click here to join](https://www.arm.com/resources/developer-program?#register).

## Benefits

Standout project contributions will result in digital badges for CV building, recognised by Arm Talent Acquisition. We are currently discussing with national agencies the potential for funding streams for Arm Developer Labs projects, which would flow to you, not us.

To receive the benefits, you must show us your project through our [online form](https://forms.office.com/e/VZnJQLeRhD). Please do not include any confidential information in your contribution. Additionally, if you are affiliated with an academic institution, please ensure you have the right to share your material.
---
title: "Ethos-U85 NPU Applications with TOSA Model Explorer: Exploring Next-Gen Edge AI Inference"
description: "Push the limits of Edge AI by deploying the heaviest inference applications possible on Ethos-U85. Students will explore transformer-based and TOSA-optimized workloads that demonstrate the performance levels achievable on the next generation of Ethos NPUs."
subjects:
- "ML"
- "Performance and Architecture"
requires-team:
- "No"
platform:
- "IoT"
- "Embedded and Microcontrollers"
- "AI"
sw-hw:
- "Software"
- "Hardware"
support-level:
- "Self-Service"
- "Arm Ambassador Support"
publication-date: 2025-11-27
license:
status:
- "Published"
donation:
---

![educate_on_arm](../../images/Educate_on_Arm_banner.png)

## Description

**Why is this important?**

The Arm Ethos-U85 NPU represents a major leap in bringing *heavy inference* to constrained embedded systems. With full transformer operator support, expanded MAC throughput, and native TOSA compatibility, the Ethos-U85 enables developers to deploy models and workloads that were previously too intensive for MCU-class devices.

This project challenges you to explore the boundaries of what is possible on the Ethos-U85. The goal is to demonstrate inference performance and model complexity that is now achievable thanks to its architectural improvements and transformer acceleration capabilities.

[Ethos-U85 Launch](https://newsroom.arm.com/blog/ethos-u85)

**Project Summary**

Using hardware such as the Alif Ensemble E4/E6/E8 DevKits (all of which include the Ethos-U85), a comparable platform, or the Arm Corstone-320 Fixed Virtual Platform, your task is to design and benchmark an advanced edge inference application that exploits the Ethos-U85's compute and transformer capabilities.

Your project should include:

1. Model Deployment and Optimization

Select a computationally intensive model, ideally transformer-based or multi-branch convolutional, and deploy it on the Ethos-U85 using:

- The TOSA Model Explorer extension to inspect and adapt unsupported or experimental models for TOSA compliance.
- The Vela compiler for optimization.

These tools can be used to:

- Convert and visualize model graphs in TOSA format.
- Identify unsupported operators.
- Modify or substitute layers for compatibility using the Flatbuffers schema before re-exporting.
- Run Vela for optimized compilation targeting the Ethos-U85.
2. Application Demonstration

Implement a working example that highlights the Ethos-U85's strengths in real-world inference. Possible categories include:

- Transformers on Edge: lightweight BERT, ViT, or audio transformers (e.g. speech or sound-event classification).
- High-resolution Vision: semantic segmentation, object detection on large input sizes, or multi-head perception networks.
- Multi-modal Fusion: combining audio, image, or sensor streams for contextual understanding.

3. Analysis and Benchmarking

Report quantitative results on:

- Inference latency, throughput (FPS or tokens/s), and memory footprint.
- Power efficiency under load (optional).
- Comparative performance versus the Ethos-U55/U65 (use available benchmarks for reference, or utilise the other Ethos-U NPUs provided in the Alif DevKits).
- The effect of TOSA optimization: demonstrate measurable improvements from graph conversion and operator fusion.
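As a starting point for the benchmarking step, latency and throughput can be summarised with a small harness like the following. This is a plain-Python sketch; `run_inference` is a hypothetical stand-in for however you actually invoke the model on the NPU, and the warm-up/run counts are arbitrary defaults.

```python
import statistics
import time

def benchmark(run_inference, n_warmup=3, n_runs=20):
    """Return median latency (s) and throughput (inferences/s) for a callable."""
    for _ in range(n_warmup):          # warm-up runs, excluded from stats
        run_inference()
    latencies = []
    for _ in range(n_runs):
        start = time.perf_counter()
        run_inference()
        latencies.append(time.perf_counter() - start)
    median = statistics.median(latencies)
    return {"median_latency_s": median, "throughput_per_s": 1.0 / median}

# Example with a dummy workload standing in for the model:
print(benchmark(lambda: sum(range(10_000))))
```

Reporting the median rather than the mean keeps one-off scheduling hiccups from distorting the numbers; for the write-up, pair these figures with memory footprint and (optionally) measured power draw.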
---

## What kind of projects should you target?

To clearly demonstrate the leap from Ethos-U55/U65 to U85, choose projects that meet at least one of the following criteria:

- Transformer-heavy architectures: e.g. attention blocks, transformer encoders, ViTs, or hybrid CNN + transformer models.
  - *Example:* an audio event detection transformer that must process longer sequences or higher-resolution spectrograms.
- High-resolution or multi-branch networks: models with high input dimensionality or multiple processing paths that saturate NPU throughput.
  - *Example:* 512×512 semantic segmentation or multi-object detection.
- Dense post-processing or large fully connected layers: cases where U55/U65 memory limits or MAC bandwidth previously restricted performance.
  - *Example:* large MLP heads or transformer token mixers.
- Multi-modal pipelines: combining multiple sensor inputs (e.g. image + IMU + audio) where the NPU must maintain concurrency or shared intermediate representations.

The Ethos-U85 is ideal for projects where model performance is constrained by attention layers, large activations, or operator types that previously required fallback to the CPU. Use the Ethos-U85 to eliminate those fallbacks and achieve full-NPU execution of advanced topologies.

---

## What will you use?

You should be familiar with, or willing to learn about:

- Programming: Python, C/C++
- ExecuTorch or TensorFlow Lite (Micro/LiteRT)
- Techniques for optimising AI models for the edge (quantization, pruning, etc.)
- Optimization tools:
  - TOSA Model Explorer
  - .tflite to .tosa converter (if using TensorFlow rather than ExecuTorch)
  - Vela compiler for Ethos-U
- Bare-metal or RTOS (e.g. Zephyr)
---

## Resources from Arm and our partners

- Arm Developer: [Edge AI](https://developer.arm.com/edge-ai)
- Learning Path: [Navigating Machine Learning with Ethos-U processors](https://learn.arm.com/learning-paths/microcontrollers/nav-mlek/)
- Repository: [AI on Arm course](https://github.com/arm-university/AI-on-Arm)
- Example Board: [Alif Ensemble DevKit E8](https://www.keil.arm.com/boards/alif-semiconductor-devkit-e8-gen-1-2558a7b/features/)
- Documentation: [TOSA Specification](https://www.mlplatform.org/tosa/), [TOSA Model Explorer](https://github.com/arm/tosa-adapter-model-explorer), and [TOSA Reference Model](https://gitlab.arm.com/tosa/tosa-reference-model)
- PyTorch Blog: [ExecuTorch support for Ethos-U85](https://pytorch.org/blog/pt-executorch-ethos-u85/)

---

## Support Level

This project is designed to be self-serve, but comes with the opportunity for some community support from Arm Ambassadors, who are part of the Arm Developer Program. If you are not already part of our program, [click here to join](https://www.arm.com/resources/developer-program?#register).

## Benefits

Standout project contributions will result in digital badges for CV building, recognised by Arm Talent Acquisition. We are currently discussing with national agencies the potential for funding streams for Arm Developer Labs projects, which would flow to you, not us.

To receive the benefits, you must show us your project through our [online form](https://forms.office.com/e/VZnJQLeRhD). Please do not include any confidential information in your contribution. Additionally, if you are affiliated with an academic institution, please ensure you have the right to share your material.

Projects/Projects/NGP.md

---
title: "Game development using Arm Neural Graphics with Unreal Engine"
description: "Build a playable Unreal Engine 5 game demo that utilises Arm's Neural Graphics SDK UE plugin for features such as Neural Super Sampling (NSS). Showcase near-identical image quality at lower rendering resolution by driving neural rendering directly in the graphics pipeline."
subjects:
- "ML"
- "Gaming"
- "Libraries"
- "Graphics"
requires-team:
- "No"
platform:
- "Mobile, Graphics, and Gaming"
- "Laptops and Desktops"
- "AI"
sw-hw:
- "Software"
support-level:
- "Self-Service"
- "Arm Ambassador Support"
publication-date: 2025-11-27
license:
status:
- "Published"
donation:
---

![educate_on_arm](../../images/Educate_on_Arm_banner.png)

## Description

### Why is this important?

Arm neural technology is an industry first: it adds dedicated neural accelerators to Arm GPUs, bringing PC-quality, AI-powered graphics to mobile for the first time, and laying the foundation for future on-device AI innovation.

Developers can start building now with the industry's first open development kit for neural graphics, which includes an Unreal Engine plugin, emulators, and open models on GitHub and Hugging Face.

[Arm Neural Technology Announcement](https://newsroom.arm.com/news/arm-announces-arm-neural-technology)

Neural Super Sampling (NSS) is Arm's mobile-optimized, AI-driven graphics upscaler that renders at a lower resolution and reconstructs a higher-quality output image. It builds on a prior Arm solution, Accuracy Super Resolution (ASR), and is supported by an Unreal Engine plugin, streamlining its use within a typical industry game development workflow.

Future SDK support will be provided for Neural Frame Rate Upscaling (NFRU), so feel free to extend this project with NFRU when it is released.

### Project Summary

Create a small game scene utilising the Arm Neural Graphics UE plugin to demonstrate:

- **Near-identical visuals at lower rendering resolution** (render low → upscale with NSS)

Document your progress and findings, and consider alternative applications of the neural technology within game development.

Try different environments and objects. For example:

- Daytime vs night
- Urban city, jungle forest, ocean floor, alien planet, building interiors
- Complex lighting and shadows
- NPCs with detailed clothing, faces, and hair; include animations

Make your scenes dynamic with particle effects, shadows, physics, and motion.
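To see why rendering low and upscaling pays off, a quick back-of-the-envelope calculation helps. This is plain Python for illustration; the per-axis scale factors are illustrative choices, not an NSS specification.

```python
# Shaded-pixel savings from rendering at a lower internal resolution
# and upscaling to the display resolution.

def shaded_pixel_ratio(out_w, out_h, scale):
    """Fraction of display pixels actually shaded when rendering at
    (out_w/scale) x (out_h/scale) and upscaling by `scale` per axis."""
    render_pixels = (out_w / scale) * (out_h / scale)
    return render_pixels / (out_w * out_h)

# Upscaling 1.5x per axis: the GPU shades only ~44% of the pixels.
print(shaded_pixel_ratio(1920, 1080, 1.5))
# Upscaling 2x per axis: only 25% are shaded.
print(shaded_pixel_ratio(1920, 1080, 2.0))  # → 0.25
```

The savings scale with the square of the per-axis factor, which is why even modest upscaling frees substantial GPU budget for the lighting, particles, and animation work suggested above.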

---

## Pre-requisites

- Laptop/PC/mobile for Android Unreal Engine game development
- Willingness to learn about game development and graphics, and the increasing use of AI in these fields

---

## Resources from Arm and partners

- Get Started Blog: [Start experimenting with NSS today](https://developer.arm.com/community/arm-community-blogs/b/mobile-graphics-and-gaming-blog/posts/how-to-access-arm-neural-super-sampling)
- Deep Dive Blog: [How NSS works](https://developer.arm.com/community/arm-community-blogs/b/mobile-graphics-and-gaming-blog/posts/how-arm-neural-super-sampling-works)
- Arm Developer: [Neural Graphics Development Kit](https://developer.arm.com/mobile-graphics-and-gaming/neural-graphics)
- Learning Path: [Fine-tuning neural graphics models with Model Gym](https://learn.arm.com/learning-paths/mobile-graphics-and-gaming/model-training-gym/)
- Learning Path: [Neural Super Sampling in Unreal Engine](https://learn.arm.com/learning-paths/mobile-graphics-and-gaming/nss-unreal/)
- Learning Path: [Getting started with Arm Accuracy Super Resolution (Arm ASR)](https://learn.arm.com/learning-paths/mobile-graphics-and-gaming/get-started-with-arm-asr/)
- Unreal Engine Intro by Epic Games: [Understanding the basics](https://dev.epicgames.com/documentation/en-us/unreal-engine/understanding-the-basics-of-unreal-engine)
- Repo: [Arm Neural Graphics SDK](https://github.com/arm/neural-graphics-sdk-for-game-engines)
- Repo: [Arm Neural Graphics Model Gym](https://github.com/arm/neural-graphics-model-gym)
- Documentation: [Arm Neural Graphics SDK for Game Engines Developer Guide](https://developer.arm.com/documentation/111167/latest/)

---

## Support Level

This project is designed to be self-serve, but comes with the opportunity for some community support from Arm Ambassadors, who are part of the Arm Developer Program. If you are not already part of our program, [click here to join](https://www.arm.com/resources/developer-program?#register).

## Benefits

Standout project contributions will result in digital badges for CV building, recognised by Arm Talent Acquisition. We are currently discussing with national agencies the potential for funding streams for Arm Developer Labs projects, which would flow to you, not us.

To receive the benefits, you must show us your project through our [online form](https://forms.office.com/e/VZnJQLeRhD). Please do not include any confidential information in your contribution. Additionally, if you are affiliated with an academic institution, please ensure you have the right to share your material.
