Arm backend: Document Ethos-U memory modes and add Ethos-U porting guide #14144

gggekov · 2025-09-10T11:46:23Z

Memory modes: The Shared_Sram, Sram_Only and Dedicated_Sram memory modes are specified in the compile spec and are tightly coupled with how the ethos-U scratch buffer and NN should be placed in the embedded application. Different memory modes profoundly impact the performance and memory footprint of the application and it is important to use the NPU in the most suitable memory mode for optimal performance.

Porting guide: A document explaining the key steps to port a new hardware target with an Ethos-U NPU to the Ethos-U backend in ExecuTorch

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218

pytorch-bot · 2025-09-10T11:46:26Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14144

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 18 New Failures, 1 Cancelled Job, 24 Unrelated Failures

As of commit f97617c with merge base a89b858 ():

NEW FAILURES - The following jobs have failed:

pull / test-moshi-linux / linux-job (gh)
RuntimeError: Command docker exec -t 516e54b991046d629ec72bf0b1cd9dee9227ff23dc7cc5e556f03446c5444516 /exec failed with exit code 1
pull / unittest / macos / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
pull / unittest-editable / macos / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / test-models-macos-coreml (dl3) / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / test-models-macos-coreml (efficient_sam) / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / test-models-macos-coreml (emformer_join) / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / test-models-macos-coreml (ic4) / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / test-models-macos-coreml (resnet50) / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / test-models-macos-coreml (vit) / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / test-models-macos-coreml (w2l) / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / test-models-macos-cpu (emformer_join, xnnpack-quantization-delegation) / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / test-models-macos-cpu (llama, portable) / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / test-models-macos-cpu (llama3_2_vision_encoder, portable) / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / test-models-macos-cpu (vit, xnnpack-quantization-delegation) / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / test-models-macos-mps / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / test-selective-build-macos (cmake) / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / test-static-llama-ane / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?
trunk / unittest-release / macos / macos-job (gh)
Can't find 'action.yml', 'action.yaml' or 'Dockerfile' under '/Users/ec2-user/runner/_work/executorch/executorch/test-infra/.github/actions/check-disk-space'. Did you forget to run actions/checkout before running your local action?

CANCELLED JOB - The following job was cancelled. Please retry:

pull / test-models-linux (linear, portable, linux.2xlarge) / linux-job (gh)

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

pull / test-setup-linux-gcc / linux-job (gh) (similar failure)
##[error]The operation was canceled.
pull / unittest-buck / macos / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-custom-ops-macos (cmake) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-llama-runner-mac (fp32, coreml) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-llama-runner-mac (fp32, mps) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-llama-runner-mac (fp32, xnnpack+custom+quantize_kv) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-llama-torchao-lowbit / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-models-macos-coreml (edsr) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-models-macos-coreml (emformer_transcribe) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-models-macos-coreml (ic3) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-models-macos-coreml (mobilebert) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-models-macos-coreml (mv2) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-models-macos-coreml (mv3) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-models-macos-cpu (efficient_sam, portable) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-models-macos-cpu (ic4, xnnpack-quantization-delegation) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-models-macos-cpu (llama2, xnnpack-quantization-delegation) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-models-macos-cpu (mobilebert, xnnpack-quantization-delegation) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-models-macos-cpu (mv3, portable) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-models-macos-cpu (mv3, xnnpack-quantization-delegation) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-models-macos-cpu (resnet50, xnnpack-quantization-delegation) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-models-macos-cpu (w2l, xnnpack-quantization-delegation) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / test-binary-size-linux-gcc / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-openvino-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-samsung-models-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

digantdesai

Didn't read too carefully, but love the diagrams :)
check the rendered docs on the PR before merging for formatting related issues mainly.

zingo · 2025-09-10T13:16:47Z

@GeorgeARM there seem to be a link problem see https://github.com/pytorch/executorch/actions/runs/17612710981/job/50038191324?pr=14144
This job need to be working for us to be able to merge.

Running command: docker exec -t 9a40a86ad68e5a1276c257b270a356fd5e3207fd65f96ad6cec20bfeabc4c560 /exec
++ '[' pull_request = pull_request ']'
++ echo 4d6209b 4a85695

./scripts/lint_xrefs.sh 4d6209b 4a85695

docs/source/backends-arm-ethos-u.md:
FAIL examples/arm/ethos-u-porting-guide.md

examples/arm/ethos-u-porting-guide.md:
OK ../../docs/source/backends-arm-ethos-u.md

echo
echo 'Xref lint failed.'
Xref lint failed.
echo 'If this is a transient outage, you can bypass it by adding the skip-xref-lint label to your PR.'
If this is a transient outage, you can bypass it by adding the skip-xref-lint label to your PR.
echo 'Or add @lint-ignore somewhere on the same line as the reference you want to skip checking.'
Or add @lint-ignore somewhere on the same line as the reference you want to skip checking.

zingo · 2025-09-10T13:35:14Z

You seem to be able to reproduce and run on your maching if you run
./scripts/lint_xrefs.sh
on you patch (it will check all files)

gggekov · 2025-09-10T14:54:04Z

Thanks @digantdesai @zingo . The documentation renders correctly in https://docs-preview.pytorch.org/pytorch/executorch/14144/backends-arm-ethos-u.html
with one exception.

At the end of the md document, i am referring an absolute path that doesn't exist yet.
...the [Ethos-U porting guide](https://github.com/pytorch/executorch/blob/main/examples/arm/ethos-u-porting-guide.md).

When the PR gets merged, the https://github.com/pytorch/executorch/blob/main/examples/arm/ethos-u-porting-guide.md webpage will be created and I assume the hyperlink will work correctly. Is this definitely true ? If yes, then I believe that is the reason for the failing CI test also.

Memory modes: The Shared_Sram, Sram_Only and Dedicated_Sram memory modes are specified in the compile spec and are tightly coupled with how the ethos-U scratch buffer and NN should be placed in the embedded application. Different memory modes profoundly impact the performance and memory footprint of the application and it is important to use the NPU in the most suitable memory mode for optimal performance. Porting guide: A document explaining the key steps to port a new hardware target with an Ethos-U NPU to the Ethos-U backend in ExecuTorch Change-Id: I925b8fb5dfb536f5af663cebe000fbb755955fcf

swolchok · 2025-09-11T16:25:11Z