Commit 570572b

pytorchbot and Chester Hu authored
Demo app android xnnpack quick-fix for the bookmark link (#5698)
Demo app android xnnpack quick-fix for the bookmark link (#5642)

Summary:
Pull Request resolved: #5642

quick fix for the in page link

Reviewed By: kirklandsign

Differential Revision: D63400245

fbshipit-source-id: 2fe6c71117851b22dd80654f9c19a2c3e0036a03
(cherry picked from commit 6e9efa1)

Co-authored-by: Chester Hu <[email protected]>
1 parent de718e6 commit 570572b

1 file changed (+2, -0 lines changed)

examples/demo-apps/android/LlamaDemo/docs/delegates/xnnpack_README.md

Lines changed: 2 additions & 0 deletions
@@ -1,5 +1,7 @@
 # Building ExecuTorch Android Demo App for Llama running XNNPack
 
+**[UPDATE - 09/25]** We have added support for running [Llama 3.2 models](#for-llama-32-1b-and-3b-models) on the XNNPack backend. We currently support inference on their original data type (BFloat16). We have also added instructions to run [Llama Guard 1B models](#for-llama-guard-1b-models) on-device.
+
 This tutorial covers the end to end workflow for building an android demo app using CPU on device via XNNPack framework.
 More specifically, it covers:
 1. Export and quantization of Llama and Llava models against the XNNPack backend.
