You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Demo app android xnnpack quick-fix for the bookmark link (#5698)
Demo app android xnnpack quick-fix for the bookmark link (#5642)
Summary:
Pull Request resolved: #5642
quick fix for the in page link
Reviewed By: kirklandsign
Differential Revision: D63400245
fbshipit-source-id: 2fe6c71117851b22dd80654f9c19a2c3e0036a03
(cherry picked from commit 6e9efa1)
Co-authored-by: Chester Hu <[email protected]>
Copy file name to clipboardExpand all lines: examples/demo-apps/android/LlamaDemo/docs/delegates/xnnpack_README.md
+2Lines changed: 2 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,5 +1,7 @@
1
1
# Building ExecuTorch Android Demo App for Llama running XNNPack
2
2
3
+
**[UPDATE - 09/25]** We have added support for running [Llama 3.2 models](#for-llama-32-1b-and-3b-models) on the XNNPack backend. We currently support inference on their original data type (BFloat16). We have also added instructions to run [Llama Guard 1B models](#for-llama-guard-1b-models) on-device.
4
+
3
5
This tutorial covers the end to end workflow for building an android demo app using CPU on device via XNNPack framework.
4
6
More specifically, it covers:
5
7
1. Export and quantization of Llama and Llava models against the XNNPack backend.
0 commit comments