Commit 984986e

larryliu0820 authored and facebook-github-bot committed
Add some Llava related stuff (#5659)
Summary: As titled

Pull Request resolved: #5659

Reviewed By: mergennachin

Differential Revision: D63412816

Pulled By: larryliu0820

fbshipit-source-id: eb9b35d52395a2ed9de342c3b4745c7aa694ce2b
1 parent 88c2407 commit 984986e

1 file changed: +3 -3 lines changed

examples/demo-apps/android/LlamaDemo/docs/delegates/xnnpack_README.md

Lines changed: 3 additions & 3 deletions
@@ -1,4 +1,4 @@
-# Building ExecuTorch Android Demo App for Llama running XNNPack
+# Building ExecuTorch Android Demo App for Llama/Llava running XNNPack
 
 **[UPDATE - 09/25]** We have added support for running [Llama 3.2 models](#for-llama-32-1b-and-3b-models) on the XNNPack backend. We currently support inference on their original data type (BFloat16). We have also added instructions to run [Llama Guard 1B models](#for-llama-guard-1b-models) on-device.
 
@@ -8,7 +8,7 @@ More specifically, it covers:
 2. Building and linking libraries that are required to inference on-device for Android platform.
 3. Building the Android demo app itself.
 
-Phone verified: OnePlus 12. Samsung S23 (Llama only), Samsung S24+ (Llama only), Pixel 8 Pro (Llama only)
+Phone verified: OnePlus 12, OnePlus 9 Pro. Samsung S23 (Llama only), Samsung S24+ (Llama only), Pixel 8 Pro (Llama only)
 
 
 ## Known Issues
@@ -101,7 +101,7 @@ python -m examples.models.llama2.export_llama --checkpoint <checkpoint.pth> --pa
 
 You may wonder what the ‘--metadata’ flag is doing. This flag helps export the model with proper special tokens added that the runner can detect EOS tokens easily.
 
-* Convert tokenizer for Llama 2
+* Convert tokenizer for Llama 2 and Llava (skip this for Llama 3.x)
 ```
 python -m extension.llm.tokenizer.tokenizer -t <tokenizer.model> -o tokenizer.bin
 ```
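
The tokenizer bullet changed in the last hunk amounts to a small rule: convert `tokenizer.model` to `tokenizer.bin` for Llama 2 and Llava, and skip the step for Llama 3.x. A minimal Python sketch of that rule follows. The function name `tokenizer_convert_command` and the family labels (`llama2`, `llava`, `llama3...`) are hypothetical and not part of ExecuTorch; only the module path and the `-t`/`-o` flags are taken from the README command above.

```python
import shlex

# Families whose tokenizer.model must be converted, per the README bullet.
# These labels are illustrative assumptions, not ExecuTorch identifiers.
NEEDS_CONVERSION = {"llama2", "llava"}


def tokenizer_convert_command(family, tokenizer_model, output="tokenizer.bin"):
    """Return the conversion command as an argv list, or None if it is skipped."""
    key = family.lower()
    if key.startswith("llama3"):
        # The README says to skip conversion for Llama 3.x.
        return None
    if key not in NEEDS_CONVERSION:
        raise ValueError(f"unknown model family: {family!r}")
    # Same invocation as the README: module path, -t input, -o output.
    return [
        "python", "-m", "extension.llm.tokenizer.tokenizer",
        "-t", tokenizer_model,
        "-o", output,
    ]


for family in ("llama2", "llava", "llama3.2"):
    cmd = tokenizer_convert_command(family, "tokenizer.model")
    # Print the shell-ready command, or note that the step is skipped.
    print(family, "->", shlex.join(cmd) if cmd else "skip conversion")
```

The sketch only builds the command; running it (for example with `subprocess.run(cmd, check=True)`) would need to happen from the ExecuTorch repository root so that the `extension.llm.tokenizer.tokenizer` module resolves.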
