Commit b37d110

Update 4-Prepare-LLaMA-models.md
1 parent 774d6e4 commit b37d110

File tree

1 file changed, +1 -1 lines changed
  • content/learning-paths/smartphones-and-mobile/Build-Llama3-Chat-Android-App-Using-Executorch-And-XNNPACK

content/learning-paths/smartphones-and-mobile/Build-Llama3-Chat-Android-App-Using-Executorch-And-XNNPACK/4-Prepare-LLaMA-models.md

Lines changed: 1 addition & 1 deletion
@@ -45,7 +45,7 @@ python3 -m examples.models.llama.export_llama \
 --group_size 64 -d fp32 \
 --metadata '{"get_bos_id":128000, "get_eos_ids":[128009, 128001, 128006, 128007]}' \
 --embedding-quantize 4,32 \
---output_name="llama3_1B_kv_sdpa_xnn_qe_4_128_1024_embedding_4bit.pte" \
+--output_name="llama3_1B_kv_sdpa_xnn_qe_4_64_1024_embedding_4bit.pte" \
 --max_seq_length 1024
 ```
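
The renamed file appears to encode the `--group_size` value (64) and the `--max_seq_length` value (1024) in its name, which would explain why `4_128` became `4_64`. Assuming that naming convention holds, one way to keep the name and the flags from drifting apart is to build the name from shell variables. The variable names `GROUP_SIZE`, `MAX_SEQ_LENGTH`, and `OUTPUT_NAME` below are hypothetical and do not appear in the learning path; this is a sketch, not the author's method.

```shell
#!/bin/sh
# Hypothetical helper variables; they are not part of the original command.
GROUP_SIZE=64          # same value that is passed to --group_size
MAX_SEQ_LENGTH=1024    # same value that is passed to --max_seq_length

# Assumed convention: ..._qe_4_<group size>_<max seq length>_embedding_4bit.pte
OUTPUT_NAME="llama3_1B_kv_sdpa_xnn_qe_4_${GROUP_SIZE}_${MAX_SEQ_LENGTH}_embedding_4bit.pte"

echo "${OUTPUT_NAME}"
```

The same variables could then replace the literals in the export command (for example `--group_size "${GROUP_SIZE}"` and `--output_name="${OUTPUT_NAME}"`), so a later change to the group size would update the file name automatically.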
