Commit 00ca48b

script: refine build-run-ggmlhexagon-android.sh
1 parent 973948b commit 00ca48b

1 file changed: +11 -13 lines changed

scripts/build-run-ggmlhexagon-android.sh

Lines changed: 11 additions & 13 deletions
@@ -62,27 +62,24 @@ GGMLDSP_RELEASE_DATE=20250627
 
 ######## part-2: contents in this part can be modified ########
 
-PROMPT_STRING="every day of your life, it is important to take the time to smell the roses — to appreciate the experiences that lead to happiness. This is part of being truly happy.Happiness is a state of mind. It starts with accepting where you are, knowing where you are going and planning to enjoy every moment along the way. You know how to be happy, and feel that you have enough time or money or love or whatever you need to achieve your goals. And just feeling that you have enough of everything means that you do indeed have enough.You have to choose to be happy, and focus upon being happy, in order to be happy. If you instead focus upon knowing that you will be happy if you achieve something, you will never be happy, as you have not learned to smell the roses. The irony is that when you are happy, you are inevitably more productive, and far more likely to achieve what everything-seekers are seeking. you will never be happy, as you have not learned to smell the roses. The irony is that when you are happy, you are inevitably more productive, and far more likely to achieve what everything-seekers are seeking."
 PROMPT_STRING="introduce the movie Once Upon a Time in America briefly.\n"
 
-#following two customized models only for self-test
-#20.4 MiB in models/t5-very-small-random-F32.gguf
-TEST_MODEL_NAME=/sdcard/t5-very-small-random-F32.gguf
-#1.0 GiB, can be downloadded via this script in function check_prebuilt_models()
-TEST_MODEL_NAME=/sdcard/t5-277M-F32.gguf
-
+#the following LLM models has verified(works fine) with Hexagon-cDSP backend on a Snapdragon 8Elite based Android phone although inference performance is not good at the moment
+#for llama-cli, 6.9 GiB, can be downloadded via this script in function check_prebuilt_models()
+TEST_MODEL_NAME=/sdcard/gemma-3n-E4B-it-Q8_0.gguf
+#for llama-cli, 8.2 GiB
+#TEST_MODEL_NAME=/sdcard/Qwen3-8B-Q8_0.gguf
+#for llama-cli, 4.0 GiB
+#TEST_MODEL_NAME=/sdcard/Qwen3-4B-Q8_0.gguf
+#for llama-cli, 3.9 GiB
+#TEST_MODEL_NAME=/sdcard/gemma-3-4b-it-Q8_0.gguf
 #for llama-cli, 1.6 GiB, can be downloadded via this script in function check_prebuilt_models()
 TEST_MODEL_NAME=/sdcard/MiniCPM4-0.5B-F32.gguf
 #for llama-cli, 1.1 GiB, will be downloaded automatically via this script when running this script at the first time
 TEST_MODEL_NAME=/sdcard/qwen1_5-1_8b-chat-q4_0.gguf
 #for llama-cli, 4.5 GiB, will be downloadded automatically via this script when running this script at the first time
 TEST_MODEL_NAME=/sdcard/gemma-3n-E2B-it-Q8_0.gguf
 
-#other LLM models for llama-cli
-#TEST_MODEL_NAME=/sdcard/Qwen3-8B-Q8_0.gguf
-#TEST_MODEL_NAME=/sdcard/Qwen3-4B-Q8_0.gguf
-#TEST_MODEL_NAME=/sdcard/gemma-3-4b-it-Q8_0.gguf
-
 #for llama-bench, 4.5 GiB, will be downloadded automatically via this script when running this script at the first time
 GGUF_MODEL_NAME=/sdcard/gemma-3n-E2B-it-Q8_0.gguf
 #for llama-bench, 1.12 GiB, will be downloadded automatically via this script when running this script at the first time
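
Note on the hunk above: it leaves several uncommented TEST_MODEL_NAME assignments in a row. In a POSIX shell each assignment overwrites the previous one, so only the last one takes effect. A minimal sketch using the values from the hunk, assuming nothing later in the script reassigns the variable (the echo line is illustrative only and is not part of the original script):

```shell
#!/bin/sh
# Values copied from the hunk; each assignment overwrites the one before it.
TEST_MODEL_NAME=/sdcard/gemma-3n-E4B-it-Q8_0.gguf
TEST_MODEL_NAME=/sdcard/MiniCPM4-0.5B-F32.gguf
TEST_MODEL_NAME=/sdcard/qwen1_5-1_8b-chat-q4_0.gguf
TEST_MODEL_NAME=/sdcard/gemma-3n-E2B-it-Q8_0.gguf

# Only the final assignment is visible to the rest of the script.
echo "llama-cli model: ${TEST_MODEL_NAME}"
```

To test a different model such as gemma-3n-E4B, its assignment would have to be the last uncommented one.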
@@ -107,7 +104,7 @@ GGUF_MODEL_NAME=/sdcard/qwen1_5-1_8b-chat-q4_0.gguf
 #HTP_ARCH_VERSION_a=V79
 
 #modify the following two lines to adapt to test phone
-#for simplify workflow, only support v75 and v79, or only support 8Gen3 and 8Elite
+#for simplify workflow, only support v75 and v79, or only support 8Gen3 and 8Elite at the moment
 #v79/8Elite is strongly recommended because:
 #1. sometimes the same dsp codes can running well as expected on Snapdragon 8Elite based phone
 #   but can't works as expected on other Snapdragon based phone(e.g. 8Gen3).
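
Note on the hunk above: the comments restrict the script to two targets, v75 and v79, listed alongside 8Gen3 and 8Elite. One way to make that restriction explicit is a small lookup that rejects anything else. This is a sketch only: the function name htp_arch_for_soc and the variable SKETCH_HTP_ARCH are hypothetical and do not appear in the original script, and pairing 8Gen3 with v75 is read from the order of the comment rather than stated outright.

```shell
#!/bin/sh
# Hypothetical helper, not part of the original script:
# maps a SoC nickname to the Hexagon HTP architecture version.
htp_arch_for_soc() {
    case "$1" in
        8Gen3)  echo "v75" ;;
        8Elite) echo "v79" ;;
        *)
            echo "unsupported SoC: $1 (only 8Gen3 and 8Elite are handled)" >&2
            return 1
            ;;
    esac
}

# SKETCH_HTP_ARCH is an illustrative name, not a variable from the script.
SKETCH_HTP_ARCH=$(htp_arch_for_soc 8Elite) || exit 1
echo "HTP arch: ${SKETCH_HTP_ARCH}"
```

Failing early on an unknown SoC avoids building DSP code for an architecture the script has not been verified on.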
@@ -436,6 +433,7 @@ function check_prebuilt_models()
 check_and_download_model qwen1_5-1_8b-chat-q4_0.gguf https://huggingface.co/Qwen/Qwen1.5-1.8B-Chat-GGUF/resolve/main/qwen1_5-1_8b-chat-q4_0.gguf
 #check_and_download_model MiniCPM4-0.5B-F32.gguf https://huggingface.co/zhouwg/kantv/resolve/main/MiniCPM4-0.5B-F32.gguf
 #check_and_download_model t5-277M-F32.gguf https://huggingface.co/zhouwg/kantv/resolve/main/t5-277M-F32.gguf
+#check_and_download_model gemma-3n-E4B-it-Q8_0.gguf https://huggingface.co/ggml-org/gemma-3n-E4B-it-GGUF/resolve/main/gemma-3n-E4B-it-Q8_0.gguf
 check_and_download_model gemma-3n-E2B-it-Q8_0.gguf https://huggingface.co/ggml-org/gemma-3n-E2B-it-GGUF/resolve/main/gemma-3n-E2B-it-Q8_0.gguf
 
 set -e
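
Note on the hunk above: check_and_download_model is called with a file name and a URL, but its body is not part of this diff. The sketch below shows one plausible "download only if missing" shape under stated assumptions: it works on a local directory, and the names MODEL_DIR, fetch_url and check_and_download_model_sketch are hypothetical. The real function may behave differently, for example by checking the file on the phone.

```shell
#!/bin/sh
# Hypothetical sketch; the real check_and_download_model body is not shown in the diff.
# MODEL_DIR and fetch_url are illustrative names.
MODEL_DIR=${MODEL_DIR:-./models}

# Download URL $1 to file $2, using curl if available and wget otherwise.
fetch_url() {
    if command -v curl >/dev/null 2>&1; then
        curl -fL -o "$2" "$1"
    else
        wget -O "$2" "$1"
    fi
}

check_and_download_model_sketch() {
    name=$1
    url=$2
    target="${MODEL_DIR}/${name}"
    if [ -f "$target" ]; then
        echo "${name} already present, skipping download"
        return 0
    fi
    mkdir -p "$MODEL_DIR" || return 1
    # Download under a temporary name so an interrupted transfer
    # is not mistaken for a complete model on the next run.
    if fetch_url "$url" "${target}.part"; then
        mv "${target}.part" "$target"
    else
        rm -f "${target}.part"
        return 1
    fi
}

# Example call (commented out so that sourcing this sketch downloads nothing):
# check_and_download_model_sketch gemma-3n-E2B-it-Q8_0.gguf https://huggingface.co/ggml-org/gemma-3n-E2B-it-GGUF/resolve/main/gemma-3n-E2B-it-Q8_0.gguf
```

With this shape, enabling the gemma-3n-E4B line added by the commit would cost one download on the first run and nothing on later runs.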