README.md: 28 additions & 2 deletions
@@ -10,12 +10,14 @@ Inference of Meta's [LLaMA](https://arxiv.org/abs/2302.13971) model (and others)
### Recent API changes
-
- [2024 Mar 8] `llama_kv_cache_seq_rm()` returns a `bool` instead of `void`, and new `llama_n_max_seq()` returns the upper limit of acceptable `seq_id` in batches (relevant when dealing with multiple sequences) https://github.com/ggerganov/llama.cpp/pull/5328
+
- [2024 Mar 13] Add `llama_synchronize()` + `llama_context_params.n_ubatch` https://github.com/ggerganov/llama.cpp/pull/6017
+
- [2024 Mar 8] `llama_kv_cache_seq_rm()` returns a `bool` instead of `void`, and new `llama_n_seq_max()` returns the upper limit of acceptable `seq_id` in batches (relevant when dealing with multiple sequences) https://github.com/ggerganov/llama.cpp/pull/5328
- [2024 Mar 4] Embeddings API updated https://github.com/ggerganov/llama.cpp/pull/5796
- [2024 Mar 3] `struct llama_context_params` https://github.com/ggerganov/llama.cpp/pull/5849
### Hot topics
+
- Multi-GPU pipeline parallelism support https://github.com/ggerganov/llama.cpp/pull/6017
- Looking for contributions to add Deepseek support: https://github.com/ggerganov/llama.cpp/issues/5981
Install [termux](https://termux.dev/) on your device and run `termux-setup-storage` to get access to your SD card.
-
Finally, copy the `llama` binary and the model files to your device storage. Here is a demo of an interactive session running on Pixel 5 phone:
+
Finally, copy the built `llama` binaries and the model file to your device storage. Because file permissions on the Android sdcard cannot be changed, copy the executable files to `/data/data/com.termux/files/home/bin`, then run the following commands in Termux to make them executable:
+
+
(This assumes you have pushed the built executable files to `/sdcard/llama.cpp/bin` using `adb push`.)
Download the model [llama-2-7b-chat.Q4_K_M.gguf](https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGUF/blob/main/llama-2-7b-chat.Q4_K_M.gguf), push it to `/sdcard/llama.cpp/`, then move it to `/data/data/com.termux/files/home/model/`.
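The exact commands are not visible in this excerpt, so the following is only a minimal sketch of the copy, permission, and move steps described above. The helper name `install_llama` is hypothetical, and the sketch assumes the binaries sit in a `bin` directory under the source path and the model file sits directly in the source path, as the text describes.

```shell
# Hypothetical helper (not part of llama.cpp).
# install_llama SRC DEST:
#   copies SRC/bin/* into DEST/bin, marks the copies executable,
#   and moves the model file from SRC into DEST/model.
install_llama() {
    src="$1"
    dest="$2"
    model="llama-2-7b-chat.Q4_K_M.gguf"   # model file name from the text above

    # Create the target directories inside the Termux home.
    mkdir -p "$dest/bin" "$dest/model" || return 1

    # Permissions cannot be changed on the sdcard, so copy first, then chmod the copies.
    cp "$src"/bin/* "$dest/bin/" || return 1
    chmod +x "$dest"/bin/* || return 1

    # Move the model next to the binaries.
    mv "$src/$model" "$dest/model/" || return 1
}
```

In Termux, with the paths used in the text, this would be invoked as `install_llama /sdcard/llama.cpp /data/data/com.termux/files/home`. Taking the paths as arguments keeps the sketch usable if the files were pushed elsewhere.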