Skip to content

Commit 68508e9

Browse files
committed
docs(readme): update Obtain the model section
1 parent a75662c commit 68508e9

File tree

1 file changed

+4
-0
lines changed

1 file changed

+4
-0
lines changed

README.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -31,6 +31,7 @@ Add proguard rule if it's enabled in project (android/app/proguard-rules.pro):
3131

3232
You can search HuggingFace for available models (Keyword: [`GGUF`](https://huggingface.co/search/full-text?q=GGUF&type=model)).
3333

34+
<<<<<<< Updated upstream
3435
For create a GGUF model manually, for example in Llama 2:
3536

3637
Download the Llama 2 model
@@ -61,6 +62,9 @@ make quantize
6162
# quantize the model to 4-bits (using q4_0 method)
6263
./quantize ./models/7B/ggml-model-f16.gguf ./models/7B/ggml-model-q4_0.gguf q4_0
6364
```
65+
=======
66+
For get a GGUF model or quantize manually, see [`Prepare and Quantize`](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#prepare-and-quantize) section in llama.cpp.
67+
>>>>>>> Stashed changes
6468
6569
## Usage
6670

0 commit comments

Comments
 (0)