
Commit 9b9edf5

suggestions from code review.
1 parent 4d26bce commit 9b9edf5

File tree

1 file changed: +2 additions, -5 deletions


docs/hub/gguf-llamacpp.md

Lines changed: 2 additions & 5 deletions
````diff
@@ -30,18 +30,15 @@ cd llama.cpp && LLAMA_CURL=1 make
 Once installed, you can use the `llama-cli` or `llama-server` as follows:
 
 ```bash
-llama-cli
-llama-cli -hf bartowski/Llama-3.2-3B-Instruct-GGUF:Q8_0
--p "You are a helpful assistant" -cnv
+llama-cli -hf bartowski/Llama-3.2-3B-Instruct-GGUF:Q8_0
 ```
 
 Note: You can remove `-cnv` to run the CLI in chat completion mode.
 
 Additionally, you can invoke an OpenAI spec chat completions endpoint directly using the llama.cpp server:
 
 ```bash
-llama-server \
--hf bartowski/Llama-3.2-3B-Instruct-GGUF:Q8_0
+llama-server -hf bartowski/Llama-3.2-3B-Instruct-GGUF:Q8_0
 ```
 
 After running the server you can simply utilise the endpoint as below:
````
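For context on the server workflow the diff describes: once `llama-server` is running, it exposes an OpenAI-compatible `/v1/chat/completions` endpoint. Below is a minimal sketch (not part of this commit) of calling that endpoint from Python's standard library, assuming the server is on its default address `http://localhost:8080`; the helper names `build_payload` and `chat` are hypothetical, introduced here for illustration.

```python
import json
import urllib.request

def build_payload(prompt, system="You are a helpful assistant"):
    # Hypothetical helper: assemble an OpenAI-style chat completions
    # request body with a system message and one user message.
    return {
        "messages": [
            {"role": "system", "content": system},
            {"role": "user", "content": prompt},
        ],
    }

def chat(prompt, base_url="http://localhost:8080"):
    # Hypothetical helper: POST the payload to a running llama-server
    # and return the assistant's reply text from the first choice.
    req = urllib.request.Request(
        base_url + "/v1/chat/completions",
        data=json.dumps(build_payload(prompt)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```

With the server started as shown in the diff, `chat("What is GGUF?")` would return the model's reply as a string.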
