Commit 5d86186

minor changes

1 parent 0e17e45 commit 5d86186

File tree

2 files changed: +14 −1 lines changed


content/learning-paths/servers-and-cloud-computing/llama_cpp_streamline/Analyzing_token_generation_at_Prefill_and_Decode_stage.md

Lines changed: 13 additions & 0 deletions

````diff
@@ -54,6 +54,19 @@ To add Annotation Markers to llama-cli, change the llama-cli code *llama.cpp/too
 ```
 and the Annotation Marker code in the 'main' function,
 
+Firstly, add the Streamline Annotation setup code after *common_init*,
+```c
+    common_init();
+
+    //Add the Annotation setup code
+    ANNOTATE_SETUP;
+
+```
+
+
+then add the Annotation Marker generation code here,
+
+
 ```c
     for (int i = 0; i < (int) embd.size(); i += params.n_batch) {
         int n_eval = (int) embd.size() - i;
````

content/learning-paths/servers-and-cloud-computing/llama_cpp_streamline/_index.md

Lines changed: 1 addition & 1 deletion

```diff
@@ -1,5 +1,5 @@
 ---
-title: Use Streamline to analyze LLM running on CPU with llama.cpp
+title: Use Streamline to analyze LLM running on CPU with llama.cpp and KleidiAI
 
 minutes_to_complete: 50
 
```

0 commit comments
