Commit c29de13

cccclai authored and facebook-github-bot committed
Format llama read me page (#12782)
Summary: The current rendering is a bit off; this passage is supposed to render as plain text but is rendered as a code block. Before: {F1980589435} After: {F1980589702} Differential Revision: D78847177
1 parent ef10a35 commit c29de13

File tree

1 file changed: 4 additions, 3 deletions


examples/models/llama/README.md

Lines changed: 4 additions & 3 deletions
@@ -220,14 +220,15 @@ You can export and run the original Llama 3 8B instruct model.
 ```
 python -m extension.llm.export.export_llm \
     --config examples/models/llama/config/llama_q8da4w.yaml
-    +base.model_clas="llama3"
+    +base.model_class="llama3"
     +base.checkpoint=<consolidated.00.pth.pth> \
     +base.params=<params.json>
 ```
-Due to the larger vocabulary size of Llama 3, we recommend quantizing the embeddings with `quantization.embedding_quantize=\'4,32\'` as shown above to further reduce the model size.
 
+Due to the larger vocabulary size of Llama 3, we recommend quantizing the embeddings with `quantization.embedding_quantize=\'4,32\'` as shown above to further reduce the model size.
 
-If you're interested in deploying on non-CPU backends, [please refer the non-cpu-backend section](non_cpu_backends.md)
+
+If you're interested in deploying on non-CPU backends, [please refer the non-cpu-backend section](non_cpu_backends.md)
 
 ## Step 3: Run on your computer to validate
 
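For reference, a sketch of the corrected export command as it would be run after this commit. This is an assumption-laden reading of the README snippet: the angle-bracket paths are placeholders you replace with your own Llama 3 checkpoint and params files, and line-continuation backslashes are added here (the snippet omits them on some lines) so the overrides parse as a single invocation.

```shell
# Sketch of the export command from the patched README, not a verified
# invocation. <consolidated.00.pth.pth> and <params.json> are placeholders
# for your local Llama 3 files; trailing backslashes are added on every
# line so the shell reads this as one command.
python -m extension.llm.export.export_llm \
    --config examples/models/llama/config/llama_q8da4w.yaml \
    +base.model_class="llama3" \
    +base.checkpoint=<consolidated.00.pth.pth> \
    +base.params=<params.json>
```

The `+key=value` arguments override fields of the YAML config on the command line, which is why the typo fix from `model_clas` to `model_class` matters: a misspelled key would not match the intended config field.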