Skip to content

Commit 56b11ca

Browse files
committed
up
1 parent 62549a8 commit 56b11ca

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

examples/models/llama/README.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -345,6 +345,8 @@ We now give instructions for quantizating and running your model with low-bit ke
345345
EXECUTORCH_BUILD_KERNELS_TORCHAO=1 python install_executorch.py
346346
```
347347
348+
(If you'd like lowbit to use KleidiAI when available, you can instead install with `EXECUTORCH_BUILD_KERNELS_TORCHAO=1 TORCHAO_BUILD_KLEIDIAI=1 python install_executorch.py`.)
349+
348350
Also note that low-bit quantization often requires QAT (quantization-aware training) to give good quality results.
349351
350352
First export your model for lowbit quantization (step 2 above):

0 commit comments

Comments
 (0)