Skip to content
This repository was archived by the owner on Sep 10, 2025. It is now read-only.

Commit 74363e4

Browse files
metascroyJack-Khuu
andcommitted
Update docs/quantization.md
Co-authored-by: Jack-Khuu <[email protected]>
1 parent bdac616 commit 74363e4

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

docs/quantization.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -140,7 +140,7 @@ The quantization scheme embedding:wx quantizes embeddings in a groupwise manner
140140
You should expect high performance on ARM CPU if groupsize is divisible by 32. With other platforms and argument choices, a slow fallback kernel will be used. You will see warnings about this during quantization.
141141

142142
### Setup
143-
If you are using the torchao ops from python, they are available out of the box on a Mac with Apple Silicon, and you can skip these setup steps.
143+
If you are using the torchao ops from python (i.e not with a C++ runner), they are available out of the box on a Mac with Apple Silicon, and you can skip these setup steps.
144144

145145
If you plan to use the kernels from the AOTI/ExecuTorch C++ runners, follow the setup steps below.
146146

0 commit comments

Comments
 (0)