Update docs/quantization.md

metascroy · Jack-Khuu · metascroy · commit 74363e432ff1 · 2025-01-29T17:15:26.000-08:00
Co-authored-by: Jack-Khuu &lt;jack.khuu.7@gmail.com&gt;
diff --git a/docs/quantization.md b/docs/quantization.md
@@ -140,7 +140,7 @@ The quantization scheme embedding:wx quantizes embeddings in a groupwise manner
 You should expect high performance on ARM CPU if groupsize is divisible by 32.  With other platforms and argument choices, a slow fallback kernel will be used.  You will see warnings about this during quantization.
 
 ### Setup
-If you are using the torchao ops from python, they are available out of the box on a Mac with Apple Silicon, and you can skip these setup steps.
+If you are using the torchao ops from python (i.e not with a C++ runner), they are available out of the box on a Mac with Apple Silicon, and you can skip these setup steps.
 
 If you plan to use the kernels from the AOTI/ExecuTorch C++ runners, follow the setup steps below.