Update prototype_source/max_autotune_on_CPU_tutorial.rst

chunyuan-w · svekars · web-flow · commit a43f7b99a5d0 · 2024-10-02T10:28:04.000+08:00
Co-authored-by: Svetlana Karslioglu &lt;svekars@meta.com&gt;
diff --git a/prototype_source/max_autotune_on_CPU_tutorial.rst b/prototype_source/max_autotune_on_CPU_tutorial.rst
@@ -96,7 +96,7 @@ In this example, C++ template outperforms ATen kernel so that it will be selecte
 
 
 We could check the generated output code by setting ``export TORCH_LOGS="+output_code"``.
-When CPP template is selected, we won't have ``torch.ops.mkldnn._linear_pointwise.default`` (for bfloat16) or ``torch.ops.mkl._mkl_linear.default`` (for float32)
+When C++ template is selected, we won't have ``torch.ops.mkldnn._linear_pointwise.default`` (for bfloat16) or ``torch.ops.mkl._mkl_linear.default`` (for float32)
 in the generated code anymore, instead, we'll find kernel based on CPP GEMM template ``cpp_fused__to_copy_relu_1``
 (only part of the code is demonstrated below for simplicity) with the bias and relu epilogues fused inside the CPP GEMM template kernel.