Skip to content

Commit a43f7b9

Browse files
chunyuan-wsvekars
andauthored
Update prototype_source/max_autotune_on_CPU_tutorial.rst
Co-authored-by: Svetlana Karslioglu <[email protected]>
1 parent f9b4159 commit a43f7b9

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

prototype_source/max_autotune_on_CPU_tutorial.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -96,7 +96,7 @@ In this example, C++ template outperforms ATen kernel so that it will be selecte
9696
9797
9898
We could check the generated output code by setting ``export TORCH_LOGS="+output_code"``.
99-
When CPP template is selected, we won't have ``torch.ops.mkldnn._linear_pointwise.default`` (for bfloat16) or ``torch.ops.mkl._mkl_linear.default`` (for float32)
99+
When C++ template is selected, we won't have ``torch.ops.mkldnn._linear_pointwise.default`` (for bfloat16) or ``torch.ops.mkl._mkl_linear.default`` (for float32)
100100
in the generated code anymore, instead, we'll find kernel based on CPP GEMM template ``cpp_fused__to_copy_relu_1``
101101
(only part of the code is demonstrated below for simplicity) with the bias and relu epilogues fused inside the CPP GEMM template kernel.
102102

0 commit comments

Comments
 (0)