Skip to content

Commit 220decc

Browse files
committed
Update on "Documentation Updates"
Summary: Updating README with better examples, updating class and api documentation and removing the unnecessary int_mm_fused_mul option from dynamic quant Test Plan: python test/test.py Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
1 parent 3eb1458 commit 220decc

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -28,7 +28,7 @@ torchao 0.0.1 <install dir>
2828
Relevant APIs can be found in torchao.quantization.quant_api
2929

3030
Note: While these techniques are designed to improve model performance, in some cases the opposite can occur.
31-
This is because quantization adds additional overhead to the model that is hopefully made up for by faster matmuls (for dynamic quantization) or loading weights faster (for weight-only quantization). If your matmuls are small enough or your non-quantized perf isn't bottlenecked by weight load time, these techniques may reduce performance.
31+
This is because quantization adds additional overhead to the model that is hopefully made up for by faster matmuls (dynamic quantization) or loading weights faster (weight-only quantization). If your matmuls are small enough or your non-quantized perf isn't bottlenecked by weight load time, these techniques may reduce performance.
3232

3333
### A8W8 Dynamic Quantization
3434

0 commit comments

Comments
 (0)