@@ -13,14 +13,14 @@ determine how it fits with your use case.
13
13
* To quickly find the APIs you need for your use case, see the
14
14
[ quantization aware training comprehensive guide] ( training_comprehensive_guide.md ) .
15
15
16
- ### Overview
16
+ ## Overview
17
17
18
18
Quantization aware training emulates inference-time quantization, creating a
19
19
model that downstream tools will use to produce actually quantized models.
20
20
The quantized models use lower-precision (e.g. 8-bit instead of 32-bit float),
21
21
leading to benefits during deployment.
22
22
23
- #### Deploy with quantization
23
+ ### Deploy with quantization
24
24
25
25
Quantization brings improvements via model compression and latency reduction.
26
26
With the API defaults, the model size shrinks by 4x, and we typically see
@@ -31,7 +31,7 @@ such as the [EdgeTPU](https://coral.ai/docs/edgetpu/benchmarks/) and NNAPI.
31
31
The technique is used in production in speech, vision, text, and translate use
32
32
cases. The code currently supports a subset of these models.
33
33
34
- #### Experiment with quantization and associated hardware
34
+ ### Experiment with quantization and associated hardware
35
35
36
36
Users can configure the quantization parameters (e.g. number of bits) and to
37
37
some degree, the underlying algorithms. With these changes from the API
@@ -40,7 +40,7 @@ defaults, there is no supported path to deployment.
40
40
APIs specific to this configuration are experimental and not subject to backward
41
41
compatibility.
42
42
43
- #### API compatibility
43
+ ### API compatibility
44
44
45
45
Users can apply quantization with the following APIs:
46
46
@@ -56,7 +56,7 @@ It is on our roadmap to add support in the following areas:
56
56
* Model building: clarify how Subclassed Models have limited to no support
57
57
* Distributed training: ` tf.distribute `
58
58
59
- #### General support matrix
59
+ ### General support matrix
60
60
61
61
Support is available in the following areas:
62
62
@@ -85,9 +85,9 @@ to launch. -->
85
85
require the training step.
86
86
* Stabilize APIs.
87
87
88
- ### Results
88
+ ## Results
89
89
90
- #### Image classification with tools
90
+ ### Image classification with tools
91
91
92
92
<figure >
93
93
<table >
@@ -116,7 +116,7 @@ to launch. -->
116
116
117
117
The models were tested on Imagenet and evaluated in both TensorFlow and TFLite.
118
118
119
- #### Image classification for technique
119
+ ### Image classification for technique
120
120
121
121
<figure >
122
122
<table >
@@ -139,7 +139,7 @@ The models were tested on Imagenet and evaluated in both TensorFlow and TFLite.
139
139
140
140
The models were tested on Imagenet and evaluated in both TensorFlow and TFLite.
141
141
142
- ### Examples
142
+ ## Examples
143
143
144
144
In addition to the
145
145
[ quantization aware training example] ( training_example.md ) ,
0 commit comments