
Commit 47eda03

committed
Addressed reviewer's comments.
Change-Id: I8fecf1d901e26951011c0e08079bb4807d7eb0c1
1 parent eb95d11 commit 47eda03

2 files changed: +8 −6 lines

tensorflow_model_optimization/g3doc/guide/pruning/index.md

Lines changed: 3 additions & 3 deletions
@@ -131,9 +131,9 @@ in [Arm’s ML Examples repository](https://github.com/ARM-software/ML-examples/
 <table>
 <tr>
 <th>Model</th>
-<th>Unpruned</th>
-<th>Sparsity 2 by 4 </th>
-<th>Sparsity, 50% </th>
+<th>Non-sparse Accuracy</th>
+<th>Structured Sparse Accuracy (2 by 4 pattern)</th>
+<th>Random Sparse Accuracy (target sparsity 50%)</th>
 </tr>
 <tr>
 <td>DS-CNN-L</td>

tensorflow_model_optimization/g3doc/guide/pruning/pruning_with_sparsity_2_by_4.ipynb

Lines changed: 5 additions & 3 deletions
@@ -93,7 +93,7 @@
 "Structural pruning zeroes out model weights at the beginning of the training\n",
 "process according to the following pattern: M weights are set to zero in the\n",
 "block of N weights. It is important to notice that this pattern affects only the last dimension of the weight tensor for the model that is converted by TensorFlow Lite. For example, `Conv2D` layer weights in TensorFlow Lite have the structure [channel_out, height, width, channel_in] and `Dense` layer weights have the structure [channel_out, channel_in]. The sparsity pattern is applied to the weights in the last dimension: channel_in.\n",
-"Special hardware can benefit from this type of sparsity in the model and inference time can have a speedup up to 2x. Because this pattern lock in sparsity is more restrictive, the accuracy achieved after fine-tuning is worse than with the magnitude-based pruning.\n",
+"Special hardware can benefit from this type of sparsity in the model and inference time can have a significant speedup. Because this pattern lock in sparsity is more restrictive, the accuracy achieved after fine-tuning is worse than with the magnitude-based pruning.\n",
 "It is important to indicate that the pattern is valid only for the model that is converted to tflite.\n",
 "If the model is quantized, then the accuracy could be improved using [collaborative optimization technique](https://blog.tensorflow.org/2021/10/Collaborative-Optimizations.html): Sparsity preserving quantization aware training."
 ],
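The hunk above describes the M-by-N pattern: in every block of N consecutive weights along the last dimension, M are set to zero. A minimal NumPy sketch of the 2-out-of-4 case (a hypothetical `prune_m_by_n` helper for illustration, not the tfmot implementation; it assumes the last dimension is divisible by n):

```python
import numpy as np

def prune_m_by_n(weights, m=2, n=4):
    """Zero the m smallest-magnitude weights in each block of n along the
    last dimension (illustrative sketch of the 2:4 structural pattern)."""
    w = weights.reshape(-1, n).copy()
    # Per block of n, find the indices of the m smallest magnitudes.
    smallest = np.argsort(np.abs(w), axis=1)[:, :m]
    np.put_along_axis(w, smallest, 0.0, axis=1)
    return w.reshape(weights.shape)

# Toy Dense-style weights, shape [channel_out, channel_in] with channel_in=4.
dense = np.arange(1.0, 9.0).reshape(2, 4)
pruned = prune_m_by_n(dense)
# Each block of 4 along the last dimension now contains exactly 2 zeros:
# [[0. 0. 3. 4.]
#  [0. 0. 7. 8.]]
```

In the real API this masking is configured through `tfmot.sparsity.keras.prune_low_magnitude` with a `sparsity_m_by_n` parameter, as the notebook shows elsewhere; the sketch only makes the blockwise pattern concrete.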
@@ -557,10 +557,12 @@
 "metadata": {}
 },
 {
-"cell_type": "markdown",
+"cell_type": "code",
+"execution_count": null,
 "source": [
-"`python ./tensorflow_model_optimization/python/core/sparsity/keras/tools/check_sparsity_m_by_n.py --model_tflite=pruned_model.tflite --m_by_n=2,4`"
+"! python ./tensorflow_model_optimization/python/core/sparsity/keras/tools/check_sparsity_m_by_n.py --model_tflite=pruned_model.tflite --m_by_n=2,4\n"
 ],
+"outputs": [],
 "metadata": {}
 }
 ],
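The `check_sparsity_m_by_n.py` tool invoked above verifies that a converted model's weights obey the M-by-N pattern. The core check it performs can be sketched in a few lines of NumPy (a hypothetical `check_m_by_n` helper, not the tool's actual code; it assumes the last dimension is divisible by n):

```python
import numpy as np

def check_m_by_n(weights, m=2, n=4):
    """Return True if every block of n consecutive values along the last
    dimension contains at least m zeros (illustrative sparsity check)."""
    blocks = np.asarray(weights).reshape(-1, n)
    zeros_per_block = np.count_nonzero(blocks == 0.0, axis=1)
    return bool(np.all(zeros_per_block >= m))

ok = check_m_by_n(np.array([[0.0, 0.0, 3.0, 4.0],
                            [5.0, 0.0, 0.0, 8.0]]))   # True: 2 zeros per block
bad = check_m_by_n(np.array([[1.0, 0.0, 3.0, 4.0]]))  # False: only 1 zero
```

The real tool additionally parses the `.tflite` flatbuffer to locate the weight tensors; the sketch covers only the per-block zero count.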

0 commit comments
