The `Intel® Neural Compressor TensorFlow* Getting Started*` Sample demonstrates using the Intel® Neural Compressor, which is part of the Intel® AI Tools, with Intel® Optimizations for TensorFlow* to speed up inference by simplifying the process of converting the FP32 model to INT8/BF16.
| Property | Description
|:--- |:---
| Category | Getting Started
| What you will learn | How to use the Intel® Neural Compressor tool to quantize a TensorFlow*-based AI model and speed up inference on Intel® Xeon® CPUs
| Time to complete | 10 minutes
## Purpose
This sample shows the process of building a convolutional neural network (CNN) model to recognize handwritten numbers and demonstrates how to increase the inference performance by using Intel® Neural Compressor. Low-precision optimizations can speed up inference. Intel® Neural Compressor simplifies the process of converting the FP32 model to INT8/BF16. At the same time, Intel® Neural Compressor tunes the quantization method to reduce the accuracy loss, which is a major obstacle for low-precision inference.
You can achieve higher inference performance by converting the FP32 model to an INT8 or BF16 model. Additionally, Intel® Deep Learning Boost (Intel® DL Boost) in Intel® Xeon® Scalable processors and Xeon® processors provides hardware acceleration for INT8 and BF16 models.
You will learn how to train a CNN model with Keras and TensorFlow*, use Intel® Neural Compressor to quantize the model, and compare the performance to see the benefit of Intel® Neural Compressor.
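As a preview of the quantization step, the following is a minimal sketch using the Intel® Neural Compressor 2.x post-training quantization API. The model path `./fp32_model`, the calibration slice size, and the small dataset wrapper are illustrative assumptions; the notebook contains the exact code.

```python
import numpy as np
import tensorflow as tf
from neural_compressor import PostTrainingQuantConfig
from neural_compressor.data import DataLoader
from neural_compressor.quantization import fit

# Build a small calibration set from MNIST, the dataset this sample uses.
(x_train, _), _ = tf.keras.datasets.mnist.load_data()
calib_images = (x_train[:100] / 255.0).astype("float32").reshape(-1, 28, 28, 1)

class CalibDataset:
    """Minimal dataset wrapper yielding (input, label) pairs for calibration."""
    def __init__(self, images):
        self.images = images
    def __len__(self):
        return len(self.images)
    def __getitem__(self, index):
        return self.images[index], 0  # the label is unused during calibration

calib_loader = DataLoader(framework="tensorflow", dataset=CalibDataset(calib_images))

# Quantize the trained FP32 model; accuracy-driven tuning is the default.
q_model = fit(
    model="./fp32_model",            # assumed path to the trained Keras model
    conf=PostTrainingQuantConfig(),
    calib_dataloader=calib_loader,
)
q_model.save("./int8_model")         # save the quantized INT8 model
```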
## Prerequisites
| Optimized for | Description
|:--- |:---
| OS | Ubuntu* 20.04 (or newer) <br> Windows* 10, 11
| Software | Intel® Neural Compressor, Intel® Optimization for TensorFlow*
### Intel® Neural Compressor and Sample Code Versions
>**Note**: See the [Intel® Neural Compressor](https://github.com/intel/neural-compressor) GitHub repository for more information and recent changes.
This sample is updated regularly to match the Intel® Neural Compressor version in the latest Intel® AI Tools release. If you want to get the sample code for an earlier toolkit release, check out the corresponding git tag.
1. List the available git tags.
   ```
   git tag
   ```
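2. Check out the tag that matches your installed toolkit release; `<tag_name>` is a placeholder for one of the tags listed by the previous command.
   ```
   git checkout <tag_name>
   ```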
The sample demonstrates how to:
- Use Keras from TensorFlow* to build and train a CNN model (a minimal sketch follows this list).
- Define a function and class for Intel® Neural Compressor to quantize the CNN model.
  - The Intel® Neural Compressor can run on any Intel® CPU to quantize the AI model.
  - The quantized AI model has better inference performance than the FP32 model on Intel CPUs.
  - Specifically, the latest Intel® Xeon® Scalable processors and Xeon® processors provide hardware acceleration for such tasks.
- Test the performance of the FP32 model and INT8 (quantization) model.
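As referenced above, here is a minimal sketch of the kind of Keras CNN the sample trains on the MNIST handwritten-digit dataset; the exact architecture, training settings, and save path in the notebook may differ.

```python
import tensorflow as tf

# Load and normalize the MNIST handwritten-digit dataset.
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()
x_train = (x_train / 255.0).astype("float32").reshape(-1, 28, 28, 1)
x_test = (x_test / 255.0).astype("float32").reshape(-1, 28, 28, 1)

# A small CNN: two convolution/pooling stages and a dense classifier.
model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(32, 3, activation="relu", input_shape=(28, 28, 1)),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(64, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.fit(x_train, y_train, epochs=1, validation_data=(x_test, y_test))

# Save the FP32 model so it can be quantized later (path is illustrative).
model.save("./fp32_model")
```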
## Environment Setup
If you have already set up the PIP or Conda environment and installed AI Tools, go directly to Run the Notebook.
### On Linux* (Only applicable to AI Tools Offline Installer)
When working with the command-line interface (CLI), you should configure the oneAPI toolkits using environment variables. Set up your CLI environment by sourcing the `setvars` script every time you open a new terminal window. This practice ensures that your compiler, libraries, and tools are ready for development.

> For more information on configuring environment variables, see *[Use the setvars Script with Linux* or macOS*](https://www.intel.com/content/www/us/en/develop/documentation/oneapi-programming-guide/top/oneapi-development-environment-setup/use-the-setvars-script-with-linux-or-macos.html)* or *[Use the setvars Script with Windows*](https://www.intel.com/content/www/us/en/develop/documentation/oneapi-programming-guide/top/oneapi-development-environment-setup/use-the-setvars-script-with-windows.html)*.
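For a default installation, sourcing the script typically looks like the following; adjust the path if you installed the toolkits elsewhere.

```
source /opt/intel/oneapi/setvars.sh
```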
#### Set Up Conda Environment
You can list the available conda environments using a command similar to the following.
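With a standard conda installation, that command is typically:

```
conda env list
```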
##### Option 1: Clone Conda Environment from AI Toolkit Conda Environment
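A typical clone looks like the following sketch. The source environment name `tensorflow` and the target name `usr_tensorflow` are assumptions; substitute the environment names that `conda env list` reports on your system.

```
conda create --name usr_tensorflow --clone tensorflow
conda activate usr_tensorflow
```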
### Example Output
You should see log output and images showing the performance comparison, with absolute and relative data and analysis, between the FP32 and INT8 models.
The following is an example; your data will differ.
```
#absolute data
throughputs_times [1, 2.51508607887295]
latencys_times [1, 0.38379207710795576]
accuracys_times [0, -0.009999999999990905]
#relative data
throughputs_times [1, 2.51508607887295]
latencys_times [1, 0.38379207710795576]
accuracys_times [0, -0.009999999999990905]
```
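In this example run, the INT8 model reaches about 2.5 times the throughput of the FP32 baseline, its latency falls to roughly 0.38 of FP32 (about a 2.6x per-inference speedup), and accuracy drops by only about 0.01.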


#### Troubleshooting
If you receive an error message, troubleshoot the problem using the **Diagnostics Utility for Intel® oneAPI Toolkits**. The diagnostic utility provides configuration and system checks to help find missing dependencies, permissions errors, and other issues. See the [Diagnostics Utility for Intel® oneAPI Toolkits User Guide](https://www.intel.com/content/www/us/en/develop/documentation/diagnostic-utility-user-guide/top.html) for more information on using the utility.
## Related Samples
[PyTorch `Getting Started with Intel® Neural Compressor for Quantization` Sample](../INC-Quantization-Sample-for-PyTorch)