1. To improve inference latency, you can use the Intel® Neural Compressor (INC) to quantize the trained model from FP32 to INT8 by running `quantize_model.py`.
Use the `-datapath` argument to specify a custom evaluation dataset. By default, the datapath is set to the `$COMMON_VOICE_PATH/processed_data/dev` folder that was generated from the data preprocessing scripts in the `Training` folder.
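
For example, a run against the default evaluation set might look like the sketch below; `quantize_model.py` may take additional arguments (such as the path to the trained model) that are not shown here:

```
python quantize_model.py -datapath $COMMON_VOICE_PATH/processed_data/dev
```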
After quantization, the model is stored in `lang_id_commonvoice_model_INT8`, and `neural_compressor.utils.pytorch.load` must be used to load the quantized model for inference. If `self.language_id` is the original model and `data_path` is the path to the audio file:
```
from neural_compressor.utils.pytorch import load

# Load the INT8 weights saved in lang_id_commonvoice_model_INT8 on top of the original model
self.model_int8 = load("lang_id_commonvoice_model_INT8", self.language_id)

# Read the audio file and run inference with the quantized model
signal = self.language_id.load_audio(data_path)
prediction = self.model_int8(signal)
```
The code above is integrated into `inference_custom.py`, so you can now run inference on your own data with the INT8 model.
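
A possible invocation is sketched below; the `-p` data-folder argument and the `--int8_model` switch are assumptions about how `inference_custom.py` exposes the quantized model, so check the script's options for the exact names:

```
python inference_custom.py -p data_custom --int8_model --verbose
```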
>**Note**: The `--verbose` option is required to view the latency measurements.
**(Optional) Comparing Predictions with Ground Truth**

You can modify `audio_ground_truth_labels.csv` to include the name of each audio file and its expected label (for example, `en` for English), then run `inference_custom.py` with the `--ground_truth_compare` option. By default, this option is disabled.
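
A sketch of such a run, reusing the assumed `-p` data-folder argument from above together with the documented `--ground_truth_compare` flag:

```
python inference_custom.py -p data_custom --ground_truth_compare
```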

### Troubleshooting

If the model appears to give the same output regardless of input, try running `clean.sh` to remove the `RIR_NOISES` and `speechbrain` folders. After cleaning, re-download the data by running `initialize.sh` and then run either `inference_commonVoice.py` or `inference_custom.py` again.
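
As a sketch, the recovery sequence might look like this (the `./` invocation style and the `-p data_custom` argument are assumptions about your setup):

```
./clean.sh                                  # remove the RIR_NOISES and speechbrain folders
./initialize.sh                             # re-download the data
python inference_custom.py -p data_custom   # or run inference_commonVoice.py instead
```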