Merge pull request #216339 from alexeyo26/alexeyo/mi-security-datasets

prmerger-automator[bot] · web-flow · commit a272836560ab · 2022-10-28T15:16:00.000Z
[CogSvc] Speech. MI security reference for Custom Speech datasets
diff --git a/articles/cognitive-services/Speech-Service/how-to-custom-speech-upload-data.md b/articles/cognitive-services/Speech-Service/how-to-custom-speech-upload-data.md
@@ -8,7 +8,7 @@ manager: nitinme
 ms.service: cognitive-services
 ms.subservice: speech-service
 ms.topic: how-to
-ms.date: 05/08/2022
+ms.date: 10/28/2022
 ms.author: eur
 zone_pivot_groups: speech-studio-cli-rest
 ---
@@ -30,7 +30,11 @@ To upload your own datasets in Speech Studio, follow these steps:
 1. Select **Custom Speech** > Your project name > **Speech datasets** > **Upload data**.
 1. Select the **Training data** or **Testing data** tab.
 1. Select a dataset type, and then select **Next**.
-1. Specify the dataset location, and then select **Next**. You can choose a local file or enter a remote location such as Azure Blob public access URL.
+1. Specify the dataset location, and then select **Next**. You can choose a local file or enter a remote location such as an Azure Blob URL.
+
+    > [!NOTE]
+    > If you use an Azure Blob URL, you can better protect your dataset files by using the trusted Azure services security mechanism. Use the same techniques as for Batch transcription, and specify plain Storage Account URLs for your dataset files. For details, see [Trusted Azure services security mechanism](batch-transcription-audio-data.md#trusted-azure-services-security-mechanism).
+
 1. Enter the dataset name and description, and then select **Next**.
 1. Review your settings, and then select **Save and close**.
 
@@ -47,6 +51,10 @@ To create a dataset and connect it to an existing project, use the `spx csr data
 - Set the `project` parameter to the ID of an existing project. This is recommended so that you can also view and manage the dataset in Speech Studio. You can run the `spx csr project list` command to get available projects.
 - Set the required `kind` parameter. The possible set of values for dataset kind are: Language, Acoustic, Pronunciation, and AudioFiles.
 - Set the required `contentUrl` parameter. This is the location of the dataset.
+
+    > [!NOTE]
+    > If you use an Azure Blob URL, you can better protect your dataset files by using the trusted Azure services security mechanism. Use the same techniques as for Batch transcription, and specify plain Storage Account URLs for your dataset files. For details, see [Trusted Azure services security mechanism](batch-transcription-audio-data.md#trusted-azure-services-security-mechanism).
+
 - Set the required `language` parameter. The dataset locale must match the locale of the project. The locale can't be changed later. The Speech CLI `language` parameter corresponds to the `locale` property in the JSON request and response.
 - Set the required `name` parameter. This is the name that will be displayed in the Speech Studio. The Speech CLI `name` parameter corresponds to the `displayName` property in the JSON request and response.
 
@@ -101,6 +109,10 @@ To create a dataset and connect it to an existing project, use the [CreateDatase
 - Set the `project` property to the URI of an existing project. This is recommended so that you can also view and manage the dataset in Speech Studio. You can make a [GetProjects](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-0/operations/GetProjects) request to get available projects.
 - Set the required `kind` property. The possible set of values for dataset kind are: Language, Acoustic, Pronunciation, and AudioFiles.
 - Set the required `contentUrl` property. This is the location of the dataset.
+
+    > [!NOTE]
+    > If you use an Azure Blob URL, you can better protect your dataset files by using the trusted Azure services security mechanism. Use the same techniques as for Batch transcription, and specify plain Storage Account URLs for your dataset files. For details, see [Trusted Azure services security mechanism](batch-transcription-audio-data.md#trusted-azure-services-security-mechanism).
+
 - Set the required `locale` property. The dataset locale must match the locale of the project. The locale can't be changed later. 
 - Set the required `displayName` property. This is the name that will be displayed in the Speech Studio.
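
Reviewer note: the property list in this hunk can be illustrated with a minimal sketch that only assembles the request URL and JSON body for a dataset creation call, without sending anything. The helper name `build_dataset_request`, the endpoint pattern `https://<region>.api.cognitive.microsoft.com/speechtotext/v3.0/datasets`, and the `{"self": ...}` shape of the `project` property are assumptions for illustration and are not taken from this diff; check them against the CreateDataset reference before relying on them.

```python
import json

# Dataset kinds listed in the article text.
ALLOWED_KINDS = {"Language", "Acoustic", "Pronunciation", "AudioFiles"}


def build_dataset_request(region, kind, content_url, locale, display_name,
                          project_url=None):
    """Assemble (url, body) for a dataset creation request.

    Hypothetical helper: the endpoint pattern and the shape of the
    `project` property are assumptions, not confirmed by the diff.
    """
    if kind not in ALLOWED_KINDS:
        raise ValueError(f"Unsupported dataset kind: {kind!r}")

    body = {
        "kind": kind,                 # required
        "contentUrl": content_url,    # required: location of the dataset
        "locale": locale,             # required: must match the project locale
        "displayName": display_name,  # required: name shown in Speech Studio
    }
    if project_url is not None:
        # Optional but recommended: link the dataset to an existing project.
        body["project"] = {"self": project_url}

    # Assumed endpoint pattern for the v3.0 REST API.
    url = f"https://{region}.api.cognitive.microsoft.com/speechtotext/v3.0/datasets"
    return url, body


if __name__ == "__main__":
    url, body = build_dataset_request(
        region="eastus",
        kind="Acoustic",
        # Placeholder plain Storage Account URL (no SAS token).
        content_url="https://example.blob.core.windows.net/datasets/audio.zip",
        locale="en-US",
        display_name="My acoustic dataset",
    )
    print(url)
    print(json.dumps(body, indent=2))
```

The body would then be sent as JSON in a POST request with the Speech resource key in the request headers; that step is omitted here so the sketch stays free of network calls.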