Commit a272836

Merge pull request #216339 from alexeyo26/alexeyo/mi-security-datasets
[CogSvc] Speech. MI security reference for Custom Speech datasets
2 parents 8eab223 + 2ee8966 commit a272836

1 file changed, +14 -2 lines changed

articles/cognitive-services/Speech-Service/how-to-custom-speech-upload-data.md

Lines changed: 14 additions & 2 deletions
@@ -8,7 +8,7 @@ manager: nitinme
 ms.service: cognitive-services
 ms.subservice: speech-service
 ms.topic: how-to
-ms.date: 05/08/2022
+ms.date: 10/28/2022
 ms.author: eur
 zone_pivot_groups: speech-studio-cli-rest
 ---
@@ -30,7 +30,11 @@ To upload your own datasets in Speech Studio, follow these steps:
 1. Select **Custom Speech** > Your project name > **Speech datasets** > **Upload data**.
 1. Select the **Training data** or **Testing data** tab.
 1. Select a dataset type, and then select **Next**.
-1. Specify the dataset location, and then select **Next**. You can choose a local file or enter a remote location such as Azure Blob public access URL.
+1. Specify the dataset location, and then select **Next**. You can choose a local file or enter a remote location such as Azure Blob URL.
+
+> [!NOTE]
+> If you use Azure Blob URL, you can ensure maximum security of your dataset files by using trusted Azure services security mechanism. You will use the same techniques as for Batch transcription and plain Storage Account URLs for your dataset files. See details [here](batch-transcription-audio-data.md#trusted-azure-services-security-mechanism).
+
 1. Enter the dataset name and description, and then select **Next**.
 1. Review your settings, and then select **Save and close**.

@@ -47,6 +51,10 @@ To create a dataset and connect it to an existing project, use the `spx csr data
 - Set the `project` parameter to the ID of an existing project. This is recommended so that you can also view and manage the dataset in Speech Studio. You can run the `spx csr project list` command to get available projects.
 - Set the required `kind` parameter. The possible set of values for dataset kind are: Language, Acoustic, Pronunciation, and AudioFiles.
 - Set the required `contentUrl` parameter. This is the location of the dataset.
+
+> [!NOTE]
+> If you use Azure Blob URL, you can ensure maximum security of your dataset files by using trusted Azure services security mechanism. You will use the same techniques as for Batch transcription and plain Storage Account URLs for your dataset files. See details [here](batch-transcription-audio-data.md#trusted-azure-services-security-mechanism).
+
 - Set the required `language` parameter. The dataset locale must match the locale of the project. The locale can't be changed later. The Speech CLI `language` parameter corresponds to the `locale` property in the JSON request and response.
 - Set the required `name` parameter. This is the name that will be displayed in the Speech Studio. The Speech CLI `name` parameter corresponds to the `displayName` property in the JSON request and response.

@@ -101,6 +109,10 @@ To create a dataset and connect it to an existing project, use the [CreateDatase
 - Set the `project` property to the URI of an existing project. This is recommended so that you can also view and manage the dataset in Speech Studio. You can make a [GetProjects](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-0/operations/GetProjects) request to get available projects.
 - Set the required `kind` property. The possible set of values for dataset kind are: Language, Acoustic, Pronunciation, and AudioFiles.
 - Set the required `contentUrl` property. This is the location of the dataset.
+
+> [!NOTE]
+> If you use Azure Blob URL, you can ensure maximum security of your dataset files by using trusted Azure services security mechanism. You will use the same techniques as for Batch transcription and plain Storage Account URLs for your dataset files. See details [here](batch-transcription-audio-data.md#trusted-azure-services-security-mechanism).
+
 - Set the required `locale` property. The dataset locale must match the locale of the project. The locale can't be changed later.
 - Set the required `displayName` property. This is the name that will be displayed in the Speech Studio.

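For readers of this change, the REST properties named in the article text (`project`, `kind`, `contentUrl`, `locale`, `displayName`) could be assembled into a request roughly as in the Python sketch below. This is an illustration and not part of the commit. The function name `build_create_dataset_request`, the endpoint layout, the `Ocp-Apim-Subscription-Key` header, and the `{"self": ...}` wrapper around the project URI are assumptions to check against the CreateDataset reference. The request is only built, not sent.

```python
import json
import urllib.request

# Dataset kinds listed in the article text.
ALLOWED_KINDS = {"Language", "Acoustic", "Pronunciation", "AudioFiles"}


def build_create_dataset_request(region, subscription_key, project_url,
                                 content_url, locale, display_name,
                                 kind="Acoustic"):
    """Build (but do not send) a POST request for a dataset-creation call.

    Hypothetical helper for illustration; it is not part of any SDK.
    """
    if kind not in ALLOWED_KINDS:
        raise ValueError(f"unsupported dataset kind: {kind!r}")

    body = {
        "kind": kind,
        "contentUrl": content_url,
        "locale": locale,
        "displayName": display_name,
        # Assumption: the project URI is wrapped in a "self" field.
        "project": {"self": project_url},
    }
    # Assumption: v3.0 endpoint layout; adjust region and API version as needed.
    url = (f"https://{region}.api.cognitive.microsoft.com"
           "/speechtotext/v3.0/datasets")
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            # Assumption: key-based authentication header.
            "Ocp-Apim-Subscription-Key": subscription_key,
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send it. Under the trusted Azure services mechanism described in the added note, `content_url` would be a plain Storage Account URL for the dataset file.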
