Commit 15e1896

Merge pull request #238512 from alexeyo26/alexeyo/enhance-dataset-url-info
[CogSvc] Speech. Explaining details on Custom Speech Data set URL requirements
2 parents 98e12a7 + 3357037 commit 15e1896

1 file changed: 3 additions, 3 deletions

articles/cognitive-services/Speech-Service/how-to-custom-speech-upload-data.md

Lines changed: 3 additions & 3 deletions
@@ -30,7 +30,7 @@ To upload your own datasets in Speech Studio, follow these steps:
 1. Select **Custom Speech** > Your project name > **Speech datasets** > **Upload data**.
 1. Select the **Training data** or **Testing data** tab.
 1. Select a dataset type, and then select **Next**.
-1. Specify the dataset location, and then select **Next**. You can choose a local file or enter a remote location such as Azure Blob URL.
+1. Specify the dataset location, and then select **Next**. You can choose a local file or enter a remote location such as Azure Blob URL. If you select remote location, it should be an URL that can be retrieved with a simple anonymous GET request. For example, a [SAS URL](/azure/storage/common/storage-sas-overview) or a publicly accessible URL. URLs that require extra authorization, or expect user interaction are not supported.
 
 > [!NOTE]
 > If you use Azure Blob URL, you can ensure maximum security of your dataset files by using trusted Azure services security mechanism. You will use the same techniques as for Batch transcription and plain Storage Account URLs for your dataset files. See details [here](batch-transcription-audio-data.md#trusted-azure-services-security-mechanism).
@@ -50,7 +50,7 @@ To create a dataset and connect it to an existing project, use the `spx csr data
 
 - Set the `project` parameter to the ID of an existing project. This is recommended so that you can also view and manage the dataset in Speech Studio. You can run the `spx csr project list` command to get available projects.
 - Set the required `kind` parameter. The possible set of values for dataset kind are: Language, Acoustic, Pronunciation, and AudioFiles.
-- Set the required `contentUrl` parameter. This is the location of the dataset.
+- Set the required `contentUrl` parameter. This is the location of the dataset. The `contentUrl` parameter should be an URL that can be retrieved with a simple anonymous GET request. For example, a [SAS URL](/azure/storage/common/storage-sas-overview) or a publicly accessible URL. URLs that require extra authorization, or expect user interaction are not supported.
 
 > [!NOTE]
 > If you use Azure Blob URL, you can ensure maximum security of your dataset files by using trusted Azure services security mechanism. You will use the same techniques as for Batch transcription and plain Storage Account URLs for your dataset files. See details [here](batch-transcription-audio-data.md#trusted-azure-services-security-mechanism).
@@ -108,7 +108,7 @@ To create a dataset and connect it to an existing project, use the [Datasets_Cre
 
 - Set the `project` property to the URI of an existing project. This is recommended so that you can also view and manage the dataset in Speech Studio. You can make a [Projects_List](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-1/operations/Projects_List) request to get available projects.
 - Set the required `kind` property. The possible set of values for dataset kind are: Language, Acoustic, Pronunciation, and AudioFiles.
-- Set the required `contentUrl` property. This is the location of the dataset.
+- Set the required `contentUrl` property. This is the location of the dataset. The `contentUrl` parameter should be an URL that can be retrieved with a simple anonymous GET request. For example, a [SAS URL](/azure/storage/common/storage-sas-overview) or a publicly accessible URL. URLs that require extra authorization, or expect user interaction are not supported.
 
 > [!NOTE]
 > If you use Azure Blob URL, you can ensure maximum security of your dataset files by using trusted Azure services security mechanism. You will use the same techniques as for Batch transcription and plain Storage Account URLs for your dataset files. See details [here](batch-transcription-audio-data.md#trusted-azure-services-security-mechanism).
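
The requirement added in this change (a dataset URL must be retrievable with a simple anonymous GET request) can be sanity-checked locally before a dataset is created. The following is a minimal sketch, not part of the Speech service or its tooling: the helper name `is_anonymously_retrievable` is hypothetical, and it only approximates the check by sending one credential-free GET with Python's standard library.

```python
import urllib.parse
import urllib.request


def is_anonymously_retrievable(url: str, timeout: float = 10.0) -> bool:
    """Return True if a plain, credential-free GET on `url` gets a 2xx response.

    Hypothetical helper for illustration; it is not a Speech service API.
    """
    # Only HTTP(S) locations are considered here.
    if urllib.parse.urlsplit(url).scheme not in ("http", "https"):
        return False

    # No Authorization header, cookies, or auth handlers are attached.
    request = urllib.request.Request(url, method="GET")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            # Read a single byte to confirm content is served,
            # without downloading the whole dataset.
            response.read(1)
            return 200 <= response.status < 300
    except (OSError, ValueError):
        # HTTP errors such as 401/403/404, connection failures, and timeouts
        # are OSError subclasses; ValueError covers malformed URLs.
        return False
```

A call such as `is_anonymously_retrievable("https://<account>.blob.core.windows.net/<container>/<file>?<sas-token>")` (placeholders, not real values) would be expected to return `True` for a valid SAS URL and `False` for a URL that answers with an authorization error. Because redirects are followed, a URL that redirects to an interactive sign-in page returning HTTP 200 would still pass, so a `True` result is a necessary condition rather than proof that the service will accept the URL.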