Merge pull request #237033 from eric-urban/eur/rest-qs

Jill Grant · web-flow · commit 7c14c64a2316 · 2023-05-04T21:20:40.000-06:00
rest qs
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/speech-to-text-basics/rest.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/speech-to-text-basics/rest.md
@@ -14,7 +14,7 @@ ms.author: eur
 
 [!INCLUDE [Prerequisites](../../common/azure-prerequisites.md)]
 
-You will also need a `.wav` audio file on your local machine. You can use your own `.wav` file (up to 30 seconds) or download the [https://crbn.us/whatstheweatherlike.wav](https://crbn.us/whatstheweatherlike.wav) sample file.
+You will also need a `.wav` audio file on your local machine. You can use your own `.wav` file (up to 60 seconds) or download the [https://crbn.us/whatstheweatherlike.wav](https://crbn.us/whatstheweatherlike.wav) sample file.
 
 ### Set environment variables
 
@@ -29,13 +29,10 @@ At a command prompt, run the following cURL command. Replace `YourAudioFile.wav`
 # [Windows](#tab/windows)
 
 ```terminal
-audio_file=@'YourAudioFile.wav'
-
-curl --location --request POST \
-"https://${SPEECH_REGION}.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?language=en-US" \
---header "Ocp-Apim-Subscription-Key: ${SPEECH_KEY}" \
---header "Content-Type: audio/wav" \
---data-binary $audio_file
+curl --location --request POST "https://%SPEECH_REGION%.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?language=en-US&format=detailed" ^
+--header "Ocp-Apim-Subscription-Key: %SPEECH_KEY%" ^
+--header "Content-Type: audio/wav" ^
+--data-binary "@YourAudioFile.wav"
 ```
 
 # [Linux](#tab/linux)
@@ -44,9 +41,9 @@ curl --location --request POST \
 audio_file=@'YourAudioFile.wav'
 
 curl --location --request POST \
-"https://${SPEECH_REGION}.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?language=en-US" ^
---header "Ocp-Apim-Subscription-Key: ${SPEECH_KEY}" ^
---header "Content-Type: audio/wav" ^
+"https://${SPEECH_REGION}.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?language=en-US&format=detailed" \
+--header "Ocp-Apim-Subscription-Key: ${SPEECH_KEY}" \
+--header "Content-Type: audio/wav" \
 --data-binary $audio_file
 ```
 
@@ -56,9 +53,9 @@ curl --location --request POST \
 audio_file=@'YourAudioFile.wav'
 
 curl --location --request POST \
-"https://${SPEECH_REGION}.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?language=en-US" ^
---header "Ocp-Apim-Subscription-Key: ${SPEECH_KEY}" ^
---header "Content-Type: audio/wav" ^
+"https://${SPEECH_REGION}.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?language=en-US&format=detailed" \
+--header "Ocp-Apim-Subscription-Key: ${SPEECH_KEY}" \
+--header "Content-Type: audio/wav" \
 --data-binary $audio_file
 ```
 
@@ -67,7 +64,7 @@ curl --location --request POST \
 > [!IMPORTANT]
 > Make sure that you set the `SPEECH__KEY` and `SPEECH__REGION` environment variables as described [above](#set-environment-variables). If you don't set these variables, the sample will fail with an error message.
 
-You should receive a response similar to what is shown here. The `DisplayText` should be the text that was recognized from your audio file. Up to 30 seconds of audio will be recognized and converted to text.
+You should receive a response similar to what is shown here. The `DisplayText` should be the text that was recognized from your audio file. Up to 60 seconds of audio will be recognized and converted to text.
 
 ```console
 {
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/text-to-speech-basics/rest.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/text-to-speech-basics/rest.md
@@ -27,26 +27,22 @@ At a command prompt, run the following cURL command. Optionally you can rename `
 # [Windows](#tab/windows)
 
 ```terminal
-curl --location --request POST "https://${SPEECH_REGION}.tts.speech.microsoft.com/cognitiveservices/v1" \
---header "Ocp-Apim-Subscription-Key: ${SPEECH_KEY}" \
---header 'Content-Type: application/ssml+xml' \
---header 'X-Microsoft-OutputFormat: audio-16khz-128kbitrate-mono-mp3' \
---header 'User-Agent: curl' \
---data-raw '<speak version='\''1.0'\'' xml:lang='\''en-US'\''>
-    <voice xml:lang='\''en-US'\'' xml:gender='\''Female'\'' name='\''en-US-JennyNeural'\''>
-        my voice is my passport verify me
-    </voice>
-</speak>' > output.mp3
+curl --location --request POST "https://%SPEECH_REGION%.tts.speech.microsoft.com/cognitiveservices/v1" ^
+--header "Ocp-Apim-Subscription-Key: %SPEECH_KEY%" ^
+--header "Content-Type: application/ssml+xml" ^
+--header "X-Microsoft-OutputFormat: audio-16khz-128kbitrate-mono-mp3" ^
+--header "User-Agent: curl" ^
+--data-raw "<speak version='1.0' xml:lang='en-US'><voice xml:lang='en-US' xml:gender='Female' name='en-US-JennyNeural'>my voice is my passport verify me</voice></speak>" --output output.mp3
 ```
 
 # [Linux](#tab/linux)
 
 ```terminal
-curl --location --request POST "https://${SPEECH_REGION}.tts.speech.microsoft.com/cognitiveservices/v1" ^
---header "Ocp-Apim-Subscription-Key: ${SPEECH_KEY}" ^
---header 'Content-Type: application/ssml+xml' ^
---header 'X-Microsoft-OutputFormat: audio-16khz-128kbitrate-mono-mp3' ^
---header 'User-Agent: curl' ^
+curl --location --request POST "https://${SPEECH_REGION}.tts.speech.microsoft.com/cognitiveservices/v1" \
+--header "Ocp-Apim-Subscription-Key: ${SPEECH_KEY}" \
+--header 'Content-Type: application/ssml+xml' \
+--header 'X-Microsoft-OutputFormat: audio-16khz-128kbitrate-mono-mp3' \
+--header 'User-Agent: curl' \
 --data-raw '<speak version='\''1.0'\'' xml:lang='\''en-US'\''>
     <voice xml:lang='\''en-US'\'' xml:gender='\''Female'\'' name='\''en-US-JennyNeural'\''>
         my voice is my passport verify me
@@ -57,11 +53,11 @@ curl --location --request POST "https://${SPEECH_REGION}.tts.speech.microsoft.co
 # [macOS](#tab/macos)
 
 ```terminal
-curl --location --request POST "https://${SPEECH_REGION}.tts.speech.microsoft.com/cognitiveservices/v1" ^
---header "Ocp-Apim-Subscription-Key: ${SPEECH_KEY}" ^
---header 'Content-Type: application/ssml+xml' ^
---header 'X-Microsoft-OutputFormat: audio-16khz-128kbitrate-mono-mp3' ^
---header 'User-Agent: curl' ^
+curl --location --request POST "https://${SPEECH_REGION}.tts.speech.microsoft.com/cognitiveservices/v1" \
+--header "Ocp-Apim-Subscription-Key: ${SPEECH_KEY}" \
+--header 'Content-Type: application/ssml+xml' \
+--header 'X-Microsoft-OutputFormat: audio-16khz-128kbitrate-mono-mp3' \
+--header 'User-Agent: curl' \
 --data-raw '<speak version='\''1.0'\'' xml:lang='\''en-US'\''>
     <voice xml:lang='\''en-US'\'' xml:gender='\''Female'\'' name='\''en-US-JennyNeural'\''>
         my voice is my passport verify me
diff --git a/articles/cognitive-services/Speech-Service/rest-speech-to-text-short.md b/articles/cognitive-services/Speech-Service/rest-speech-to-text-short.md
@@ -20,7 +20,7 @@ Use cases for the speech-to-text REST API for short audio are limited. Use it on
 
 Before you use the speech-to-text REST API for short audio, consider the following limitations:
 
-* Requests that use the REST API for short audio and transmit audio directly can contain no more than 30 seconds of audio. The input [audio formats](#audio-formats) are more limited compared to the [Speech SDK](speech-sdk.md).
+* Requests that use the REST API for short audio and transmit audio directly can contain no more than 60 seconds of audio. The input [audio formats](#audio-formats) are more limited compared to the [Speech SDK](speech-sdk.md).
 * The REST API for short audio returns only final results. It doesn't provide partial results.
 * [Speech translation](speech-translation.md) is not supported via REST API for short audio. You need to use [Speech SDK](speech-sdk.md).
 * [Batch transcription](batch-transcription.md) and [Custom Speech](custom-speech-overview.md) are not supported via REST API for short audio. You should always use the [Speech to Text REST API](rest-speech-to-text.md) for batch transcription and Custom Speech.