Merge pull request #115139 from v-demjoh/spx-qs-2

megvanhuygen · web-flow · commit 990e222f7c0b · 2020-05-20T11:26:33.000-07:00
Quickstarts for SPX tool
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/from-file/spx/header.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/from-file/spx/header.md
@@ -0,0 +1,15 @@
+---
+title: "Quickstart: Recognize speech from a microphone - Speech service"
+titleSuffix: Azure Cognitive Services
+description: TBD
+services: cognitive-services
+author: v-demjoh
+manager: erhopf
+ms.service: cognitive-services
+ms.subservice: speech-service
+ms.topic: include
+ms.date: 5/13/2020
+ms.author: v-demjoh
+---
+
+In this quickstart, you use the SPX tool from the command line to recognize speech recorded in a sound file, and produce a text transcription. It's easy to use the SPX tool to perform common recognition tasks, such as transcribing conversations. After a one-time configuration, the SPX tool lets you transcribe audio into text interactively with a microphone or from files using a batch script.
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/from-file/spx/spx.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/from-file/spx/spx.md
@@ -0,0 +1,31 @@
+---
+author: v-demjoh
+ms.service: cognitive-services
+ms.topic: include
+ms.date: 05/13/2020
+ms.author: v-demjoh
+---
+
+## Find a file that contains speech
+
+The SPX tool can recognize speech in many file formats and natural languages. For this quickstart, you can use
+a WAV file (16kHz or 8kHz, 16-bit, and mono PCM) that contains English speech.
+
+1. Download the <a href="https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/csharp/sharedcontent/console/whatstheweatherlike.wav" download="whatstheweatherlike" target="_blank">whatstheweatherlike.wav <span class="docon docon-download x-hidden-focus"></span></a>.
+2. Copy the `whatstheweatherlike.wav` file to the same directory as the SPX tool binary file.
+
+## Run the SPX tool
+
+Now you're ready to run the SPX tool to recognize speech found in the sound file.
+
+From the command line, change to the directory that contains the SPX tool binary file, and type:
+
+```bash
+spx recognize --file whatstheweatherlike.wav
+```
+
+> [!NOTE]
+> The SPX tool defaults to English. You can choose a different language [from the Speech-to-text table](../../../../language-support.md).
+> For example, add `--source de-DE` to recognize German speech.
+
+The SPX tool will show a text transcription of the speech on the screen. Then the SPX tool will close.
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/from-microphone/spx/header.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/from-microphone/spx/header.md
@@ -0,0 +1,15 @@
+---
+title: "Quickstart: Recognize speech from a microphone - Speech service"
+titleSuffix: Azure Cognitive Services
+description: TBD
+services: cognitive-services
+author: v-demjoh
+manager: erhopf
+ms.service: cognitive-services
+ms.subservice: speech-service
+ms.topic: include
+ms.date: 5/13/2020
+ms.author: v-demjoh
+---
+
+In this quickstart, you use the SPX tool from the command line to interactively recognize speech from a microphone input, and get the text transcription from captured audio. It's easy to use the SPX tool to perform common recognition tasks, such as transcribing conversations. After a one-time configuration, the SPX tool lets you transcribe audio into text interactively with a microphone or from files using a batch script.
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/from-microphone/spx/spx.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/from-microphone/spx/spx.md
@@ -0,0 +1,27 @@
+---
+author: v-demjoh
+ms.service: cognitive-services
+ms.topic: include
+ms.date: 05/13/2020
+ms.author: v-demjoh
+---
+
+## Enable microphone
+
+Plug in and turn on your PC microphone, and turn off any apps that might also use the microphone. Some computers have a built-in microphone,
+while others require configuration of a Bluetooth device.
+
+## Run the SPX tool
+
+Now you're ready to run the SPX tool to recognize speech from your microphone.
+
+1. **Start your app** - From the command line, change to the directory that contains the SPX tool binary file, and type:
+    ```bash
+    spx recognize --microphone
+    ```
+
+    > [!NOTE]
+    > The SPX tool defaults to English. You can choose a different language [from the Speech-to-text table](../../../../language-support.md).
+    > For example, add `--source de-DE` to recognize German speech.
+
+2. **Start recognition** - Speak into the microphone. You will see transcription of your words into text in real-time. The SPX tool will stop after a period of silence, or when you press ctrl-C.
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/spx-next-steps.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/spx-next-steps.md
@@ -0,0 +1,14 @@
+---
+author: v-demjoh
+ms.service: cognitive-services
+ms.topic: include
+ms.date: 05/15/2020
+ms.author: v-demjoh
+---
+
+## Next steps
+
+Continue exploring the basics to learn about other features of the SPX tool.
+
+> [!div class="nextstepaction"]
+> [Explore SPX tool basics](../../spx-basics.md)
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/translate-stt-multiple-languages/spx/header.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/translate-stt-multiple-languages/spx/header.md
@@ -0,0 +1,13 @@
+---
+author: v-demjoh
+ms.service: cognitive-services
+ms.subservice: speech-service
+ms.topic: include
+ms.date: 05/18/2020
+ms.author: v-demjoh
+---
+
+In this quickstart, you use the SPX tool from the command line to convert speech from a microphone input
+to text in multiple other languages.
+After a one-time configuration, the SPX tool lets you translate speech using commands from the command line.
+
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/translate-stt-multiple-languages/spx/spx.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/translate-stt-multiple-languages/spx/spx.md
@@ -0,0 +1,26 @@
+---
+author: v-demjoh
+ms.service: cognitive-services
+ms.subservice: speech-service
+ms.topic: include
+ms.date: 05/18/2020
+ms.author: v-demjoh
+---
+
+
+## Run the SPX tool
+
+Now you're ready to run the SPX tool to translate speech into text in two different languages.
+
+From the command line, change to the directory that contains the SPX tool binary file, and type:
+
+```bash
+spx translate --microphone --target de-DE --target es-MX
+```
+
+The SPX tool will translate natural language spoken English into text printed in German and (Mexican) Spanish.
+Press ENTER to stop the tool.
+
+> [!NOTE]
+> The SPX tool defaults to English. You can choose a different language [from the Speech-to-text table](../../../../language-support.md).
+> For example, add `--source ja-JP` to recognize Japanese speech.
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/translate-stt/spx/header.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/translate-stt/spx/header.md
@@ -0,0 +1,13 @@
+---
+author: v-demjoh
+ms.service: cognitive-services
+ms.subservice: speech-service
+ms.topic: include
+ms.date: 05/18/2020
+ms.author: v-demjoh
+---
+
+In this quickstart, you use the SPX tool from the command line to convert speech from a microphone input
+to text in another language.
+After a one-time configuration, the SPX tool lets you translate speech using commands from the command line.
+
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/translate-stt/spx/spx.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/translate-stt/spx/spx.md
@@ -0,0 +1,26 @@
+---
+author: v-demjoh
+ms.service: cognitive-services
+ms.subservice: speech-service
+ms.topic: include
+ms.date: 05/18/2020
+ms.author: v-demjoh
+---
+
+
+## Run the SPX tool
+
+Now you're ready to run the SPX tool to translate speech into text in a different language.
+
+From the command line, change to the directory that contains the SPX tool binary file, and type:
+
+```bash
+spx translate --microphone --target de-DE
+```
+
+The SPX tool will translate natural language spoken English into text printed in German.
+Press ENTER to stop the tool.
+
+> [!NOTE]
+> The SPX tool defaults to English. You can choose a different language [from the Speech-to-text table](../../../../language-support.md).
+> For example, add `--source ja-JP` to recognize Japanese speech.
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/tts-audio-file/spx/header.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/tts-audio-file/spx/header.md
@@ -0,0 +1,13 @@
+---
+author: v-demjoh
+ms.service: cognitive-services
+ms.subservice: speech-service
+ms.topic: include
+ms.date: 05/18/2020
+ms.author: v-demjoh
+---
+
+In this quickstart, you use the SPX tool from the command line to convert text to speech stored in an audio file. 
+The text-to-speech service provides many options for synthesized voices, 
+under [text-to-speech language support](../../../../language-support.md#text-to-speech). 
+After a one-time configuration, the SPX tool lets you synthesize speech from text using commands from the command line.
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/tts-audio-file/spx/spx.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/tts-audio-file/spx/spx.md
@@ -0,0 +1,21 @@
+---
+author: v-demjoh
+ms.service: cognitive-services
+ms.subservice: speech-service
+ms.topic: include
+ms.date: 05/18/2020
+ms.author: v-demjoh
+---
+
+## Run the SPX tool
+
+Now you're ready to run the SPX tool to synthesize speech from text into a new audio file.
+
+From the command line, change to the directory that contains SPX tool binary file, and type:
+
+```bash
+spx synthesize --text "The speech synthesizer greets you!" --audio output greetings.wav
+```
+
+The SPX tool will produce natural language in English into the `greetings.wav` audio file.
+In Windows, you can play the audio file by entering `start greetings.wav`.
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/tts/spx/header.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/tts/spx/header.md
@@ -0,0 +1,10 @@
+---
+author: v-demjoh
+ms.service: cognitive-services
+ms.subservice: speech-service
+ms.topic: include
+ms.date: 05/18/2020
+ms.author: v-demjoh
+---
+
+In this quickstart, you use the SPX tool from the command line to convert text to speech you hear from your computer's audio speaker. The text-to-speech service provides many options for synthesized voices, under [text-to-speech language support](../../../../language-support.md#text-to-speech). After a one-time configuration, the SPX tool lets you synthesize speech from text using commands from the command line.
diff --git a/articles/cognitive-services/Speech-Service/includes/quickstarts/tts/spx/spx.md b/articles/cognitive-services/Speech-Service/includes/quickstarts/tts/spx/spx.md
@@ -0,0 +1,21 @@
+---
+author: v-demjoh
+ms.service: cognitive-services
+ms.subservice: speech-service
+ms.topic: include
+ms.date: 05/18/2020
+ms.author: v-demjoh
+---
+
+
+## Run the SPX tool
+
+Now you're ready to run the SPX tool to synthesize speech from text.
+
+From the command line, change to the directory that contains the SPX tool binary file, and type:
+
+```bash
+spx synthesize --text "The speech synthesizer greets you!"
+```
+
+The SPX tool will produce natural language in English through the computer speaker.
diff --git a/articles/cognitive-services/Speech-Service/includes/spx-setup.md b/articles/cognitive-services/Speech-Service/includes/spx-setup.md
@@ -0,0 +1,51 @@
+---
+author: v-demjoh
+manager: nitinme
+ms.service: cognitive-services
+ms.topic: include
+ms.date: 05/15/2020
+ms.author: v-demjoh
+---
+
+## Prerequisites
+
+The only prerequisite is an Azure Speech subscription. See the [guide](../get-started.md#new-resource) on creating a new subscription if you don't already have one.
+
+## Download and install
+
+#### [Windows Install](#tab/windowsinstall)
+
+Follow these steps to install the SPX tool on Windows:
+
+1. Install either [.NET Framework 4.7](https://dotnet.microsoft.com/download/dotnet-framework/net471) or [.NET Core 3.0](https://dotnet.microsoft.com/download/dotnet-core/3.0)
+2. Download the SPX tool [zip archive](https://aka.ms/speech/spx-zips.zip), then extract it.
+3. Go to the root directory `spx-zips` that you extracted from the download, and extract the subdirectory that you need (`spx-net471` for .NET Framework 4.7, or `spx-netcore-win-x64` for .NET Core 3.0 on an x64 CPU).
+
+In the command prompt, change directory to this location, and then type `spx` to see help for the SPX tool.
+
+#### [Linux Install](#tab/linuxinstall)
+
+Follow these steps to install the SPX tool on Linux on an x64 CPU:
+
+1. Install [.NET Core 3.0](https://dotnet.microsoft.com/download/dotnet-core/3.0).
+2. Download the SPX tool [zip archive](https://aka.ms/speech/spx-zips.zip), then extract it.
+3. Go to the root directory `spx-zips` that you extracted from the download, and extract `spx-netcore-30-linux-x64` to a new `~/spx` directory.
+4. In a terminal, type these commands:
+   1. `cd ~/spx`
+   2. `sudo chmod +r+x spx`
+   3. `PATH=~/spx:$PATH`
+
+Type `spx` to see help for the SPX tool.
+
+***
+
+## Create subscription config
+
+To start using SPX, you first need to enter your Speech subscription key and region information. See the [region support](https://docs.microsoft.com/azure/cognitive-services/speech-service/regions#speech-sdk) page to find your region identifier. Once you have your subscription key and region identifier (ex. `eastus`, `westus`), run the following commands.
+
+```shell
+spx config @key --set YOUR-SUBSCRIPTION-KEY
+spx config @region --set YOUR-REGION-ID
+```
+
+Your subscription authentication is now stored for future SPX requests. If you need to remove either of these stored values, run `spx config @region --clear` or `spx config @key --clear`.
diff --git a/articles/cognitive-services/Speech-Service/quickstarts/speech-to-text-from-file.md b/articles/cognitive-services/Speech-Service/quickstarts/speech-to-text-from-file.md
@@ -10,7 +10,7 @@ ms.subservice: speech-service
 ms.topic: quickstart
 ms.date: 02/10/2020
 ms.author: dapine
-zone_pivot_groups: programming-languages-set-two-with-js
+zone_pivot_groups: programming-languages-set-two-with-js-spx
 ---
 
 # Quickstart: Recognize speech from an audio file
@@ -45,6 +45,13 @@ zone_pivot_groups: programming-languages-set-two-with-js
 [!INCLUDE [python](../includes/quickstarts/from-file/javascript/javascript.md)]
 ::: zone-end
 
+::: zone pivot="programmer-tool-spx"
+[!INCLUDE [SPX Header](../includes/quickstarts/from-file/spx/header.md)]
+[!INCLUDE [SPX Setup](../includes/spx-setup.md)]
+[!INCLUDE [spx](../includes/quickstarts/from-file/spx/spx.md)]
+[!INCLUDE [next steps to spx basics](../includes/quickstarts/spx-next-steps.md)]
+::: zone-end
+
 ::: zone pivot="programming-language-more"
 [!INCLUDE [Header](../includes/quickstarts/from-file/more/header.md)]
 [!INCLUDE [More samples](../includes/quickstarts/from-file/more/more.md)]
diff --git a/articles/cognitive-services/Speech-Service/quickstarts/speech-to-text-from-microphone.md b/articles/cognitive-services/Speech-Service/quickstarts/speech-to-text-from-microphone.md
@@ -10,7 +10,7 @@ ms.subservice: speech-service
 ms.topic: quickstart
 ms.date: 02/10/2020
 ms.author: dapine
-zone_pivot_groups: programming-languages-set-two-with-js-go
+zone_pivot_groups: programming-languages-set-two-with-js-go-spx
 ---
 
 # Quickstart: Recognize speech from a microphone
@@ -57,6 +57,18 @@ zone_pivot_groups: programming-languages-set-two-with-js-go
 
 ::: zone-end
 
+::: zone pivot="programmer-tool-spx"
+
+[!INCLUDE [SPX Header](../includes/quickstarts/from-microphone/spx/header.md)]
+
+[!INCLUDE [](../includes/spx-setup.md)]
+
+[!INCLUDE [spx](../includes/quickstarts/from-microphone/spx/spx.md)]
+
+[!INCLUDE [next steps to spx basics](../includes/quickstarts/spx-next-steps.md)]
+
+::: zone-end
+
 ::: zone pivot="programming-language-go"
 
 [!INCLUDE [Header](../includes/quickstarts/from-microphone/header.md)]
diff --git a/articles/cognitive-services/Speech-Service/quickstarts/text-to-speech-audio-file.md b/articles/cognitive-services/Speech-Service/quickstarts/text-to-speech-audio-file.md
@@ -10,7 +10,7 @@ ms.subservice: speech-service
 ms.topic: quickstart
 ms.date: 02/10/2020
 ms.author: trbye
-zone_pivot_groups: programming-languages-set-two-with-js
+zone_pivot_groups: programming-languages-set-two-with-js-spx
 ---
 
 # Quickstart: Synthesize speech into an audio file
@@ -45,6 +45,13 @@ zone_pivot_groups: programming-languages-set-two-with-js
 [!INCLUDE [javascript](../includes/quickstarts/tts-audio-file/javascript/javascript.md)]
 ::: zone-end
 
+::: zone pivot="programmer-tool-spx"
+[!INCLUDE [Header](../includes/quickstarts/tts-audio-file/spx/header.md)]
+[!INCLUDE [SPX Setup](../includes/spx-setup.md)]
+[!INCLUDE [Header](../includes/quickstarts/tts-audio-file/spx/spx.md)]
+[!INCLUDE [next steps to spx basics](../includes/quickstarts/spx-next-steps.md)]
+::: zone-end
+
 ::: zone pivot="programming-language-more"
 [!INCLUDE [Header](../includes/quickstarts/tts-audio-file/more/header.md)]
 [!INCLUDE [More samples](../includes/quickstarts/tts-audio-file/more/more.md)]
diff --git a/articles/cognitive-services/Speech-Service/quickstarts/text-to-speech.md b/articles/cognitive-services/Speech-Service/quickstarts/text-to-speech.md
diff --git a/articles/cognitive-services/Speech-Service/quickstarts/translate-speech-to-text-multiple-languages.md b/articles/cognitive-services/Speech-Service/quickstarts/translate-speech-to-text-multiple-languages.md
diff --git a/articles/cognitive-services/Speech-Service/quickstarts/translate-speech-to-text.md b/articles/cognitive-services/Speech-Service/quickstarts/translate-speech-to-text.md
diff --git a/articles/cognitive-services/Speech-Service/spx-basics.md b/articles/cognitive-services/Speech-Service/spx-basics.md
diff --git a/articles/zone-pivot-groups.yml b/articles/zone-pivot-groups.yml