adding spx to master

Trevor Bye · Trevor Bye · commit c9b2993e21d0 · 2020-05-08T11:19:03.000-07:00
diff --git a/articles/cognitive-services/Speech-Service/index-speech-to-text.yml b/articles/cognitive-services/Speech-Service/index-speech-to-text.yml
@@ -24,6 +24,8 @@ landingContent:
         url: batch-transcription.md
       - text: Speech recognition basics
         url: speech-to-text-basics.md
+      - text: Use SPX for speech-to-text with no code
+        url: spx-overview.md
     - linkListType: quickstart
       links:
         - text: Recognize speech with microphone input
diff --git a/articles/cognitive-services/Speech-Service/index-speech-translation.yml b/articles/cognitive-services/Speech-Service/index-speech-translation.yml
@@ -22,6 +22,8 @@ landingContent:
         url: speech-translation.md
       - text: Speech translation basics
         url: speech-translation-basics.md
+      - text: Use SPX to translate speech with no code
+        url: spx-overview.md
     - linkListType: quickstart
       links:
         - text: Translate speech-to-text
diff --git a/articles/cognitive-services/Speech-Service/index-text-to-speech.yml b/articles/cognitive-services/Speech-Service/index-text-to-speech.yml
@@ -22,6 +22,8 @@ landingContent:
         url: text-to-speech.md
       - text: Speech synthesis basics
         url: text-to-speech-basics.md
+      - text: Use SPX for text-to-speech with no code
+        url: spx-overview.md
     - linkListType: quickstart
       links:
         - text: Synthesize speech to a speaker
diff --git a/articles/cognitive-services/Speech-Service/index.yml b/articles/cognitive-services/Speech-Service/index.yml
@@ -118,6 +118,17 @@ conceptualContent:
       footerLink:
         text: See more
         url: index-voice-assistants.yml
+    - title: Tools
+      links:
+        - itemType: overview
+          text: About SPX - use the Speech service with no code
+          url: spx-overview.md
+        - itemType: how-to-guide
+          text: SPX basics
+          url: spx-basics.md
+        - itemType: overview
+          text: About Speech Studio - no-code Speech service customization
+          url: https://speech.microsoft.com
     - title: Hosting
       links:
         - itemType: how-to-guide
diff --git a/articles/cognitive-services/Speech-Service/spx-basics.md b/articles/cognitive-services/Speech-Service/spx-basics.md
@@ -0,0 +1,130 @@
+---
+title: "SPX basics - Speech service"
+titleSuffix: Azure Cognitive Services
+description: Learn how to use the SPX command line tool to work with the Speech SDK with no code and minimal setup.
+services: cognitive-services
+author: trevorbye
+manager: nitinme
+ms.service: cognitive-services
+ms.subservice: speech-service
+ms.topic: quickstart
+ms.date: 04/04/2020
+ms.author: trbye
+---
+
+# Learn the basics of SPX
+
+In this article, you learn the basic usage patterns of SPX, a command line tool for using the Speech service without writing code. You can quickly test the main features of the Speech service, without creating a development environment or writing any code, to see whether your use cases can be adequately met. Additionally, SPX is production-ready and can be used to automate simple workflows in the Speech service, using `.bat` or shell scripts.
+
+## Prerequisites
+
+The only prerequisite is an Azure Speech subscription. See the [guide](get-started.md#new-resource) on creating a new subscription if you don't already have one.
+
+## Download and install
+
+SPX is available on Windows and Linux. Start by downloading the [zip archive](https://aka.ms/speech/spx-zips.zip), then extract it. SPX requires either the .NET Core or .NET Framework runtime, and the following versions are supported by platform:
+
+* Windows: [.NET Framework 4.7.1](https://dotnet.microsoft.com/download/dotnet-framework/net471), [.NET Core 2.2](https://dotnet.microsoft.com/download/dotnet-core/2.2)
+* Linux: [.NET Core 2.2](https://dotnet.microsoft.com/download/dotnet-core/2.2)
+
+After you've installed a runtime, go to the root directory `spx-zips` that you extracted from the download, and extract the subdirectory that you need (`spx-net471`, for example). In a command prompt, change directory to this location, and then run `spx` to start the application.
+
+## Create subscription config
+
+To start using SPX, you first need to enter your Speech subscription key and region information. See the [region support](https://docs.microsoft.com/azure/cognitive-services/speech-service/regions#speech-sdk) page to find your region identifier. Once you have your subscription key and region identifier (for example, `eastus` or `westus`), run the following commands.
+
+```shell
+spx config @key --set YOUR-SUBSCRIPTION-KEY
+spx config @region --set YOUR-REGION-ID
+```
+
+Your subscription authentication is now stored for future SPX requests. If you need to remove either of these stored values, run `spx config @region --clear` or `spx config @key --clear`.
+
+## Basic usage
+
+This section shows a few basic SPX commands that are often useful for first-time testing and experimentation. Start by performing some speech recognition using your default microphone by running the following command.
+
+```shell
+spx recognize --microphone
+```
+
+After you enter the command, SPX begins listening for audio on the current active input device, and stops when you press `ENTER`. The recorded speech is then recognized and converted to text in the console output. Text-to-speech synthesis is also easy to do using SPX.
+
+Running the following command will take the entered text as input, and output the synthesized speech to the current active output device.
+
+```shell
+spx synthesize --text "Testing synthesis using SPX" --speakers
+```
+
+In addition to speech recognition and synthesis, you can also do speech translation with SPX. As with the speech recognition command above, the following command captures audio from your default microphone and translates it to text in the target language.
+
+```shell
+spx translate --microphone --source en-US --target ru-RU --output file C:\some\file\path\russian_translation.txt
+```
+
+In this command, you specify both the source language (the language to translate **from**) and the target language (the language to translate **to**). With the `--microphone` argument, SPX listens for audio on the current active input device, and stops when you press `ENTER`. The output is a text translation in the target language, written to a text file.
+
+> [!NOTE]
+> See the [language and locale article](language-support.md) for a list of all supported languages with their corresponding locale codes.
+
+## Batch operations
+
+The commands in the previous section are great for quickly seeing how the Speech service works. However, to assess whether your use cases can be met, you likely need to run batch operations against a range of input that you already have, to see how the service handles a variety of scenarios. This section shows how to:
+
+* Run batch speech recognition on a directory of audio files
+* Iterate through a `.tsv` file and run batch text-to-speech synthesis
+
+## Batch speech recognition
+
+If you have a directory of audio files, SPX makes it easy to run batch speech recognition. Run the following command, pointing to your directory with the `--files` argument. In this example, you append `\*.wav` to the directory to recognize all `.wav` files present in the directory. Additionally, specify the `--threads` argument to run the recognition on 10 parallel threads.
+
+> [!NOTE]
+> The `--threads` argument can also be used in the next section for `spx synthesize` commands. The number of available threads depends on the CPU and its current load.
+
+```shell
+spx recognize --files C:\your_wav_file_dir\*.wav --output file C:\output_dir\speech_output.tsv --threads 10
+```
+
+The recognized speech output is written to `speech_output.tsv` using the `--output file` argument. The following is an example of the output file structure.
+
+    audio.input.id    recognizer.session.started.sessionid    recognizer.recognized.result.text
+    sample_1    07baa2f8d9fd4fbcb9faea451ce05475    A sample wave file.
+    sample_2    8f9b378f6d0b42f99522f1173492f013    Sample text synthesized.
+
+## Batch text-to-speech synthesis
+
+The easiest way to run batch text-to-speech is to create a new `.tsv` (tab-separated values) file and use the `--foreach` argument in SPX. Consider the following file `text_synthesis.tsv`:
+
+    audio.output    text
+    C:\batch_wav_output\wav_1.wav    Sample text to synthesize.
+    C:\batch_wav_output\wav_2.wav    Using SPX to run batch-synthesis.
+    C:\batch_wav_output\wav_3.wav    Some more text to test capabilities.
+
+Next, run a command that points to `text_synthesis.tsv`, performs synthesis on each `text` field, and writes the result to the corresponding `audio.output` path as a `.wav` file.
+
+```shell
+spx synthesize --foreach in @C:\your\path\to\text_synthesis.tsv
+```
+
+This command is the equivalent of running `spx synthesize --text "Sample text to synthesize." --audio output C:\batch_wav_output\wav_1.wav` **for each** record in the `.tsv` file, with that record's values substituted. A couple of things to note:
+
+* The column headers, `audio.output` and `text`, correspond to the command line arguments `--audio output` and `--text`, respectively. Multi-part command line arguments like `--audio output` should be formatted in the file with no spaces, no leading dashes, and periods separating strings, e.g. `audio.output`. Any other existing command line arguments can be added to the file as additional columns using this pattern.
+* When the file is formatted in this way, no additional arguments are required to be passed to `--foreach`.
+* Be sure to separate each value in the `.tsv` file with a **tab**.
+
+However, if you have a `.tsv` file like the following example, with column headers that **do not match** command line arguments:
+
+    wav_path    str_text
+    C:\batch_wav_output\wav_1.wav    Sample text to synthesize.
+    C:\batch_wav_output\wav_2.wav    Using SPX to run batch-synthesis.
+    C:\batch_wav_output\wav_3.wav    Some more text to test capabilities.
+
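+You can override these field names with the correct arguments using the following syntax in the `--foreach` call. The result is the same as the previous call. In Linux shells such as Bash, quote the `audio.output;text` value so that the `;` is not treated as a command separator.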
+
+```shell
+spx synthesize --foreach audio.output;text in @C:\your\path\to\text_synthesis.tsv
+```
+
+## Next steps
+
+* Complete the [speech recognition](./quickstarts/speech-to-text-from-microphone.md) or [speech synthesis](./quickstarts/text-to-speech.md) quickstarts using the SDK.
diff --git a/articles/cognitive-services/Speech-Service/spx-overview.md b/articles/cognitive-services/Speech-Service/spx-overview.md
@@ -0,0 +1,46 @@
+---
+title: SPX - Speech service
+titleSuffix: Azure Cognitive Services
+description: SPX is a command line tool for using the Speech service without writing any code. SPX requires minimal setup, and it's easy to immediately start experimenting with key features of the Speech service to see if your use cases can be met.
+services: cognitive-services
+author: trevorbye
+manager: nitinme
+ms.service: cognitive-services
+ms.subservice: speech-service
+ms.topic: conceptual
+ms.date: 04/14/2020
+ms.author: trbye
+---
+
+# What is SPX?
+
+SPX is a command line tool for using the Speech service without writing any code. SPX requires minimal setup, and it's easy to immediately start experimenting with key features of the Speech service to see if your use cases can be met. Within minutes, you can run simple test workflows like batch speech recognition from a directory of files, or text-to-speech on a collection of strings from a file. Beyond simple workflows, SPX is production-ready and can be scaled up to run larger processes using automated `.bat` or shell scripts.
+
+Most of the primary features in the Speech SDK are available in SPX, but some advanced features and customizations are simplified. Consider the following guidance to decide when to use SPX or the SDK.
+
+Use SPX when:
+* You want to experiment with Speech service features with minimal setup and no code
+* You have relatively simple requirements for a production application using the Speech service
+
+Use the SDK when:
+* You want to integrate Speech service functionality within a specific language or platform (e.g. C#, Python, C++)
+* You have complex requirements that may require advanced service requests, or developing custom behavior including response streaming
+
+## Core features
+
+* Speech recognition - Convert speech to text either from audio files or directly from a microphone, or transcribe a recorded conversation.
+
+* Speech synthesis - Convert text to speech using input from text files or directly from the command line. Customize speech output characteristics using [SSML configurations](speech-synthesis-markup.md), and either [standard or neural voices](speech-synthesis-markup.md#standard-neural-and-custom-voices).
+
+* Speech translation - Translate audio in a source language to text in a target language.
+
+* Run on Azure compute resources - Send SPX commands to run on an Azure remote compute resource using `spx webjob`.
+
+## Get started
+
+To get started with SPX, see the [basics article](spx-basics.md). This article shows you how to run some basic commands in SPX, and also shows slightly more advanced commands for running batch operations for speech-to-text and text-to-speech. After reading the basics article, you should have enough of an understanding of the SPX syntax to start writing some custom commands, or automating simple Speech operations.
+
+## Next steps
+
+- [SPX basics](spx-basics.md)
+- If your use-case is more complex, [get the Speech SDK](speech-sdk.md)
diff --git a/articles/cognitive-services/Speech-Service/toc.yml b/articles/cognitive-services/Speech-Service/toc.yml
@@ -539,6 +539,20 @@
       items:
         - name: Speech devices SDK release notes
           href: devices-sdk-release-notes.md
+- name: Tools
+  items:
+  - name: SPX
+    items:
+      - name: What is SPX?
+        href: spx-overview.md
+      - name: SPX basics
+        href: spx-basics.md
+  - name: Speech Studio
+    items:
+      - name: What is Speech Studio?
+        href: https://speech.microsoft.com
+      - name: Create a Custom Commands app with Speech Studio
+        href: quickstart-custom-speech-commands-create-new.md
 - name: Migration
   items:
   - name: From Bing Speech