`articles/cognitive-services/Speech-Service/includes/how-to/text-to-speech-basics/text-to-speech-basics-cpp.md` (3 additions, 3 deletions)

@@ -97,7 +97,7 @@ void synthesizeSpeech()
Run the program, and a synthesized `.wav` file is written to the location you specified. This is a good example of the most basic usage, but next you look at customizing output and handling the output response as an in-memory stream for working with custom scenarios.
- ###Synthesize to speaker output
+ ## Synthesize to speaker output
In some cases, you may want to output synthesized speech directly to a speaker. To do this, simply omit the `AudioConfig` param when creating the `SpeechSynthesizer` in the example above. This outputs to the current active output device.

@@ -121,7 +121,7 @@ For many scenarios in speech application development, you likely need the result
It's simple to make this change from the previous example. First, remove the `AudioConfig`, as you will manage the output behavior manually from this point onward for increased control. Then pass `NULL` for the `AudioConfig` in the `SpeechSynthesizer` constructor.
> [!NOTE]
- > Passing `NULL` for the `AudioConfig`, rather than omitting it like in the speaker output example
+ > Passing `NULL` for the `AudioConfig`, rather than omitting it like in the speaker output example
> above, will not play the audio by default on the current active output device.
This time, you save the result to a [`SpeechSynthesisResult`](https://docs.microsoft.com/cpp/cognitive-services/speech/speechsynthesisresult) variable. The `GetAudioData` getter returns a `byte []` of the output data. You can work with this `byte []` manually, or you can use the [`AudioDataStream`](https://docs.microsoft.com/cpp/cognitive-services/speech/audiodatastream) class to manage the in-memory stream. In this example you use the `AudioDataStream.FromResult()` static function to get a stream from the result.
@@ -218,7 +218,7 @@ The output works, but there a few simple additional changes you can make to help
</speak>
```
- ###Neural voices
+ ## Neural voices
Neural voices are speech synthesis algorithms powered by deep neural networks. When you use a neural voice, the synthesized speech is nearly indistinguishable from human recordings. With human-like natural prosody and clear articulation of words, neural voices significantly reduce listening fatigue when users interact with AI systems.

`articles/cognitive-services/Speech-Service/includes/how-to/text-to-speech-basics/text-to-speech-basics-csharp.md`
Run the program, and a synthesized `.wav` file is written to the location you specified. This is a good example of the most basic usage, but next you look at customizing output and handling the output response as an in-memory stream for working with custom scenarios.
- ###Synthesize to speaker output
+ ## Synthesize to speaker output
In some cases, you may want to output synthesized speech directly to a speaker. To do this, simply omit the `AudioConfig` param when creating the `SpeechSynthesizer` in the example above. This outputs to the current active output device.

@@ -116,7 +116,7 @@ For many scenarios in speech application development, you likely need the result
It's simple to make this change from the previous example. First, remove the `AudioConfig` block, as you will manage the output behavior manually from this point onward for increased control. Then pass `null` for the `AudioConfig` in the `SpeechSynthesizer` constructor.
> [!NOTE]
- > Passing `null` for the `AudioConfig`, rather than omitting it like in the speaker output example
+ > Passing `null` for the `AudioConfig`, rather than omitting it like in the speaker output example
> above, will not play the audio by default on the current active output device.
This time, you save the result to a [`SpeechSynthesisResult`](https://docs.microsoft.com/dotnet/api/microsoft.cognitiveservices.speech.speechsynthesisresult?view=azure-dotnet) variable. The `AudioData` property contains a `byte []` of the output data. You can work with this `byte []` manually, or you can use the [`AudioDataStream`](https://docs.microsoft.com/dotnet/api/microsoft.cognitiveservices.speech.audiodatastream?view=azure-dotnet) class to manage the in-memory stream. In this example you use the `AudioDataStream.FromResult()` static function to get a stream from the result.
@@ -211,7 +211,7 @@ The output works, but there a few simple additional changes you can make to help
</speak>
```
- ###Neural voices
+ ## Neural voices
Neural voices are speech synthesis algorithms powered by deep neural networks. When you use a neural voice, the synthesized speech is nearly indistinguishable from human recordings. With human-like natural prosody and clear articulation of words, neural voices significantly reduce listening fatigue when users interact with AI systems.

`articles/cognitive-services/Speech-Service/includes/how-to/text-to-speech-basics/text-to-speech-basics-java.md` (3 additions, 3 deletions)
@@ -85,7 +85,7 @@ public static void main(String[] args) {
Run the program, and a synthesized `.wav` file is written to the location you specified. This is a good example of the most basic usage, but next you look at customizing output and handling the output response as an in-memory stream for working with custom scenarios.
- ###Synthesize to speaker output
+ ## Synthesize to speaker output
In some cases, you may want to output synthesized speech directly to a speaker. To do this, instantiate the `AudioConfig` using the `fromDefaultSpeakerOutput()` static function. This outputs to the current active output device.

@@ -110,7 +110,7 @@ For many scenarios in speech application development, you likely need the result
It's simple to make this change from the previous example. First, remove the `AudioConfig` block, as you will manage the output behavior manually from this point onward for increased control. Then pass `null` for the `AudioConfig` in the `SpeechSynthesizer` constructor.
> [!NOTE]
- > Passing `null` for the `AudioConfig`, rather than omitting it like in the speaker output example
+ > Passing `null` for the `AudioConfig`, rather than omitting it like in the speaker output example
> above, will not play the audio by default on the current active output device.
This time, you save the result to a [`SpeechSynthesisResult`](https://docs.microsoft.com/java/api/com.microsoft.cognitiveservices.speech.speechsynthesisresult?view=azure-java-stable) variable. The `SpeechSynthesisResult.getAudioData()` function returns a `byte []` of the output data. You can work with this `byte []` manually, or you can use the [`AudioDataStream`](https://docs.microsoft.com/java/api/com.microsoft.cognitiveservices.speech.audiodatastream?view=azure-java-stable) class to manage the in-memory stream. In this example you use the `AudioDataStream.fromResult()` static function to get a stream from the result.
@@ -217,7 +217,7 @@ The output works, but there a few simple additional changes you can make to help
</speak>
```
- ###Neural voices
+ ## Neural voices
Neural voices are speech synthesis algorithms powered by deep neural networks. When you use a neural voice, the synthesized speech is nearly indistinguishable from human recordings. With human-like natural prosody and clear articulation of words, neural voices significantly reduce listening fatigue when users interact with AI systems.

`articles/cognitive-services/Speech-Service/includes/how-to/text-to-speech-basics/text-to-speech-basics-python.md` (5 additions, 4 deletions)
@@ -36,7 +36,8 @@ from azure.cognitiveservices.speech.audio import AudioOutputConfig
To call the Speech service using the Speech SDK, you need to create a [`SpeechConfig`](https://docs.microsoft.com/python/api/azure-cognitiveservices-speech/azure.cognitiveservices.speech.speechconfig?view=azure-python). This class includes information about your subscription, like your key and associated region, endpoint, host, or authorization token.
> [!NOTE]
- > Regardless of whether you're performing speech recognition, speech synthesis, translation, or intent recognition, you'll always create a configuration.
+ > Regardless of whether you're performing speech recognition, speech synthesis, translation, or intent
+ > recognition, you'll always create a configuration.
There are a few ways that you can initialize a [`SpeechConfig`](https://docs.microsoft.com/python/api/azure-cognitiveservices-speech/azure.cognitiveservices.speech.speechconfig?view=azure-python):
@@ -70,7 +71,7 @@ synthesizer.speak_text_async("A simple test to write to a file.")
Run the program, and a synthesized `.wav` file is written to the location you specified. This is a good example of the most basic usage, but next you look at customizing output and handling the output response as an in-memory stream for working with custom scenarios.
- ###Synthesize to speaker output
+ ## Synthesize to speaker output
In some cases, you may want to output synthesized speech directly to a speaker. To do this, use the example in the previous section, but change the `AudioOutputConfig` by removing the `filename` param, and set `use_default_speaker=True`. This outputs to the current active output device.
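
The file-vs-speaker choice described above can be sketched as a small helper. This is a minimal sketch, assuming the `azure-cognitiveservices-speech` package; the helper function itself is hypothetical, not part of the SDK:

```python
def audio_output_kwargs(filename=None):
    """Return keyword arguments for the Speech SDK's AudioOutputConfig:
    a file target when a filename is given, otherwise the default speaker
    (mirroring the text above). This helper is hypothetical."""
    if filename is not None:
        return {"filename": filename}
    return {"use_default_speaker": True}

# With the SDK installed (assumption), usage would look like:
# from azure.cognitiveservices.speech.audio import AudioOutputConfig
# audio_config = AudioOutputConfig(**audio_output_kwargs())  # speaker output
# audio_config = AudioOutputConfig(**audio_output_kwargs("file.wav"))  # file
```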

@@ -89,7 +90,7 @@ For many scenarios in speech application development, you likely need the result
It's simple to make this change from the previous example. First, remove the `AudioConfig`, as you will manage the output behavior manually from this point onward for increased control. Then pass `None` for the `AudioConfig` in the `SpeechSynthesizer` constructor.
> [!NOTE]
- > Passing `None` for the `AudioConfig`, rather than omitting it like in the speaker output example
+ > Passing `None` for the `AudioConfig`, rather than omitting it like in the speaker output example
> above, will not play the audio by default on the current active output device.
This time, you save the result to a [`SpeechSynthesisResult`](https://docs.microsoft.com/python/api/azure-cognitiveservices-speech/azure.cognitiveservices.speech.speechsynthesisresult?view=azure-python) variable. The `audio_data` property contains a `bytes` object of the output data. You can work with this object manually, or you can use the [`AudioDataStream`](https://docs.microsoft.com/python/api/azure-cognitiveservices-speech/azure.cognitiveservices.speech.audiodatastream?view=azure-python) class to manage the in-memory stream. In this example you use the `AudioDataStream` constructor to get a stream from the result.
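
As an illustration of working with the raw `bytes` manually, here is a stdlib-only sketch (not the SDK's `AudioDataStream`) that wraps WAV bytes in an in-memory stream and reads the PCM frames back:

```python
import io
import wave

def wav_bytes_to_frames(audio_data: bytes) -> bytes:
    """Read PCM frames from in-memory WAV bytes, such as the bytes object
    a synthesis result's audio_data property returns (per the text above),
    without ever touching the file system."""
    with wave.open(io.BytesIO(audio_data), "rb") as wav:
        return wav.readframes(wav.getnframes())
```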
@@ -171,7 +172,7 @@ The output works, but there a few simple additional changes you can make to help
</speak>
```
- ###Neural voices
+ ## Neural voices
Neural voices are speech synthesis algorithms powered by deep neural networks. When you use a neural voice, the synthesized speech is nearly indistinguishable from human recordings. With human-like natural prosody and clear articulation of words, neural voices significantly reduce listening fatigue when users interact with AI systems.
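
You request a particular neural voice by wrapping your text in SSML, as in the `<speak>` examples above. Here is a minimal sketch of building such a document; the voice name `en-US-AriaNeural` is an assumption, so check the service's published voice list:

```python
def build_ssml(text, voice="en-US-AriaNeural", lang="en-US"):
    """Build a minimal SSML document selecting a voice by name.
    The default voice name is an assumption, not guaranteed by this doc."""
    return (
        f"<speak version='1.0' xmlns='http://www.w3.org/2001/10/synthesis' "
        f"xml:lang='{lang}'>"
        f"<voice name='{voice}'>{text}</voice>"
        "</speak>"
    )

# The resulting string would be passed to the synthesizer's SSML method
# (for example, speak_ssml_async in the Python SDK) instead of plain text.
```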

+ description: Learn how to use the SPX command line tool to work with the Speech SDK with no code and minimal setup.
+ services: cognitive-services
+ author: trevorbye
+ manager: nitinme
+ ms.service: cognitive-services
+ ms.subservice: speech-service
+ ms.topic: quickstart
+ ms.date: 04/04/2020
+ ms.author: trbye
+ ---
+
+ # Learn the basics of SPX
+
+ ## Prerequisites
+
+ The only prerequisite is an Azure Speech subscription. See the [guide](get-started.md#new-resource) on creating a new subscription if you don't already have one.
+
+ ## Download and install
+
+ ## Create subscription config
+
+ To start using SPX, you first need to enter your Speech subscription key and region information. See the [region support](https://docs.microsoft.com/azure/cognitive-services/speech-service/regions#speech-sdk) page to find your region identifier. Once you have your subscription key and region identifier (for example, `eastus` or `westus`), run the following commands.
+
+ ```shell
+ spx config @key --set YOUR-SUBSCRIPTION-KEY
+ spx config @region --set YOUR-REGION-ID
+ ```
+
+ Your subscription authentication is now stored for future SPX requests. If you need to remove either of these stored values, run `spx config @region --clear` or `spx config @key --clear`.