
Commit 8faf138

Merge pull request #202590 from eric-urban/eur/samples-repo-root
user story 1907156
2 parents b8b3eb2 + 5145569 commit 8faf138

File tree

6 files changed: +572 -56 lines changed


articles/cognitive-services/Speech-Service/includes/common/rest.md

Lines changed: 1 addition & 1 deletion

@@ -7,4 +7,4 @@ ms.topic: include
 ms.author: eur
 ---

-[Speech-to-text REST API v3.0 reference](https://westus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-0) | [Speech-to-text REST API for short audio reference](../../rest-speech-to-text-short.md) | [Additional Samples on GitHub](https://github.com/Azure-Samples/cognitive-services-quickstart-code)
+[Speech-to-text REST API v3.0 reference](https://westus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-0) | [Speech-to-text REST API for short audio reference](../../rest-speech-to-text-short.md) | [Additional Samples on GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk)

articles/cognitive-services/Speech-Service/includes/how-to/recognize-speech/rest.md

Lines changed: 17 additions & 5 deletions

@@ -17,10 +17,22 @@ At a command prompt, run the following command. Insert the following values into
 - Your Speech service region.
 - The path for input audio files. You can generate audio files by using [text-to-speech](../../../get-started-text-to-speech.md).

-:::code language="curl" source="~/cognitive-services-quickstart-code/curl/speech/speech-to-text.sh" id="request":::
-
-You should receive a response like the following one:
-
-:::code language="curl" source="~/cognitive-services-quickstart-code/curl/speech/speech-to-text.sh" id="response":::
+```curl
+curl --location --request POST 'https://INSERT_REGION_HERE.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?language=en-US' \
+--header 'Ocp-Apim-Subscription-Key: INSERT_SUBSCRIPTION_KEY_HERE' \
+--header 'Content-Type: audio/wav' \
+--data-binary @'INSERT_AUDIO_FILE_PATH_HERE'
+```
+
+You should receive a response with a JSON body like the following one:
+
+```json
+{
+    "RecognitionStatus": "Success",
+    "DisplayText": "My voice is my passport, verify me.",
+    "Offset": 6600000,
+    "Duration": 32100000
+}
+```

 For more information, see the [speech-to-text REST API reference](../../../rest-speech-to-text.md).
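
The `Offset` and `Duration` values in this response are expressed in 100-nanosecond ticks (10,000,000 ticks per second, matching the `ticks_per_second` constant in the speaker-recognition quickstart later in this commit). A minimal sketch of the conversion:

```cpp
#include <iostream>

int main()
{
    // Offset and Duration in the JSON response are 100-nanosecond ticks.
    auto ticks_per_second = 10000000.0;
    std::cout << "Offset: " << 6600000 / ticks_per_second << " seconds\n";    // 0.66 seconds
    std::cout << "Duration: " << 32100000 / ticks_per_second << " seconds\n"; // 3.21 seconds
}
```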

articles/cognitive-services/Speech-Service/includes/how-to/speech-synthesis/rest.md

Lines changed: 18 additions & 4 deletions

@@ -25,17 +25,31 @@ You might also want to change the following values:
 - The output voice. To get a list of voices available for your Speech service endpoint, see the next section.
 - The output file. In this example, we direct the response from the server into a file named *output.mp3*.

-:::code language="curl" source="~/cognitive-services-quickstart-code/curl/speech/text-to-speech.sh":::
+```curl
+curl --location --request POST 'https://INSERT_REGION_HERE.tts.speech.microsoft.com/cognitiveservices/v1' \
+--header 'Ocp-Apim-Subscription-Key: INSERT_SUBSCRIPTION_KEY_HERE' \
+--header 'Content-Type: application/ssml+xml' \
+--header 'X-Microsoft-OutputFormat: audio-16khz-128kbitrate-mono-mp3' \
+--header 'User-Agent: curl' \
+--data-raw '<speak version='\''1.0'\'' xml:lang='\''en-US'\''>
+    <voice xml:lang='\''en-US'\'' xml:gender='\''Female'\'' name='\''en-US-JennyNeural'\''>
+        my voice is my passport verify me
+    </voice>
+</speak>' > output.mp3
+```

 ## List available voices for your Speech service endpoint

 To list the available voices for your Speech service endpoint, run the following command:

-:::code language="curl" source="~/cognitive-services-quickstart-code/curl/speech/get-voices.sh" id="request":::
+```curl
+curl --location --request GET 'https://INSERT_ENDPOINT_HERE.tts.speech.microsoft.com/cognitiveservices/voices/list' \
+--header 'Ocp-Apim-Subscription-Key: INSERT_SUBSCRIPTION_KEY_HERE'
+```

-You should receive a response like the following one:
+You should receive a response with a JSON body like the following one:

-```http
+```json
 [
     {
         "Name": "Microsoft Server Speech Text to Speech Voice (en-US, ChristopherNeural)",

articles/cognitive-services/Speech-Service/includes/quickstarts/speaker-recognition-basics/cpp.md

Lines changed: 142 additions & 14 deletions

@@ -27,13 +27,34 @@ Before you start, you must install the Speech SDK. Depending on your platform, u

 To run the examples in this article, add the following statements at the top of your .cpp file:

-:::code language="cpp" source="~/cognitive-services-quickstart-code/cpp/speech/speaker-recognition.cpp" id="dependencies":::
+```cpp
+#include <iostream>
+#include <stdexcept>
+// Note: Install the NuGet package Microsoft.CognitiveServices.Speech.
+#include <speechapi_cxx.h>
+
+using namespace std;
+using namespace Microsoft::CognitiveServices::Speech;
+
+// Note: Change the locale if desired.
+auto profile_locale = "en-us";
+auto audio_config = Audio::AudioConfig::FromDefaultMicrophoneInput();
+auto ticks_per_second = 10000000;
+```

 ## Create a speech configuration

 To call the Speech service by using the Speech SDK, create a [`SpeechConfig`](/cpp/cognitive-services/speech/speechconfig) class. This class includes information about your subscription, like your key and associated region, endpoint, host, or authorization token.

-:::code language="cpp" source="~/cognitive-services-quickstart-code/cpp/speech/speaker-recognition.cpp" id="get_speech_config":::
+```cpp
+shared_ptr<SpeechConfig> GetSpeechConfig()
+{
+    auto subscription_key = "PASTE_YOUR_SPEECH_SUBSCRIPTION_KEY_HERE";
+    auto region = "PASTE_YOUR_SPEECH_ENDPOINT_REGION_HERE";
+    auto config = SpeechConfig::FromSubscription(subscription_key, region);
+    return config;
+}
+```

 ## Text-dependent verification

@@ -43,7 +64,19 @@ Speaker verification is the act of confirming that a speaker matches a known, or

 Start by creating the `TextDependentVerification` function:

-:::code language="cpp" source="~/cognitive-services-quickstart-code/cpp/speech/speaker-recognition.cpp" id="text_dependent_verification":::
+```cpp
+void TextDependentVerification(shared_ptr<VoiceProfileClient> client, shared_ptr<SpeakerRecognizer> recognizer)
+{
+    std::cout << "Text Dependent Verification:\n\n";
+    // Create the profile.
+    auto profile = client->CreateProfileAsync(VoiceProfileType::TextDependentVerification, profile_locale).get();
+    std::cout << "Created profile ID: " << profile->GetId() << "\n";
+    AddEnrollmentsToTextDependentProfile(client, profile);
+    SpeakerVerify(profile, recognizer);
+    // Delete the profile.
+    client->DeleteProfileAsync(profile);
+}
+```

 This function creates a [VoiceProfile](/cpp/cognitive-services/speech/speaker-voiceprofile) object with the [CreateProfileAsync](/cpp/cognitive-services/speech/speaker-voiceprofileclient#createprofileasync) method. There are three [types](/cpp/cognitive-services/speech/microsoft-cognitiveservices-speech-namespace#enum-voiceprofiletype) of `VoiceProfile`:

@@ -59,15 +92,44 @@ You then call two helper functions that you'll define next, `AddEnrollmentsToTex

 Define the following function to enroll a voice profile:

-:::code language="cpp" source="~/cognitive-services-quickstart-code/cpp/speech/speaker-recognition.cpp" id="add_enrollments_dependent":::
+```cpp
+void AddEnrollmentsToTextDependentProfile(shared_ptr<VoiceProfileClient> client, shared_ptr<VoiceProfile> profile)
+{
+    shared_ptr<VoiceProfileEnrollmentResult> enroll_result = nullptr;
+    auto phraseResult = client->GetActivationPhrasesAsync(profile->GetType(), profile_locale).get();
+    auto phrases = phraseResult->GetPhrases();
+    while (enroll_result == nullptr || enroll_result->GetEnrollmentInfo(EnrollmentInfoType::RemainingEnrollmentsCount) > 0)
+    {
+        if (phrases != nullptr && phrases->size() > 0)
+        {
+            std::cout << "Please say the passphrase, \"" << phrases->at(0) << "\"\n";
+            enroll_result = client->EnrollProfileAsync(profile, audio_config).get();
+            std::cout << "Remaining enrollments needed: " << enroll_result->GetEnrollmentInfo(EnrollmentInfoType::RemainingEnrollmentsCount) << ".\n";
+        }
+        else
+        {
+            std::cout << "No passphrases received, enrollment not attempted.\n\n";
+        }
+    }
+    std::cout << "Enrollment completed.\n\n";
+}
+```

 In this function, you enroll audio samples in a `while` loop that tracks the number of samples that are still required for enrollment. In each iteration, [EnrollProfileAsync](/cpp/cognitive-services/speech/speaker-voiceprofileclient#enrollprofileasync) prompts you to speak the passphrase into your microphone, and it adds the sample to the voice profile.

 ### SpeakerVerify function

 Define `SpeakerVerify` as follows:

-:::code language="cpp" source="~/cognitive-services-quickstart-code/cpp/speech/speaker-recognition.cpp" id="speaker_verify":::
+```cpp
+void SpeakerVerify(shared_ptr<VoiceProfile> profile, shared_ptr<SpeakerRecognizer> recognizer)
+{
+    shared_ptr<SpeakerVerificationModel> model = SpeakerVerificationModel::FromProfile(profile);
+    std::cout << "Speak the passphrase to verify: \"My voice is my passport, verify me.\"\n";
+    shared_ptr<SpeakerRecognitionResult> result = recognizer->RecognizeOnceAsync(model).get();
+    std::cout << "Verified voice profile for speaker: " << result->ProfileId << ". Score is: " << result->GetScore() << ".\n\n";
+}
+```

 In this function, you create a [SpeakerVerificationModel](/cpp/cognitive-services/speech/speaker-speakerverificationmodel) object with the [SpeakerVerificationModel::FromProfile](/cpp/cognitive-services/speech/speaker-speakerverificationmodel#fromprofile) method, passing in the [VoiceProfile](/cpp/cognitive-services/speech/speaker-voiceprofile) object you created earlier.

@@ -81,7 +143,19 @@ In contrast to *text-dependent* verification, *text-independent* verification do

 Start by creating the `TextIndependentVerification` function:

-:::code language="cpp" source="~/cognitive-services-quickstart-code/cpp/speech/speaker-recognition.cpp" id="text_independent_verification":::
+```cpp
+void TextIndependentVerification(shared_ptr<VoiceProfileClient> client, shared_ptr<SpeakerRecognizer> recognizer)
+{
+    std::cout << "Text Independent Verification:\n\n";
+    // Create the profile.
+    auto profile = client->CreateProfileAsync(VoiceProfileType::TextIndependentVerification, profile_locale).get();
+    std::cout << "Created profile ID: " << profile->GetId() << "\n";
+    AddEnrollmentsToTextIndependentProfile(client, profile);
+    SpeakerVerify(profile, recognizer);
+    // Delete the profile.
+    client->DeleteProfileAsync(profile);
+}
+```

 Like the `TextDependentVerification` function, this function creates a [VoiceProfile](/cpp/cognitive-services/speech/speaker-voiceprofile) object with the [CreateProfileAsync](/cpp/cognitive-services/speech/speaker-voiceprofileclient#createprofileasync) method.

@@ -93,7 +167,28 @@ You then call two helper functions: `AddEnrollmentsToTextIndependentProfile`, wh

 Define the following function to enroll a voice profile:

-:::code language="cpp" source="~/cognitive-services-quickstart-code/cpp/speech/speaker-recognition.cpp" id="add_enrollments_independent":::
+```cpp
+void AddEnrollmentsToTextIndependentProfile(shared_ptr<VoiceProfileClient> client, shared_ptr<VoiceProfile> profile)
+{
+    shared_ptr<VoiceProfileEnrollmentResult> enroll_result = nullptr;
+    auto phraseResult = client->GetActivationPhrasesAsync(profile->GetType(), profile_locale).get();
+    auto phrases = phraseResult->GetPhrases();
+    while (enroll_result == nullptr || enroll_result->GetEnrollmentInfo(EnrollmentInfoType::RemainingEnrollmentsSpeechLength) > 0)
+    {
+        if (phrases != nullptr && phrases->size() > 0)
+        {
+            std::cout << "Please say the activation phrase, \"" << phrases->at(0) << "\"\n";
+            enroll_result = client->EnrollProfileAsync(profile, audio_config).get();
+            std::cout << "Remaining audio time needed: " << enroll_result->GetEnrollmentInfo(EnrollmentInfoType::RemainingEnrollmentsSpeechLength) / ticks_per_second << " seconds.\n";
+        }
+        else
+        {
+            std::cout << "No activation phrases received, enrollment not attempted.\n\n";
+        }
+    }
+    std::cout << "Enrollment completed.\n\n";
+}
+```

 In this function, you enroll audio samples in a `while` loop that tracks the number of seconds of audio still required for enrollment. In each iteration, [EnrollProfileAsync](/cpp/cognitive-services/speech/speaker-voiceprofileclient#enrollprofileasync) prompts you to speak into your microphone, and it adds the sample to the voice profile.

@@ -105,7 +200,19 @@ Speaker identification is used to determine *who* is speaking from a given group

 Start by creating the `TextIndependentIdentification` function:

-:::code language="cpp" source="~/cognitive-services-quickstart-code/cpp/speech/speaker-recognition.cpp" id="text_independent_indentification":::
+```cpp
+void TextIndependentIdentification(shared_ptr<VoiceProfileClient> client, shared_ptr<SpeakerRecognizer> recognizer)
+{
+    std::cout << "Speaker Identification:\n\n";
+    // Create the profile.
+    auto profile = client->CreateProfileAsync(VoiceProfileType::TextIndependentIdentification, profile_locale).get();
+    std::cout << "Created profile ID: " << profile->GetId() << "\n";
+    AddEnrollmentsToTextIndependentProfile(client, profile);
+    SpeakerIdentify(profile, recognizer);
+    // Delete the profile.
+    client->DeleteProfileAsync(profile);
+}
+```

 Like the `TextDependentVerification` and `TextIndependentVerification` functions, this function creates a [VoiceProfile](/cpp/cognitive-services/speech/speaker-voiceprofile) object with the [CreateProfileAsync](/cpp/cognitive-services/speech/speaker-voiceprofileclient#createprofileasync) method.

@@ -117,7 +224,16 @@ You then call two helper functions: `AddEnrollmentsToTextIndependentProfile`, wh

 Define the `SpeakerIdentify` function as follows:

-:::code language="cpp" source="~/cognitive-services-quickstart-code/cpp/speech/speaker-recognition.cpp" id="speaker_identify":::
+```cpp
+void SpeakerIdentify(shared_ptr<VoiceProfile> profile, shared_ptr<SpeakerRecognizer> recognizer)
+{
+    shared_ptr<SpeakerIdentificationModel> model = SpeakerIdentificationModel::FromProfiles({ profile });
+    // Note: We need at least four seconds of audio after pauses are subtracted.
+    std::cout << "Please speak for at least ten seconds to identify who it is from your list of enrolled speakers.\n";
+    shared_ptr<SpeakerRecognitionResult> result = recognizer->RecognizeOnceAsync(model).get();
+    std::cout << "The most similar voice profile is: " << result->ProfileId << " with similarity score: " << result->GetScore() << ".\n\n";
+}
+```

 In this function, you create a [SpeakerIdentificationModel](/cpp/cognitive-services/speech/speaker-speakeridentificationmodel) object with the [SpeakerIdentificationModel::FromProfiles](/cpp/cognitive-services/speech/speaker-speakeridentificationmodel#fromprofiles) method. `SpeakerIdentificationModel::FromProfiles` accepts a list of [VoiceProfile](/cpp/cognitive-services/speech/speaker-voiceprofile) objects. In this case, you pass in the `VoiceProfile` object you created earlier. If you want, you can pass in multiple `VoiceProfile` objects, each enrolled with audio samples from a different voice.
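
For example, a minimal sketch of identification across several speakers, where `profile_alice` and `profile_bob` are hypothetical profiles, each created and enrolled as shown earlier, and `recognizer` is the `SpeakerRecognizer` from this quickstart:

```cpp
// Hypothetical sketch: identify the speaker among multiple enrolled profiles.
auto model = SpeakerIdentificationModel::FromProfiles({ profile_alice, profile_bob });
auto result = recognizer->RecognizeOnceAsync(model).get();
std::cout << "Closest match: " << result->ProfileId << "\n";
```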
123239
@@ -127,11 +243,23 @@ Next, [SpeechRecognizer::RecognizeOnceAsync](/cpp/cognitive-services/speech/spee

 Finally, define the `main` function as follows:

-:::code language="cpp" source="~/cognitive-services-quickstart-code/cpp/speech/speaker-recognition.cpp" id="main":::
+```cpp
+int main()
+{
+    auto speech_config = GetSpeechConfig();
+    auto client = VoiceProfileClient::FromConfig(speech_config);
+    auto recognizer = SpeakerRecognizer::FromConfig(speech_config, audio_config);
+    TextDependentVerification(client, recognizer);
+    TextIndependentVerification(client, recognizer);
+    TextIndependentIdentification(client, recognizer);
+    std::cout << "End of quickstart.\n";
+}
+```

 This function calls the functions you defined previously. First, it creates a [VoiceProfileClient](/cpp/cognitive-services/speech/speaker-voiceprofileclient) object and a [SpeakerRecognizer](/cpp/cognitive-services/speech/speaker-speakerrecognizer) object.

-```
+
+```cpp
 auto speech_config = GetSpeechConfig();
 auto client = VoiceProfileClient::FromConfig(speech_config);
 auto recognizer = SpeakerRecognizer::FromConfig(speech_config, audio_config);
@@ -143,14 +271,14 @@ The `VoiceProfileClient` object is used to create, enroll, and delete voice prof

 The examples in this article use the default device microphone as input for audio samples. In scenarios where you need to use audio files instead of microphone input, change the following line:

-```
+```cpp
 auto audio_config = Audio::AudioConfig::FromDefaultMicrophoneInput();
 ```

 to:

-```
-auto audio_config = Audio::AudioConfig::FromWavFileInput(path/to/your/file.wav);
+```cpp
+auto audio_config = Audio::AudioConfig::FromWavFileInput("path/to/your/file.wav");
 ```

 Or replace any use of `audio_config` with [Audio::AudioConfig::FromWavFileInput](/cpp/cognitive-services/speech/audio-audioconfig#fromwavfileinput). You can also have mixed inputs by using a microphone for enrollment and files for verification, for example.
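
A minimal sketch of such a mixed setup, reusing the `client`, `profile`, and `speech_config` objects from the examples above ("verification-sample.wav" is a placeholder path):

```cpp
// Hypothetical sketch: enroll from the default microphone, then verify from a file.
auto mic_config = Audio::AudioConfig::FromDefaultMicrophoneInput();
auto file_config = Audio::AudioConfig::FromWavFileInput("verification-sample.wav");

auto enroll_result = client->EnrollProfileAsync(profile, mic_config).get();
auto recognizer = SpeakerRecognizer::FromConfig(speech_config, file_config);
```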
