new STT API version and fast transcription

eric-urban · eric-urban · commit b1c862c17e2f · 2024-11-06T07:11:06.000-08:00
diff --git a/articles/ai-services/speech-service/fast-transcription-create.md b/articles/ai-services/speech-service/fast-transcription-create.md
diff --git a/articles/ai-services/speech-service/includes/release-notes/release-notes-stt.md b/articles/ai-services/speech-service/includes/release-notes/release-notes-stt.md
@@ -2,10 +2,24 @@
 author: eric-urban
 ms.service: azure-ai-speech
 ms.topic: include
-ms.date: 7/12/2024
+ms.date: 11/12/2024
 ms.author: eur
 ---
 
+
+### November 2024 release
+
+#### Speech to text REST API version 2024-11-15
+
+The speech to text REST API version 2024-11-15 is released for general availability. For more information, see the [speech to text REST API reference documentation](https://go.microsoft.com/fwlink/?linkid=2296107) and the [Speech to text REST API guide](../../rest-speech-to-text.md).
+
+> [!NOTE]
+> The speech to text REST API version 2024-05-15-preview is deprecated.
+
+#### Fast transcription (GA)
+
+Fast transcription is now generally available via [speech to text REST API version 2024-11-15](https://go.microsoft.com/fwlink/?linkid=2296107). Fast transcription allows you to transcribe audio file to text accurately and synchronously, with a high speed factor. It can transcribe audio much faster than the actual audio duration. For more information, see the [fast transcription API guide](../../fast-transcription-create.md).
+
 ### October 2024 release
 
 #### Video translation (Preview)
@@ -14,7 +28,6 @@ The video translation API is now available in public preview. For more informati
 
 ### September 2024 release
 
-
 #### Real-time speech to text 
 
 [Real-time speech to text](../../how-to-recognize-speech.md) has released new models, with better quality, for the following languages. 
@@ -76,7 +89,7 @@ Speech [pronunciation assessment](../../how-to-pronunciation-assessment.md) now
 
 #### Fast Transcription API (Preview)
 
-Fast transcription is now available in public preview. Fast transcription allows you to transcribe audio file to text accurately and synchronously, with a high speed factor. It can transcribe audio much faster than the actual audio length. For more information, see the [fast transcription API guide](../../fast-transcription-create.md).
+Fast transcription is now available in public preview. Fast transcription allows you to transcribe audio file to text accurately and synchronously, with a high speed factor. It can transcribe audio much faster than the actual audio duration. For more information, see the [fast transcription API guide](../../fast-transcription-create.md).
 
 > [!TIP]
 > Try out fast transcription in [Azure AI Studio](https://aka.ms/fasttranscription/studio).
@@ -88,7 +101,7 @@ Fast transcription is now available in public preview. Fast transcription allows
 The Speech to text REST API version 3.2 is now generally available. For more information about speech to text REST API v3.2, see the [Speech to text REST API v3.2 reference documentation](/rest/api/speechtotext/operation-groups?view=rest-speechtotext-v3.2&preserve-view=true) and the [Speech to text REST API guide](../../rest-speech-to-text.md). 
 
 > [!NOTE]
-> Preview versions *3.2-preview.1* and *3.2-preview.2* will be removed in September 2024.
+> Preview versions *3.2-preview.1* and *3.2-preview.2* are retired as of September 2024.
 
 [Speech to text REST API](../../rest-speech-to-text.md) v3.1 will be retired on a date to be announced. Speech to text REST API v3.0 will be retired on April 1st, 2026. For more information about upgrading, see the Speech to text REST API [v3.0 to v3.1](../../migrate-v3-0-to-v3-1.md) and [v3.1 to v3.2](../../migrate-v3-1-to-v3-2.md) migration guides.
 
diff --git a/articles/ai-services/speech-service/index.yml b/articles/ai-services/speech-service/index.yml
@@ -12,7 +12,7 @@ metadata:
   manager: nitinme
   ms.service: azure-ai-speech
   ms.topic: hub-page
-  ms.date: 8/20/2024
+  ms.date: 11/12/2024
   ms.author: eur
 
 highlightedContent:
@@ -116,8 +116,8 @@ conceptualContent:
           text: Migrate to neural voice
           url: migration-overview-neural-voice.md
         - itemType: how-to-guide
-          text: Migrate to the v3.2 REST API
-          url: migrate-v3-1-to-v3-2.md
+          text: Migrate to speech to text REST API version 2024-11-15
+          url: migrate-2024-11-15.md
         - itemType: how-to-guide
           text: Migrate to Batch synthesis REST API
           url: migrate-to-batch-synthesis.md
diff --git a/articles/ai-services/speech-service/migrate-2024-11-15.md b/articles/ai-services/speech-service/migrate-2024-11-15.md
@@ -0,0 +1,94 @@
+---
+title: Migrate code from v3.2 to version 2024-11-15 - Speech service
+titleSuffix: Azure AI services
+description: This document helps developers migrate code from v3.2 to version 2024-11-15 of the Speech to text REST API.
+author: eric-urban
+ms.author: eur
+manager: nitinme
+ms.service: azure-ai-speech
+ms.topic: how-to
+ms.date: 11/12/2024
+#Customer intent: As a developer, I want to migrate code from v3.2 to version 2024-11-15 of the Speech to text REST API.
+---
+
+# Migrate code from v3.2 to version 2024-11-15
+
+The Speech to text REST API is used for [fast transcription](./fast-transcription-create.md), [batch transcription](batch-transcription.md), and [custom speech](custom-speech-overview.md). This article describes changes from version 3.2 to version 2024-11-15.
+
+> [!IMPORTANT]
+> Speech to text REST API version `2024-11-15` is the latest version that's generally available. 
+> - [Speech to text REST API](rest-speech-to-text.md) version `2024-05-15-preview` will be retired on a date to be announced. 
+> - Speech to text REST API `v3.0`, `v3.1`, `v3.2`, `3.2-preview.1`, and `3.2-preview.2` will be retired on April 1st, 2026. 
+> 
+> For more information about upgrading, see the Speech to text REST API [v3.0 to v3.1](migrate-v3-0-to-v3-1.md), [v3.1 to v3.2](migrate-v3-1-to-v3-2.md), and [v3.2 to 2024-11-15](migrate-2024-11-15.md) migration guides.
+
+## Base path
+
+Custom speech API switched from a path based versioning scheme to a query parameter based scheme in alignment with general Azure API versioning schemes. This required changes to the used base path. Update path from `/speechtotext/v3.2` to `/speechtotext` and append API version with `?api-version=2024-11-15` to all requests.
+
+## Datasets
+
+The `email` property and the connected email notification process is removed from the API.
+
+The `duration` property in dataset responses is renamed from `duration` to `durationMilliseconds` and are now a plain number instead of an ISO8601 formatted string (P1D2H3M4S…) to further simply processing.
+
+The query parameter `sasValidityInSeconds` is renamed to `sasLifetimeMinutes` for getting files. Usage is only allowed for an account with BYOS disabled. For BYOS enabled accounts, SAS URLs aren't returned.
+
+The `project` property is removed in creation requests.
+
+## Models
+
+Removed the `text` property in a model creation request. The alternative is to create a dataset with the text content and create a dataset first, which then is later on used for model creation.
+
+The `email` property and the connected email notification process is removed from the API.
+
+The query parameter `sasValidityInSeconds` is renamed to `sasLifetimeMinutes` for getting files. Usage is only allowed for an account with BYOS (bring your own storage) disabled. For BYOS enabled accounts, SAS URLs aren't returned.
+
+The `GET models/id/manifest` operation now always requires a nonzero SAS lifetime. The corresponding `sasValidityInSeconds` property is renamed to `sasLifetimeMinutes`.
+
+The `project` property is removed in creation requests.
+ 
+## Evaluations
+
+The query parameter `sasValidityInSeconds` is renamed to `sasLifetimeMinutes` for getting files. Usage is only allowed for an account with BYOS disabled. For BYOS enabled accounts, SAS URLs aren't returned.
+
+The `project` property is removed in creation requests
+
+The `email` property and the connected email notification process is removed from the API.
+
+## Endpoints
+
+The API to retrieve and delete log files of endpoint logs is removed. Custom speech now supports BYOS (bring your own storage). Only accounts with BYOS enabled can enable logging on model endpoints. This offers full manageability of log files on customer storage instead of a proxy API.
+
+Removed support for `timeToLive` in endpoint creations.
+
+Removed the `text` property in an endpoint creation request. The alternative is to create a dataset with the text content and create a dataset first, which then is later on used for model creation. This model can then be used to create an endpoint.
+
+Endpoint links now only return endpoint of websocket connection, used for SDK.
+
+The `project` property is removed in creation requests.
+
+The `email` property and the connected email notification process is removed from the API.
+
+## Transcriptions
+
+Removed the top-level `diarizationEnabled` property of a transcription. The diarization configuration is simplified to `"diarization": {"maxSpeakers": 2,"enabled": true}`. The `maxSpeakers` property is optional and defaults to 2. The `enabled` property is required for diarization.
+
+Transcription creation: `timeToLive` renamed to `timeToLiveHours` including a format change from ISO8601 formatted string to a simple int (number of hours).
+
+The `duration` property in transcription responses is renamed from `duration` to `durationMilliseconds` and are now a plain number instead of an ISO8601 formatted string (P1D2H3M4S…) to further simplify processing. Transcription result files have this property added for consistency with API.
+
+The query parameter `sasValidityInSeconds` is renamed to `sasLifetimeMinutes` for getting files. Usage is only allowed for an account with BYOS disabled. For BYOS enabled accounts, SAS URLs aren't returned.
+
+The `project` property is removed in creation requests.
+
+The `email` property and the connected email notification process is removed from the API.
+ 
+## Projects
+
+The projects API is removed.
+
+## Next steps
+
+* [Speech to text REST API](rest-speech-to-text.md)
+* [Speech to text REST API 2024-11-15 reference documentation](/rest/api/speechtotext/operation-groups?view=rest-speechtotext-2024-11-15&preserve-view=true)
diff --git a/articles/ai-services/speech-service/migrate-v3-0-to-v3-1.md b/articles/ai-services/speech-service/migrate-v3-0-to-v3-1.md
@@ -7,7 +7,7 @@ ms.author: eur
 manager: nitinme
 ms.service: azure-ai-speech
 ms.topic: how-to
-ms.date: 9/20/2024
+ms.date: 11/12/2024
 ms.reviewer: heikora
 ms.devlang: csharp
 ms.custom: devx-track-csharp
@@ -16,12 +16,14 @@ ms.custom: devx-track-csharp
 
 # Migrate code from v3.0 to v3.1 of the REST API
 
-The Speech to text REST API is used for [Batch transcription](batch-transcription.md) and [custom speech](custom-speech-overview.md). Changes from version 3.0 to 3.1 are described in the sections below.
+The Speech to text REST API is used for [fast transcription](./fast-transcription-create.md), [batch transcription](batch-transcription.md), and [custom speech](custom-speech-overview.md). Changes from version 3.0 to 3.1 are described in the sections below.
 
 > [!IMPORTANT]
-> Speech to text REST API v3.2 is the latest version that's generally available. Preview versions *3.2-preview.1* and *3*.2-preview.2* will be removed in September 2024.
-> [Speech to text REST API](rest-speech-to-text.md) v3.1 will be retired on a date to be announced.
-> Speech to text REST API v3.0 will be retired on April 1st, 2026. 
+> Speech to text REST API version `2024-11-15` is the latest version that's generally available. 
+> - [Speech to text REST API](rest-speech-to-text.md) version `2024-05-15-preview` will be retired on a date to be announced. 
+> - Speech to text REST API `v3.0`, `v3.1`, `v3.2`, `3.2-preview.1`, and `3.2-preview.2` will be retired on April 1st, 2026. 
+> 
+> For more information about upgrading, see the Speech to text REST API [v3.0 to v3.1](migrate-v3-0-to-v3-1.md), [v3.1 to v3.2](migrate-v3-1-to-v3-2.md), and [v3.2 to 2024-11-15](migrate-2024-11-15.md) migration guides.
 
 ## Base path
 
diff --git a/articles/ai-services/speech-service/migrate-v3-1-to-v3-2.md b/articles/ai-services/speech-service/migrate-v3-1-to-v3-2.md
@@ -6,7 +6,7 @@ author: eric-urban
 manager: nitinme
 ms.service: azure-ai-speech
 ms.topic: how-to
-ms.date: 9/20/2024
+ms.date: 11/12/2024
 ms.author: eur
 ms.devlang: csharp
 ms.custom: devx-track-csharp
@@ -15,12 +15,14 @@ ms.custom: devx-track-csharp
 
 # Migrate code from v3.1 to v3.2 of the REST API
 
-The Speech to text REST API is used for [Batch transcription](batch-transcription.md) and [custom speech](custom-speech-overview.md). This article describes changes from version 3.1 to 3.2.
+The Speech to text REST API is used for [fast transcription](./fast-transcription-create.md), [batch transcription](batch-transcription.md), and [custom speech](custom-speech-overview.md). This article describes changes from version 3.1 to 3.2.
 
 > [!IMPORTANT]
-> Speech to text REST API v3.2 is the latest version that's generally available. Preview versions *3.2-preview.1* and *3*.2-preview.2* will be removed in September 2024.
-> [Speech to text REST API](rest-speech-to-text.md) v3.1 will be retired on a date to be announced.
-> Speech to text REST API v3.0 will be retired on April 1st, 2026. 
+> Speech to text REST API version `2024-11-15` is the latest version that's generally available. 
+> - [Speech to text REST API](rest-speech-to-text.md) version `2024-05-15-preview` will be retired on a date to be announced. 
+> - Speech to text REST API `v3.0`, `v3.1`, `v3.2`, `3.2-preview.1`, and `3.2-preview.2` will be retired on April 1st, 2026. 
+> 
+> For more information about upgrading, see the Speech to text REST API [v3.0 to v3.1](migrate-v3-0-to-v3-1.md), [v3.1 to v3.2](migrate-v3-1-to-v3-2.md), and [v3.2 to 2024-11-15](migrate-2024-11-15.md) migration guides.
 
 ## Base path
 
diff --git a/articles/ai-services/speech-service/releasenotes.md b/articles/ai-services/speech-service/releasenotes.md
@@ -7,7 +7,7 @@ author: eric-urban
 ms.author: eur
 ms.service: azure-ai-speech
 ms.topic: release-notes
-ms.date: 10/9/2024
+ms.date: 11/12/2024
 ms.custom: references_regions
 # Customer intent: As a developer, I want to learn about new releases and features for Azure AI Speech.
 ---
@@ -18,9 +18,9 @@ Azure AI Speech is updated on an ongoing basis. To stay up-to-date with recent d
 
 ## Recent highlights
 
+* Fast transcription is now generally available. It can transcribe audio much faster than the actual audio duration. For more information, see the [fast transcription API guide](fast-transcription-create.md).
 * Azure AI Speech Toolkit extension is now available for Visual Studio Code users. It contains a list of speech quick-starts and scenario samples that can be easily built and run with simple clicks. For more information, see [Azure AI Speech Toolkit in Visual Studio Code Marketplace](https://aka.ms/speech-toolkit-vscode).
 * Azure AI speech high definition (HD) voices are available in public preview. The HD voices can understand the content, automatically detect emotions in the input text, and adjust the speaking tone in real-time to match the sentiment. For more information, see [What are Azure AI Speech high definition (HD) voices?](high-definition-voices.md).
-* Fast transcription is now available in public preview. It can transcribe audio much faster than the actual audio length. For more information, see the [fast transcription API guide](fast-transcription-create.md).
 * Video translation is now available in the Azure AI Speech service. For more information, see [What is video translation?](./video-translation-overview.md).
 * The Azure AI Speech service supports OpenAI text to speech voices. For more information, see [What are OpenAI text to speech voices?](./openai-voices.md). 
 * The custom voice API is available for creating and managing [professional](./professional-voice-create-project.md) and [personal](./personal-voice-create-project.md) custom neural voice models. 
diff --git a/articles/ai-services/speech-service/rest-speech-to-text.md b/articles/ai-services/speech-service/rest-speech-to-text.md
@@ -5,7 +5,7 @@ description: Get reference documentation for Speech to text REST API.
 manager: nitinme
 ms.service: azure-ai-speech
 ms.topic: reference
-ms.date: 9/23/2024
+ms.date: 11/12/2024
 ms.reviewer: eur
 author: eric-urban
 ms.author: eur
@@ -17,22 +17,18 @@ ms.author: eur
 Speech to text REST API is used for [batch transcription](batch-transcription.md) and [custom speech](custom-speech-overview.md). 
 
 > [!IMPORTANT]
-> Speech to text REST API v3.2 is the latest version that's generally available. Preview versions *3.2-preview.1* and *3*.2-preview.2* will be removed in September 2024.
-> [Speech to text REST API](rest-speech-to-text.md) v3.1 will be retired on a date to be announced. For more information about upgrading, see the Speech to text REST API [v3.1 to v3.2](migrate-v3-1-to-v3-2.md) migration guide.
-> Speech to text REST API v3.0 will be retired on April 1st, 2026. For more information about upgrading, see the Speech to text REST API [v3.0 to v3.1](migrate-v3-0-to-v3-1.md) and [v3.1 to v3.2](migrate-v3-1-to-v3-2.md) migration guides.
+> Speech to text REST API version `2024-11-15` is the latest version that's generally available. 
+> - [Speech to text REST API](rest-speech-to-text.md) version `2024-05-15-preview` will be retired on a date to be announced. 
+> - Speech to text REST API `v3.0`, `v3.1`, `v3.2`, `3.2-preview.1`, and `3.2-preview.2` will be retired on April 1st, 2026. 
+> 
+> For more information about upgrading, see the Speech to text REST API [v3.0 to v3.1](migrate-v3-0-to-v3-1.md), [v3.1 to v3.2](migrate-v3-1-to-v3-2.md), and [v3.2 to 2024-11-15](migrate-2024-11-15.md) migration guides.
 
 > [!div class="nextstepaction"]
-> [See the Speech to text REST API 2024-05-15 reference documentation](/rest/api/speechtotext/operation-groups?view=rest-speechtotext-2024-05-15-preview&preserve-view=true)
-
-> [!div class="nextstepaction"]
-> [See the Speech to text REST API v3.2 reference documentation](/rest/api/speechtotext/operation-groups?view=rest-speechtotext-v3.2&preserve-view=true)
-
-> [!div class="nextstepaction"]
-> [See the Speech to text REST API v3.1 reference documentation](/rest/api/speechtotext/operation-groups?view=rest-speechtotext-v3.1&preserve-view=true)
+> [See the Speech to text REST API 2024-11-15 reference documentation](/rest/api/speechtotext/operation-groups?view=rest-speechtotext-2024-11-15&preserve-view=true)
 
 Use Speech to text REST API to:
 
-- [Fast transcription](fast-transcription-create.md): Transcribe audio files with returning results synchronously and much faster than real-time audio. Use the fast transcription API ([/speechtotext/transcriptions:transcribe](/rest/api/speechtotext/transcriptions/transcribe)) in the scenarios that you need the transcript of an audio recording as quickly as possible with predictable latency, such as quick audio or video transcription or video translation.
+- [Fast transcription](fast-transcription-create.md): Transcribe audio files with returning results synchronously and much faster than real-time audio. Use the fast transcription API ([/speechtotext/transcriptions:transcribe](https://go.microsoft.com/fwlink/?linkid=2296107)) in the scenarios that you need the transcript of an audio recording as quickly as possible with predictable latency, such as quick audio or video transcription or video translation.
 - [Custom speech](custom-speech-overview.md): Upload your own data, test and train a custom model, compare accuracy between models, and deploy a model to a custom endpoint. Copy models to other subscriptions if you want colleagues to have access to a model that you built, or if you want to deploy a model to more than one region.
 - [Batch transcription](batch-transcription.md): Transcribe audio files as a batch from multiple URLs or an Azure container. 
 
diff --git a/articles/ai-services/speech-service/toc.yml b/articles/ai-services/speech-service/toc.yml
@@ -40,7 +40,7 @@ items:
         href: get-speech-recognition-results.md
       - name: Real-time diarization quickstart
         href: get-started-stt-diarization.md
-    - name: Fast transcription API (Preview)
+    - name: Fast transcription API
       href: fast-transcription-create.md
     - name: Batch transcription API
       items:
@@ -452,6 +452,9 @@ items:
               href: migrate-to-custom-voice-api.md
         - name: Speech to text REST API migration
           items:
+          - name: From Speech to text v3.2 to 2024-11-15
+            href: migrate-2024-11-15.md
+            displayName: migrate,migration,deprecate,retire,sunset
           - name: From Speech to text v3.1 to v3.2
             href: migrate-v3-1-to-v3-2.md
             displayName: migrate,migration,deprecate,retire,sunset