You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/speech-service/includes/release-notes/release-notes-stt.md
+26Lines changed: 26 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,6 +7,32 @@ ms.author: eur
7
7
8
8
### January 2025 release
9
9
10
+
#### New Feature - Semantic Segmentation
11
+
Announcing the release of a new feature: Semantic Segmentation. This feature integrates a punctuation module inside decoder that segments audio based on semantic information, resulting in more logical and precise segmentation boundaries.
12
+
Key Benefits:
13
+
- Improved Segmentation Accuracy: By leveraging semantic information, this feature significantly reduces instances of long segments caused by the absence of pauses in the input audio.
14
+
- Reduce latency caused by under-segmentation: The overall latency for the Speech-to-Text service is reduced, with a 40%-60% reduction in the length of the longest 5% of segments.
15
+
- Over-Segmentation Mitigation: This feature also helps prevent over-segmentation by delaying segmentation when a better sentence can be formed.
16
+
17
+
Supported Locales:
18
+
19
+
<li>English (en-US, en-GB)</li>
20
+
<li>Chinese (zh-CN, zh-HK)</li>
21
+
<li>Japanese (ja-JP)</li>
22
+
<li>Korean (ko-KR)</li>
23
+
<li>German (de-DE)</li>
24
+
<li>French (fr-FR)</li>
25
+
<li>Italian (it-IT)</li>
26
+
<li>Spanish (es-ES, es-MX)</li>
27
+
<li>Hindi (hi-IN)</li>
28
+
<li>Portuguese (pt-BR, pt-PT)</li>
29
+
<li>Turkish (tr-TR)</li>
30
+
<li>Russian (ru-RU)</li>
31
+
<li>Thai (th-TH)</li>
32
+
<li>Indonesian (id-ID)</li>
33
+
34
+
For implementation details, please refer to the documentation: [How to Recognize Speech](https://learn.microsoft.com/azure/ai-services/speech-service/how-to-recognize-speech?pivots=programming-language-csharp) in section "Semantic Segmentation".
35
+
10
36
#### Real-time speech to text - New English model release
11
37
12
38
Announcing the release of the latest English speech model (en-US, en-CA), which brings substantial improvements across various performance metrics. Below are the key highlights of this release:
0 commit comments