Skip to content

Commit a5f44c8

Browse files
Merge pull request #2429 from jackykang7777/patch-2
Update release-notes-stt.md for Semantic Segmentation Feature Release
2 parents c2a799c + 07b5bee commit a5f44c8

File tree

1 file changed

+26
-0
lines changed

1 file changed

+26
-0
lines changed

articles/ai-services/speech-service/includes/release-notes/release-notes-stt.md

Lines changed: 26 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -7,6 +7,32 @@ ms.author: eur
77

88
### January 2025 release
99

10+
#### New Feature - Semantic Segmentation
11+
Announcing the release of a new feature: Semantic Segmentation. This feature integrates a punctuation module inside decoder that segments audio based on semantic information, resulting in more logical and precise segmentation boundaries.
12+
Key Benefits:
13+
- Improved Segmentation Accuracy: By leveraging semantic information, this feature significantly reduces instances of long segments caused by the absence of pauses in the input audio.
14+
- Reduce latency caused by under-segmentation: The overall latency for the Speech-to-Text service is reduced, with a 40%-60% reduction in the length of the longest 5% of segments.
15+
- Over-Segmentation Mitigation: This feature also helps prevent over-segmentation by delaying segmentation when a better sentence can be formed.
16+
17+
Supported Locales:
18+
19+
<li>English (en-US, en-GB)</li>
20+
<li>Chinese (zh-CN, zh-HK)</li>
21+
<li>Japanese (ja-JP)</li>
22+
<li>Korean (ko-KR)</li>
23+
<li>German (de-DE)</li>
24+
<li>French (fr-FR)</li>
25+
<li>Italian (it-IT)</li>
26+
<li>Spanish (es-ES, es-MX)</li>
27+
<li>Hindi (hi-IN)</li>
28+
<li>Portuguese (pt-BR, pt-PT)</li>
29+
<li>Turkish (tr-TR)</li>
30+
<li>Russian (ru-RU)</li>
31+
<li>Thai (th-TH)</li>
32+
<li>Indonesian (id-ID)</li>
33+
34+
For implementation details, please refer to the documentation: [How to Recognize Speech](https://learn.microsoft.com/azure/ai-services/speech-service/how-to-recognize-speech?pivots=programming-language-csharp) in section "Semantic Segmentation".
35+
1036
#### Real-time speech to text - New English model release
1137

1238
Announcing the release of the latest English speech model (en-US, en-CA), which brings substantial improvements across various performance metrics. Below are the key highlights of this release:

0 commit comments

Comments
 (0)