Commit 09647a9

first proof guide.en-gb.md
1 parent 2423a68 commit 09647a9

1 file changed: +10 -9 lines changed
pages/public_cloud/ai_machine_learning/endpoints_guide_08_audio_transcriptions/guide.en-gb.md

Lines changed: 10 additions & 9 deletions
@@ -1,7 +1,7 @@
 ---
 title: AI Endpoints - Speech to Text
 excerpt: Learn how to transcribe audio files with OVHcloud AI Endpoints
-updated: 2025-09-30
+updated: 2025-10-01
 ---
 
 > [!primary]
@@ -250,7 +250,6 @@ The `diarize` parameter enables speaker separation in the generated transcript.
 This is useful for meetings, debates, or interviews where multiple people are speaking.
 
 > [!warning]
-> **Warning**:
 > - This parameter is only available with the default `verbose_json` [response format](#response-formats). Using any other will raise an error.
 > - `diarize` is not supported when using the OpenAI client libraries. You must use a direct HTTP request with `requests`, `cURL`, or another HTTP client.
 
@@ -299,7 +298,7 @@ The `prompt` parameter lets you provide extra context to improve transcription.
 - **Translate** generated speech to English.
 
 > [!warning]
-> **Warning**: The prompt **must be written in the same language** as the audio. For example, if your audio is in English, your prompt must also be in English.
+> The prompt **must be written in the same language** as the audio. For example, if your audio is in English, your prompt must also be in English.
 
 **Examples**
 
@@ -380,6 +379,7 @@ The `prompt` parameter lets you provide extra context to improve transcription.
 >> "prompt": "以下是普通話的句子。"
 >> }
 >> ```
+>
 
 #### Timestamp Granularities
 
@@ -431,7 +431,7 @@ The `timestamp_granularities` parameter controls the level of time markers inclu
 >> ```
 >>
 >> > [!warning]
->> > **Warning**: Generating `["word"]` timestamps incurs additional latency.
+>> > Generating `["word"]` timestamps incurs additional latency.
 >>
 > **Word and segment timestamps**
 >>
@@ -461,7 +461,7 @@ The `timestamp_granularities` parameter controls the level of time markers inclu
 >> ```
 >>
 >> > [!warning]
->> > **Warning**: Generating `["word"]` timestamps can incur additional latency.
+>> > Generating `["word"]` timestamps can incur additional latency.
 >>
 
 #### Response Formats
@@ -545,8 +545,7 @@ By **default**, when unset, the audio is **processed as a single block**.
 
 When set to `auto`, the system first normalizes audio loudness and then uses voice activity detection (VAD) to automatically split the audio at natural pauses (silence).
 
-You can also provide a `server_vad` object
-to manually tweak VAD detection parameters. This lets you control the following parameters:
+You can also provide a `server_vad` object to manually tweak VAD detection parameters. This lets you control the following parameters:
 
 - `prefix_padding_ms`: Amount of audio to include before the VAD detected speech (in milliseconds).
 - `silence_duration_ms`: Duration of silence to detect speech stop (in milliseconds). With shorter values the model will respond more quickly, but may jump in on short pauses from the user.
@@ -609,7 +608,9 @@ If your audio file exceeds these limits, you can split it into smaller chunks be
 
 Try to avoid splitting mid-sentence, as this can cause context to be lost and reduce transcription accuracy. Using compressed audio formats can also help reduce file size.
 
-**Example**: Splitting Audio with open-source Python PyDub library:
+**Example**
+
+Splitting Audio with open-source Python PyDub library:
 
 ```python
 from pydub import AudioSegment
@@ -631,7 +632,7 @@ Repeat this process to create multiple chunks, then transcribe each chunk indivi
 
 > [!warning]
 >
-> **Warning**: OVHcloud makes no guarantees about the usability or security of third-party software like PyDub.
+> OVHcloud makes no guarantees about the usability or security of third-party software like PyDub.
 
 ## Conclusion
 