Skip to content

Commit e41e82e

Browse files
RoRoJfpagny
andauthored
Apply suggestions from code review
Co-authored-by: fpagny <[email protected]>
1 parent 9ac06ee commit e41e82e

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

pages/generative-apis/how-to/query-audio-models.mdx

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -46,14 +46,14 @@ You can also use the upload button to send supported audio file formats, such as
4646
You can query the models programmatically using your favorite tools or languages.
4747
In the example that follows, we will use the OpenAI Python client.
4848

49-
### Chat Completions API or Audio Transcriptions API?
49+
### Audio Transcriptions API or Chat Completions API?
5050

51-
Both the [Chat Completions API](https://www.scaleway.com/en/developers/api/generative-apis/#path-chat-completions-create-a-chat-completion) and the [Audio Transcriptions API](https://www.scaleway.com/en/developers/api/generative-apis/#path-audio-create-an-audio-transcription) are OpenAI-compatible REST APIs that accept audio input.
52-
53-
The **Chat Completions API** is more suitable when transcribing audio input is part of a broader task, rather than a pure transcription task. For example, building a voice chat assistant which listens and responds in natural language, or sending multiple inputs (audio and text) to be interpreted. This API can be used for audio tasks with compatible multimodal models, such as `voxtral-small-24b`.
51+
Both the [Audio Transcriptions API](https://www.scaleway.com/en/developers/api/generative-apis/#path-audio-create-an-audio-transcription) and the [Chat Completions API](https://www.scaleway.com/en/developers/api/generative-apis/#path-chat-completions-create-a-chat-completion) are OpenAI-compatible REST APIs that accept audio input.
5452

5553
The **Audio Transcriptions API** is designed for pure speech-to-text (audio transcription) tasks, such as transcribing a voice note or meeting recording file. It can be used with compatible audio models, such as `whisper-large-v3`.
5654

55+
The **Chat Completions API** is more suitable for understanding audio input as part of a broader task, rather than a pure transcription task. For example, building a voice chat assistant which listens and responds in natural language, or sending multiple inputs (audio and text) to be interpreted or classified (answering questions like "Is this audio a ringtone?"). This API can be used for audio tasks with compatible multimodal models, such as `voxtral-small-24b`.
56+
5757
<Message type="note">
5858
Scaleway's support for the Audio Transcriptions API is currently at beta stage. Support of the full feature set will be incremental.
5959
</Message>

0 commit comments

Comments
 (0)