feat: add Mistral audio transcription adapter #3968
Conversation
Add `MistralAdapter` implementing both `RealtimeSttAdapter` and `BatchSttAdapter`:

- Realtime: WebSocket at `/v1/audio/transcriptions/realtime` with base64 PCM audio, `session.update` for audio format config, and parsing of `transcription.text.delta`, `transcription.segment`, and `error` events
- Batch: Multipart POST to `/v1/audio/transcriptions` with `verbose_json` response format and segment-level timestamps
- Register Mistral in `AdapterKind`, `Provider`, and public exports

Co-Authored-By: yujonglee <yujonglee.dev@gmail.com>
Provider::Gladia => GladiaAdapter.build_ws_url(api_base, params, channels),
Provider::ElevenLabs => ElevenLabsAdapter.build_ws_url(api_base, params, channels),
Provider::Mistral => MistralAdapter.build_ws_url(api_base, params, channels),
}
🚩 Proxy path forwards raw binary audio, but Mistral expects base64 JSON text messages
The transcribe-proxy relay handler at crates/transcribe-proxy/src/relay/handler.rs:273-284 forwards client binary WebSocket messages directly to the upstream as binary frames. However, Mistral's realtime WebSocket API expects audio to arrive as JSON text messages with base64-encoded PCM data (the input_audio.append format defined at crates/owhisper-client/src/adapter/mistral/live.rs:56-63).
This means the proxy path (build_proxy_with_adapter in crates/transcribe-proxy/src/routes/streaming/hyprnote.rs:123) will not work correctly for Mistral — binary audio from clients would be forwarded as-is rather than wrapped in base64 JSON.
However, this is the same pre-existing limitation that affects the OpenAI adapter (which also requires base64 JSON via input_audio_buffer.append). Both providers are wired into the proxy dispatch without any binary-to-text audio transformation. The direct client path (ListenClient) handles this correctly via audio_to_message(). If the proxy is not actually used for these providers in production, this is a non-issue — but worth confirming.
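The transformation the proxy path is missing is essentially what `audio_to_message()` already does on the direct client path. A minimal std-only sketch of that wrapping — note the `"type"`/`"audio"` field names are assumptions based on the `input_audio.append` format described above, not verified against Mistral's wire protocol:

```rust
// Sketch only (not the actual owhisper-client code): wrap a raw PCM frame
// in the base64 JSON text message Mistral's realtime endpoint expects.

// Base64 alphabet (RFC 4648, standard, with padding).
const B64: &[u8; 64] = b"ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/";

/// Minimal std-only base64 encoder, to keep the sketch dependency-free.
fn base64_encode(data: &[u8]) -> String {
    let mut out = String::with_capacity((data.len() + 2) / 3 * 4);
    for chunk in data.chunks(3) {
        let b = [chunk[0], *chunk.get(1).unwrap_or(&0), *chunk.get(2).unwrap_or(&0)];
        let n = u32::from(b[0]) << 16 | u32::from(b[1]) << 8 | u32::from(b[2]);
        out.push(B64[(n >> 18) as usize & 63] as char);
        out.push(B64[(n >> 12) as usize & 63] as char);
        out.push(if chunk.len() > 1 { B64[(n >> 6) as usize & 63] as char } else { '=' });
        out.push(if chunk.len() > 2 { B64[n as usize & 63] as char } else { '=' });
    }
    out
}

/// Turn a binary PCM frame into the JSON text frame the upstream expects.
/// Field names ("type", "audio") are hypothetical here.
fn audio_to_message(pcm: &[u8]) -> String {
    format!(r#"{{"type":"input_audio.append","audio":"{}"}}"#, base64_encode(pcm))
}

fn main() {
    let frame: Vec<u8> = vec![0x01, 0x02, 0x03, 0x04];
    // → {"type":"input_audio.append","audio":"AQIDBA=="}
    println!("{}", audio_to_message(&frame));
}
```

If the proxy is meant to support these providers, a transform of this shape applied in the relay handler before forwarding client binary frames would close the gap for both Mistral and OpenAI.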
Summary
Adds a `MistralAdapter` to `owhisper-client` implementing both `RealtimeSttAdapter` (WebSocket) and `BatchSttAdapter` (HTTP).

- Realtime (`live.rs`): Connects to `wss://api.mistral.ai/v1/audio/transcriptions/realtime` using the model `voxtral-mini-transcribe-realtime-2602`. Sends base64-encoded PCM audio via `input_audio.append` JSON messages. Parses `transcription.text.delta` (interim) and `transcription.segment` (final, with timestamps) events.
- Batch (`batch.rs`): Multipart POST to `/v1/audio/transcriptions` with `verbose_json` response format and segment-level timestamps. Default model is `voxtral-mini-latest`.
- Registration: `Mistral` variant added to `Provider`, `AdapterKind`, and public exports.

The WebSocket protocol was derived from Mistral's Python SDK source, not official WebSocket API docs.
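Putting the event names above together, a realtime session might look like the following exchange. This is a sketch only: the exact field layouts are assumptions, since the protocol was reverse-engineered from the SDK rather than taken from documented API specs.

```jsonc
// client → server: configure the audio format (session.update)
{ "type": "session.update", "session": { "audio_format": { "encoding": "pcm", "sample_rate": 16000 } } }

// client → server: base64-encoded PCM chunk (input_audio.append)
{ "type": "input_audio.append", "audio": "<base64 PCM bytes>" }

// server → client: interim hypothesis
{ "type": "transcription.text.delta", "delta": "hello wor" }

// server → client: final segment with segment-level timestamps
{ "type": "transcription.segment", "text": "hello world", "start": 0.0, "end": 1.2 }
```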
Review & Testing Checklist for Human
- The WebSocket protocol events (`input_audio.append`, `input_audio.end`, `session.update`, `transcription.segment`, `transcription.text.delta`) were reverse-engineered from the Python SDK. Run `test_build_single`/`test_build_dual` with a real `MISTRAL_API_KEY` to confirm the handshake and event flow work end-to-end.
- `verbose_json` response shape: `MistralBatchResponse` expects `{ model, text, language, segments: [{ text, start, end }] }`. Run `test_mistral_transcribe` with a real key against a known audio file to confirm deserialization succeeds and segments populate correctly.
- `transcription.segment` events only provide segment-level `start`/`end`. Word timestamps are estimated by dividing the segment duration evenly across words. Verify this approximation is acceptable for downstream consumers (transcript UI, word highlighting, etc.).
- `language_support_live`/`batch` returns `Supported { quality: NoData }` unconditionally (same pattern as the OpenAI adapter). Confirm this is the desired behavior, or whether Mistral has a known supported-language list.

Notes
- Live API tests are `#[ignore]`-gated on `MISTRAL_API_KEY`; only unit tests for JSON parsing run in CI.
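For reviewers checking the word-timestamp approximation called out in the checklist, the even-split estimate can be sketched as follows. Names and types here are illustrative, not the crate's actual definitions:

```rust
// Sketch of the even-split word-timestamp approximation: a
// transcription.segment only carries segment-level start/end, so each
// word is assigned an equal slice of the segment's duration.

#[derive(Debug)]
struct Word {
    text: String,
    start: f64,
    end: f64,
}

/// Distribute a segment's [start, end] interval evenly across its words.
fn estimate_word_timestamps(text: &str, start: f64, end: f64) -> Vec<Word> {
    let words: Vec<&str> = text.split_whitespace().collect();
    if words.is_empty() {
        return Vec::new();
    }
    let step = (end - start) / words.len() as f64;
    words
        .iter()
        .enumerate()
        .map(|(i, w)| Word {
            text: (*w).to_string(),
            start: start + step * i as f64,
            end: start + step * (i + 1) as f64,
        })
        .collect()
}

fn main() {
    // A 2.0s segment with four words: each word spans 0.5s.
    for w in estimate_word_timestamps("hello from the adapter", 1.0, 3.0) {
        println!("{:?}", w);
    }
}
```

The obvious caveat, as the checklist notes, is that real speech is not evenly paced, so word-highlighting UIs built on these values will drift within long segments.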