Commit cc9feee

Improved markdown table
1 parent 883550c commit cc9feee

1 file changed: +15 -8 lines changed

examples/Speech_transcription_methods.ipynb

Lines changed: 15 additions & 8 deletions
@@ -19,12 +19,12 @@
 "metadata": {},
 "source": [
 "### 📊 Quick-look\n",
-"| Mode | Latency to **first token** | Best for (real examples) | What you still need to handle / key limits |\n",
-"|--------------------------------|-------------------------|--------------------------------------------------------------|-----------------------------------------------------------|\n",
-"| File upload + `stream=False` (blocking) | seconds | Voicemail, meeting recordings | • No partial results, users see nothing until file finishes <br>• Max 25 MB per request (you must chunk long audio) |\n",
-"| File upload + `stream=True` | subseconds | Voice memos in mobile apps | • Still requires a completed file <br>• You implement progress bars / chunked uploads |\n",
-"| Realtime WebSocket | subseconds | Live captions in webinars | • Audio must be pcm16, g711_ulaw, or g711_alaw <br>• Session ≤ 30 min, reconnect & stitch <br>• You handle speaker-turn formatting to build the full transcript |\n",
-"| Agents SDK VoicePipeline | subseconds | Internal help-desk assistant | • Python-only beta <br>• API surface may change |"
+"| Mode | Latency to **first token** | Best for (real examples) | Advantages | What you still need to handle / key limits |\n",
+"|--------------------------------|---------------------------|--------------------------------------------------------------|-----------------------------------------------------------|-----------------------------------------------------------|\n",
+"| File upload + `stream=False` (blocking) | seconds | Voicemail, meeting recordings | Simple to set up | • No partial results, users see nothing until file finishes <br>• Max 25 MB per request (you must chunk long audio) |\n",
+"| File upload + `stream=True` | subseconds | Voice memos in mobile apps | Simple to set up & provides a “live” feel via token streaming | • Still requires a completed file <br>• You implement progress bars / chunked uploads |\n",
+"| Realtime WebSocket | subseconds | Live captions in webinars | True real-time; accepts a continuous audio stream | • Audio must be pcm16, g711_ulaw, or g711_alaw <br>• Session ≤ 30 min, reconnect & stitch <br>• You handle speaker-turn formatting to build the full transcript |\n",
+"| Agents SDK VoicePipeline | subseconds | Internal help-desk assistant | Real-time streaming and easy to build agentic workflows | • Python-only beta <br>• API surface may change |"
 ]
 },
 {
@@ -88,6 +88,7 @@
 "import os\n",
 "import time\n",
 "from typing import List\n",
+"from pathlib import Path\n",
 "\n",
 "# ─── Third-Party ───────────────────────────────────────────────────────────────\n",
 "import nest_asyncio\n",
@@ -103,6 +104,7 @@
 " VoicePipeline,\n",
 " VoicePipelineConfig,\n",
 ")\n",
+"from IPython.display import Audio, display\n",
 "# ───────────────────────────────────────────────────────────────────────────────\n",
 "nest_asyncio.apply()\n",
 "\n",
@@ -184,8 +186,6 @@
 }
 ],
 "source": [
-"from IPython.display import Audio, display\n",
-"from pathlib import Path\n",
 "AUDIO_PATH = Path('./data/sample_audio_files/lotsoftimes-78085.mp3') # change me\n",
 "MODEL_NAME = \"gpt-4o-transcribe\"\n",
 "\n",
@@ -617,6 +617,13 @@
 "[User]: Like these next few links.\n",
 "[Assistant]: Comme ces quelques liens suivants."
 ]
+},
+{
+"name": "stderr",
+"output_type": "stream",
+"text": [
+"Error processing turns: no close frame received or sent\n"
+]
 }
 ],
 "source": [

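Note on the Quick-look table: the blocking upload row says long audio must be chunked to stay under the 25 MB request limit. A minimal sketch of how the chunk boundaries might be planned is below. It is not taken from the notebook: `plan_chunks` and `MAX_REQUEST_BYTES` are hypothetical names, the limit is read conservatively as 25,000,000 bytes, and the size estimate assumes a roughly constant bitrate. The actual cutting of the audio (for example with an external tool) is out of scope here.

```python
import math

# Assumption: the table's "Max 25 MB per request", read conservatively as
# 25,000,000 bytes. Check the current API documentation for the exact figure.
MAX_REQUEST_BYTES = 25_000_000


def plan_chunks(duration_s, bytes_per_second, max_bytes=MAX_REQUEST_BYTES):
    """Return (start_s, end_s) windows whose estimated size fits in max_bytes.

    Hypothetical helper: sizes are estimated as duration * bytes_per_second,
    which is only approximate for variable-bitrate audio.
    """
    if bytes_per_second <= 0 or max_bytes <= 0:
        raise ValueError("bytes_per_second and max_bytes must be positive")
    if duration_s <= 0:
        return []

    # Number of equal-length windows needed to keep each one under the limit.
    total_bytes = duration_s * bytes_per_second
    n_chunks = math.ceil(total_bytes / max_bytes)
    window_s = duration_s / n_chunks

    return [
        (i * window_s, min((i + 1) * window_s, duration_s))
        for i in range(n_chunks)
    ]


if __name__ == "__main__":
    # Example: a 2-hour recording at 128 kbit/s (16,000 bytes per second).
    for start, end in plan_chunks(7200, 16_000):
        print(f"{start:8.1f}s -> {end:8.1f}s")
```

Equal-length windows are used rather than filling each request to the limit, so that no chunk ends up as a very short tail. In practice the boundaries would likely need to be nudged to silences so words are not cut in half.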