Skip to content

Commit 732cf24

Browse files
committed
Fixed PR comments
1 parent 3e6d268 commit 732cf24

File tree

2 files changed

+4
-6
lines changed

2 files changed

+4
-6
lines changed

examples/Speech_transcription_methods.ipynb

Lines changed: 3 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -5,17 +5,15 @@
55
"id": "26a10eea",
66
"metadata": {},
77
"source": [
8-
"# 🗣️ Methods of Speech-to-Text using OpenAI API & Agents SDK\n",
9-
"\n",
10-
"**Updated : April 29 2025** \n",
8+
"# 🗣️ Comparing Speech-to-Text Methods with the OpenAI API\n",
119
"\n",
1210
"## Overview\n",
1311
"\n",
1412
"This notebook provides a clear, hands-on guide for beginners to quickly get started with Speech-to-Text (STT) using the OpenAI API. You'll explore multiple practical methods, their use cases, and considerations.\n",
1513
"\n",
1614
"By the end you will be able to select and use the appropriate transcription method for your use use cases.\n",
1715
"\n",
18-
"Assumption: For simplicity and ease of use, this notebook uses WAV audio files. Real-time microphone streaming (e.g., from web apps or microphones) is not utilized."
16+
"*Note: For simplicity and ease of use, this notebook uses WAV audio files. Real-time microphone streaming (e.g., from web apps or microphones) is not utilized.*"
1917
]
2018
},
2119
{
@@ -24,7 +22,7 @@
2422
"metadata": {},
2523
"source": [
2624
"### 📊 Quick-look\n",
27-
"| Mode | Latency to **first token** | Best for (real examples) | Advantages | What you still need to handle / key limits |\n",
25+
"| Mode | Latency to **first token** | Best for (real examples) | Advantages | Key limitations |\n",
2826
"|--------------------------------|---------------------------|--------------------------------------------------------------|-----------------------------------------------------------|-----------------------------------------------------------|\n",
2927
"| File upload + `stream=False` (blocking) | seconds | Voicemail, meeting recordings | Simple to set up | • No partial results, users see nothing until file finishes <br>• Max 25 MB per request (you must chunk long audio) |\n",
3028
"| File upload + `stream=True` | subseconds | Voice memos in mobile apps | Simple to set up & provides a “live” feel via token streaming | • Still requires a completed file <br>• You implement progress bars / chunked uploads |\n",

registry.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@
44
# should build pages for, and indicates metadata such as tags, creation date and
55
# authors for each page.
66

7-
- title: Methods of Speech-to-Text using OpenAI API & Agents SDK
7+
- title: Comparing Speech-to-Text Methods with the OpenAI API
88
path: examples/Speech_transcription_methods.ipynb
99
date: 2025-04-29
1010
authors:

0 commit comments

Comments
 (0)