Skip to content

Commit 765f8ea

Browse files
committed
Update Realtime Prompting Guide for clarity and consistency
- Revised text to enhance clarity, including changes to descriptions of the gpt-realtime model and its capabilities. - Updated audio file references to reflect the new gpt-realtime model. - Improved formatting and added additional context for better understanding of prompting techniques.
1 parent 5c68ce4 commit 765f8ea

File tree

1 file changed

+26
-21
lines changed

1 file changed

+26
-21
lines changed

examples/Realtime_prompting_guide.ipynb

Lines changed: 26 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -15,12 +15,12 @@
1515
"\n",
1616
"Today, we’re releasing gpt-realtime — our most capable speech-to-speech model yet in the API and announcing the general availability of the Realtime API. \n",
1717
"\n",
18-
"Speech-to-speech systems are essential for enabling voice as a core AI interface. Today’s release enhances robustness and usability, giving enterprises the confidence to deploy mission-critical voice agents at scale.\n",
18+
"Speech-to-speech systems are essential for enabling voice as a core AI interface. The new release enhances robustness and usability, giving enterprises the confidence to deploy mission-critical voice agents at scale.\n",
1919
"\n",
2020
"\n",
21-
"The new gpt-realtime model delivers stronger instruction following, more reliable tool calling, noticeably better voice quality, and an overall smoother feel. These gains make it practical to move from chained approaches to true realtime experiences, cutting latency, carrying over more nuance from spoken input, and producing responses that sound more natural and expressive.\n",
21+
"The new gpt-realtime model delivers stronger instruction following, more reliable tool calling, noticeably better voice quality, and an overall smoother feel. These gains make it practical to move from chained approaches to true realtime experiences, cutting latency and producing responses that sound more natural and expressive.\n",
2222
"\n",
23-
"Realtime model benefit from different prompting techniques that wouldn't directly apply to text based models. This prompting guide starts with a simple prompt skeleton, then walks through each part with practical tips, small patterns you can copy, and examples you can adapt to your use case.\n",
23+
"Realtime model benefit from different prompting techniques that wouldn't directly apply to text based models. This prompting guide starts with a suggested prompt skeleton, then walks through each part with practical tips, small patterns you can copy, and examples you can adapt to your use case.\n",
2424
"\n",
2525
"# Table of Contents\n",
2626
"\n",
@@ -185,7 +185,7 @@
185185
"id": "d36c87e3",
186186
"metadata": {},
187187
"source": [
188-
"This is the audio from our new GA model `gpt-4o-realtime-preview-2025-08-29`"
188+
"This is the audio from our new GA model `gpt-realtime`"
189189
]
190190
},
191191
{
@@ -271,7 +271,7 @@
271271
"id": "084f7cae",
272272
"metadata": {},
273273
"source": [
274-
"This is the audio from our new GA model `gpt-4o-realtime-preview-2025-08-29`"
274+
"This is the audio from our new GA model `gpt-realtime`"
275275
]
276276
},
277277
{
@@ -354,7 +354,9 @@
354354
"- Start your response very happy\n",
355355
"- Midway, change to sad\n",
356356
"- At the end change your mood to very angry\n",
357-
"```"
357+
"```\n",
358+
"\n",
359+
"This is the audio from our new GA model `gpt-realtime`"
358360
]
359361
},
360362
{
@@ -435,7 +437,7 @@
435437
"id": "da1ac60b",
436438
"metadata": {},
437439
"source": [
438-
"This is the audio from our old `gpt-4o-realtime-preview-2025-06-03`"
440+
"This is the audio from our old `gpt-4o-realtime-preview-2025-06-03` with speed instructions"
439441
]
440442
},
441443
{
@@ -472,7 +474,7 @@
472474
"id": "77cd87ce",
473475
"metadata": {},
474476
"source": [
475-
"This is the audio from our new GA model `gpt-4o-realtime-preview-2025-08-29` after adding the instruction to the same prompt."
477+
"This is the audio from our new GA model `gpt-realtime` with speed instructions"
476478
]
477479
},
478480
{
@@ -553,7 +555,7 @@
553555
"id": "4dd0c9bf",
554556
"metadata": {},
555557
"source": [
556-
"This is the responses after applying the instruction using `gpt-4o-realtime-preview-2025-08-29`\n",
558+
"This is the responses after applying the instruction using `gpt-realtime`\n",
557559
"\n",
558560
"<img\n",
559561
" src=\"../images/lang_constraint_en.png\"\n",
@@ -598,7 +600,7 @@
598600
"id": "891519ed",
599601
"metadata": {},
600602
"source": [
601-
"This is the responses after applying the instruction using `gpt-4o-realtime-preview-2025-08-29`\n",
603+
"This is the responses after applying the instruction using `gpt-realtime`\n",
602604
"\n",
603605
"\n",
604606
"<img\n",
@@ -655,7 +657,7 @@
655657
"id": "7d0da635",
656658
"metadata": {},
657659
"source": [
658-
"This is the responses **before** applying the instruction using `gpt-4o-realtime-preview-2025-08-29`\n",
660+
"This is the responses **before** applying the instruction using `gpt-realtime`\n",
659661
"\n",
660662
"<img\n",
661663
" src=\"../images/repeat_before.png\"\n",
@@ -668,7 +670,7 @@
668670
"id": "84fc575c",
669671
"metadata": {},
670672
"source": [
671-
"This is the responses **after** applying the instruction using `gpt-4o-realtime-preview-2025-08-29`\n",
673+
"This is the responses **after** applying the instruction using `gpt-realtime`\n",
672674
"\n",
673675
"\n",
674676
"<img\n",
@@ -752,7 +754,7 @@
752754
"id": "93f9b8b5",
753755
"metadata": {},
754756
"source": [
755-
"This is the audio from our new GA model `gpt-4o-realtime-preview-2025-08-29` using the reference pronunciations.\n",
757+
"This is the audio from our new GA model `gpt-realtime` using the reference pronunciations.\n",
756758
"\n",
757759
"It is able to correctly pronounce SQL as \"sequel\"."
758760
]
@@ -848,7 +850,7 @@
848850
"id": "43f9826e",
849851
"metadata": {},
850852
"source": [
851-
"This is the responses **before** applying the instruction using `gpt-4o-realtime-preview-2025-08-29`\n",
853+
"This is the responses **before** applying the instruction using `gpt-realtime`\n",
852854
"\n",
853855
"> Sure! The number is 55119765423. Let me know if you need anything else!"
854856
]
@@ -858,7 +860,7 @@
858860
"id": "54a62295",
859861
"metadata": {},
860862
"source": [
861-
"This is the responses **after** applying the instruction using `gpt-4o-realtime-preview-2025-08-29`\n",
863+
"This is the responses **after** applying the instruction using `gpt-realtime`\n",
862864
"\n",
863865
"> Sure! The number is: 5-5-1-1-1-9-7-6-5-4-2-3. Please let me know if you need anything else!"
864866
]
@@ -956,7 +958,9 @@
956958
"- Always respond in the same language the user is speaking in, if intelligible.\n",
957959
"- Only respond to clear audio or text. \n",
958960
"- If the user's audio is not clear (e.g. ambiguous input/background noise/silent/unintelligible) or if you did not fully hear or understand the user, ask for clarification using {preferred_language} phrases.\n",
959-
"```"
961+
"```\n",
962+
"\n",
963+
"This is the responses **after** applying the instruction using `gpt-realtime`"
960964
]
961965
},
962966
{
@@ -1083,7 +1087,7 @@
10831087
"id": "e8839685",
10841088
"metadata": {},
10851089
"source": [
1086-
"This is the responses after applying the instruction using `gpt-4o-realtime-preview-2025-08-29`\n",
1090+
"This is the responses after applying the instruction using `gpt-realtime`\n",
10871091
"\n",
10881092
"<img\n",
10891093
" src=\"../images/tool_proactive.png\"\n",
@@ -1170,7 +1174,7 @@
11701174
"id": "672cb487",
11711175
"metadata": {},
11721176
"source": [
1173-
"This is the responses **after** applying the instruction using `gpt-4o-realtime-preview-2025-08-29`\n",
1177+
"This is the responses **after** applying the instruction using `gpt-realtime`\n",
11741178
"\n",
11751179
"<img\n",
11761180
" src=\"../images/tool_no_confirm.png\"\n",
@@ -1358,7 +1362,7 @@
13581362
"id": "7d30e0d5",
13591363
"metadata": {},
13601364
"source": [
1361-
"Here’s an example without rephrasing instruction:\n",
1365+
"Here’s an example without the rephrasing instruction:\n",
13621366
"\n",
13631367
">Assistant: Your current credit card balance is positive at 32,323,232 AUD."
13641368
]
@@ -1368,7 +1372,7 @@
13681372
"id": "91f73021",
13691373
"metadata": {},
13701374
"source": [
1371-
"Here’s the same example with rephrasing instruction:\n",
1375+
"Here’s the same example with the rephrasing instruction:\n",
13721376
"\n",
13731377
">Assistant: Just finished checking that—your credit card balance is thirty-two million three hundred twenty-three thousand two hundred thirty-two dollars in your favor. Your last payment was processed on August first. Does that match what you expected?"
13741378
]
@@ -1484,6 +1488,7 @@
14841488
"metadata": {},
14851489
"source": [
14861490
"## Sample phrases\n",
1491+
"Sample phrases act as “anchor examples” for the model. They show the style, brevity, and tone you want it to follow, without locking it into one rigid response.\n",
14871492
"\n",
14881493
"- **When to use**: Responses lack your brand style or are not consistent.\n",
14891494
"- **What it does**: Provides sample phrases the model can vary to stay natural and brief.\n",
@@ -1874,7 +1879,7 @@
18741879
"id": "860fdb2e",
18751880
"metadata": {},
18761881
"source": [
1877-
"This is the conversation responses from our new GA model `gpt-4o-realtime-preview-2025-08-29` using the instruction.\n",
1882+
"This is the conversation responses from our new GA model `gpt-realtime` using the instruction.\n",
18781883
"\n",
18791884
"<img\n",
18801885
" src=\"../images/escalate_07.png\"\n",

0 commit comments

Comments
 (0)