articles/ai-services/openai/how-to/realtime-audio.md
Lines changed: 5 additions & 8 deletions
@@ -87,7 +87,7 @@ To authenticate:

 Once the WebSocket connection session to `/realtime` is established and authenticated, the functional interaction takes place via events for sending and receiving WebSocket messages. These events each take the form of a JSON object.

-:::image type="content" source="../media/how-to/real-time/realtime-api-sequence.svg" alt-text="Diagram of the Realtime API authentication and connection sequence.":::
+:::image type="content" source="../media/how-to/real-time/realtime-api-sequence.png" alt-text="Diagram of the Realtime API authentication and connection sequence." lightbox="../media/how-to/real-time/realtime-api-sequence.png":::
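
Because each event is a JSON object carried in a WebSocket message, a client mostly needs a pair of helpers to serialize outgoing events and parse incoming ones. The following is a minimal sketch, not the article's own code: the helper names `encode_event` and `decode_event` are hypothetical, and the assumption that every event carries a top-level `type` field should be checked against the Realtime API reference.

```python
import json


def encode_event(event_type, **fields):
    """Serialize one client event as the JSON text of a WebSocket message.

    Assumption: events are identified by a top-level "type" field.
    """
    return json.dumps({"type": event_type, **fields})


def decode_event(message):
    """Parse one received WebSocket text message into an event dict."""
    event = json.loads(message)
    if not isinstance(event, dict) or "type" not in event:
        raise ValueError("expected a JSON object with a 'type' field")
    return event
```

A real client would pass the string from `encode_event` to its WebSocket library's send call and feed each received text frame to `decode_event` before dispatching on the event type.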

 <!--
 sequenceDiagram
@@ -197,7 +197,7 @@ The session relies on caller-initiated [`input_audio_buffer.commit`](../realtime
 - The server responds by sending the [`input_audio_buffer.committed`](../realtime-audio-reference.md#realtimeservereventinputaudiobuffercommitted) event.
 - The server responds by sending the [`conversation.item.created`](../realtime-audio-reference.md#realtimeservereventconversationitemcreated) event.

-:::image type="content" source="../media/how-to/real-time/input-audio-buffer-client-managed.svg" alt-text="Diagram of the Realtime API input audio sequence without server decision mode.":::
+:::image type="content" source="../media/how-to/real-time/input-audio-buffer-client-managed.png" alt-text="Diagram of the Realtime API input audio sequence without server decision mode." lightbox="../media/how-to/real-time/input-audio-buffer-client-managed.png":::
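
The client-managed sequence above can be sketched as a commit message plus a check that the two server events arrived. This is an illustrative sketch only: `commit_message` and `commit_acknowledged` are hypothetical helper names, the `type` field is an assumption about the event shape, and the ordering check simply follows the order in which the bullets list the events.

```python
import json

# Server events listed above as following a client commit, in listed order.
EXPECTED_AFTER_COMMIT = (
    "input_audio_buffer.committed",
    "conversation.item.created",
)


def commit_message():
    """Build the client event that commits the input audio buffer."""
    return json.dumps({"type": "input_audio_buffer.commit"})


def commit_acknowledged(server_events):
    """Return True if both expected events appear, in order, in the stream.

    Other events may be interleaved; each `in` test advances the shared
    iterator, so this checks for an ordered subsequence.
    """
    remaining = iter(event.get("type") for event in server_events)
    return all(expected in remaining for expected in EXPECTED_AFTER_COMMIT)
```

A caller would send `commit_message()` over the socket, collect decoded server events, and treat the user turn as recorded once `commit_acknowledged` returns `True`.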

 <!--
 sequenceDiagram
@@ -221,7 +221,8 @@ The session can be configured with the `turn_detection` type set to `server_vad`
 - The server commits the input audio buffer by sending the [`input_audio_buffer.committed`](../realtime-audio-reference.md#realtimeservereventinputaudiobuffercommitted) event.
 - The server sends the [`conversation.item.created`](../realtime-audio-reference.md#realtimeservereventconversationitemcreated) event with the user message item created from the audio buffer.

-:::image type="content" source="../media/how-to/real-time/input-audio-buffer-server-vad.svg" alt-text="Diagram of the Realtime API input audio sequence with server decision mode.":::
+:::image type="content" source="../media/how-to/real-time/input-audio-buffer-server-vad.png" alt-text="Diagram of the Realtime API input audio sequence with server decision mode." lightbox="../media/how-to/real-time/input-audio-buffer-server-vad.png":::
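
In server decision mode the client configures `turn_detection` once and then waits for the server to commit the buffer on its own. A rough sketch of both halves follows. The `session.update` event name and the nesting of `turn_detection` under a `session` object are assumptions taken from the general shape of the Realtime API rather than from this section, and the helper names are hypothetical.

```python
import json


def server_vad_session_update():
    """Build a session configuration event that enables server-side VAD.

    Assumption: configuration is sent as a "session.update" event whose
    "session" object holds the "turn_detection" settings.
    """
    event = {
        "type": "session.update",
        "session": {"turn_detection": {"type": "server_vad"}},
    }
    return json.dumps(event)


def server_committed_turn(event):
    """Return True when the server reports that it committed the buffer."""
    return event.get("type") == "input_audio_buffer.committed"
```

With this configuration the client does not send `input_audio_buffer.commit` itself; it only watches for the server's `input_audio_buffer.committed` event.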
+

 <!--
 sequenceDiagram
@@ -250,7 +251,7 @@ Optionally, the client can truncate or delete items in the conversation:
 - The client deletes an item in the conversation with a [`conversation.item.delete`](../realtime-audio-reference.md#realtimeclienteventconversationitemdelete) event.
 - The server [`conversation.item.deleted`](../realtime-audio-reference.md#realtimeservereventconversationitemdeleted) event is returned to sync the client and server state.

-:::image type="content" source="../media/how-to/real-time/conversation-item-sequence.svg" alt-text="Diagram of the Realtime API conversation item sequence.":::
+:::image type="content" source="../media/how-to/real-time/conversation-item-sequence.png" alt-text="Diagram of the Realtime API conversation item sequence." lightbox="../media/how-to/real-time/conversation-item-sequence.png":::
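
The delete-and-sync step above might look like the following on the client side. This is a sketch under stated assumptions: the `item_id` field on both events and the `id` key on locally cached items are assumed shapes to verify against the reference, and `delete_item_message` and `apply_server_event` are hypothetical names.

```python
import json


def delete_item_message(item_id):
    """Build the client event asking the server to delete one item.

    Assumption: the item is identified by an "item_id" field.
    """
    return json.dumps({"type": "conversation.item.delete", "item_id": item_id})


def apply_server_event(local_items, event):
    """Return the local item list after applying one server event.

    Only "conversation.item.deleted" changes the list here; the local copy
    is updated on the server's confirmation, not when the request is sent.
    """
    if event.get("type") == "conversation.item.deleted":
        deleted_id = event.get("item_id")
        return [item for item in local_items if item["id"] != deleted_id]
    return list(local_items)
```

Waiting for the server's confirmation before dropping the item keeps the local view from diverging if the delete request fails.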

 <!--
 sequenceDiagram
@@ -269,10 +270,6 @@ sequenceDiagram
 -->


-
-
-
-
 ## Related content

 * Try the [real-time audio quickstart](../realtime-audio-quickstart.md)