Conversation

Contributor

@rlundeen2 rlundeen2 commented Jan 10, 2026

Title says it all! Supporting audio for gpt-audio and also tool calls.

Tests:

  • Added unit tests and integration tests
  • All integration tests running



@dataclass
class OpenAICompletionsAudioConfig:
Contributor

Optional nit: do we want to consider renaming to OpenAIChatAudioConfig, to correspond to our OpenAIChatTarget and make it clear that it's not OpenAICompletionTarget?

Contributor

I disagree, this is neither "nit" nor optional 😆


# Voices supported by OpenAI Chat Completions API audio output.
# See: https://platform.openai.com/docs/guides/text-to-speech#voice-options
CompletionsAudioVoice = Literal["alloy", "ash", "ballad", "coral", "echo", "sage", "shimmer", "verse", "marin", "cedar"]
Contributor

curious why this isn't exactly the same as the list on the platform.openai.com webpage that's linked above? (missing fable, nova, onyx)

Contributor Author

I'll add them!
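For reference, a sketch of what the expanded `Literal` might look like once the three voices the reviewer flagged (fable, nova, onyx) are added; the exact supported set should still be verified against the current OpenAI docs page linked above:

```python
from typing import Literal, get_args

# Sketch of the expanded voice list, adding fable, nova, and onyx from the
# linked text-to-speech voice-options page. Verify against current OpenAI docs.
CompletionsAudioVoice = Literal[
    "alloy", "ash", "ballad", "coral", "echo", "fable", "nova",
    "onyx", "sage", "shimmer", "verse", "marin", "cedar",
]

# The three previously missing voices are now present.
assert {"fable", "nova", "onyx"} <= set(get_args(CompletionsAudioVoice))
```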

extension=extension,
)

if audio_format == "pcm16":
Contributor

might be missing something, but is there a unit test for pcm16 specifically?
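For context on why pcm16 gets its own branch (and is worth a dedicated unit test): unlike wav or mp3, pcm16 is raw, headerless 16-bit PCM, so it typically has to be wrapped in a container before being saved with a normal audio extension. A minimal stdlib-only sketch, assuming mono 24 kHz output (the sample rate is an assumption here and should be checked against OpenAI's docs):

```python
import io
import wave

def pcm16_to_wav(pcm: bytes, sample_rate: int = 24000) -> bytes:
    """Wrap raw 16-bit mono little-endian PCM in a WAV container."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)        # mono
        w.setsampwidth(2)        # 16-bit samples
        w.setframerate(sample_rate)
        w.writeframes(pcm)
    return buf.getvalue()

# What a pcm16-specific unit test could assert: the result is a valid WAV.
wav_bytes = pcm16_to_wav(b"\x00\x00" * 480)  # 20 ms of silence at 24 kHz
assert wav_bytes[:4] == b"RIFF" and wav_bytes[8:12] == b"WAVE"
```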


# Skip audio for assistant messages - OpenAI only allows audio in user messages.
# For assistant responses, the transcript text piece should already be included.
if role == "assistant":
Contributor

is assistant the only other option besides user?
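For the record, Chat Completions messages are not limited to user and assistant; a hedged sketch of the role set per OpenAI's API reference at the time of writing (worth double-checking against current docs), showing why a role check should not assume a two-way split:

```python
# Roles the Chat Completions messages array accepts, per OpenAI's API
# reference; "user" and "assistant" are not the only options.
CHAT_COMPLETION_ROLES = frozenset({"system", "developer", "user", "assistant", "tool"})

def may_carry_input_audio(role: str) -> bool:
    # Only user messages may contain input_audio content parts.
    return role == "user"

assert may_carry_input_audio("user")
assert not may_carry_input_audio("tool")
```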

if not pieces:
raise EmptyResponseException(message="Failed to extract any response content.")

return Message(message_pieces=pieces)
Contributor

So right now you wouldn't be able to tell what is transcript text and what is plain text content; hypothetically, there could be no transcript but there could be text content, with no distinction between the two. Do you see a distinction being useful? (I'm not sure whether the content/value of text content vs. transcript makes it obvious which is which, so being more explicit may be unnecessary.)
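One hypothetical way to make that distinction explicit would be to tag the transcript piece with metadata when it is constructed. The names below are illustrative only, not PyRIT's actual API:

```python
from dataclasses import dataclass, field

# Illustrative sketch only: a piece-level metadata tag lets downstream code
# tell an audio transcript apart from ordinary text content.
@dataclass
class TextPiece:
    value: str
    metadata: dict = field(default_factory=dict)

def is_transcript(piece: TextPiece) -> bool:
    return piece.metadata.get("source") == "audio_transcript"

transcript = TextPiece("hello", metadata={"source": "audio_transcript"})
plain = TextPiece("hello")
assert is_transcript(transcript) and not is_transcript(plain)
```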

content.append(entry)
elif message_piece.converted_value_data_type == "audio_path":
ext = DataTypeSerializer.get_extension(message_piece.converted_value)
if not ext or ext.lower() not in [".wav", ".mp3"]:
Contributor

https://platform.openai.com/docs/guides/speech-to-text says "mp3, mp4, mpeg, mpga, m4a, wav, and webm", so is this just that PyRIT + OpenAI chat completions only supports .wav and .mp3? Because then we should maybe be more exact.
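Worth noting the two docs pages cover different endpoints: the longer list (mp3, mp4, mpeg, mpga, m4a, wav, webm) is for the transcription endpoint, while the Chat Completions input_audio content part accepts only wav and mp3. A sketch of an error message that makes that explicit (the helper name is illustrative, not PyRIT's API):

```python
# The Chat Completions input_audio part accepts only "wav" and "mp3";
# the wider list on the speech-to-text page applies to the transcription
# endpoint, not chat completions.
SUPPORTED_CHAT_AUDIO_FORMATS = {".wav": "wav", ".mp3": "mp3"}

def chat_audio_format(extension: str) -> str:
    fmt = SUPPORTED_CHAT_AUDIO_FORMATS.get(extension.lower())
    if fmt is None:
        raise ValueError(
            f"Unsupported audio extension {extension!r}: the Chat Completions "
            "input_audio content part accepts only .wav and .mp3."
        )
    return fmt
```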

4 participants