Conversation
Some models (incl. `o4-mini-deep-research`) aren't compatible with the Chat Completions API. This PR introduces a new `Response` class which, similarly to `Chat` (and inheriting from the same base `Conversation` class), allows a user to target the `responses` endpoint.

This change would be backward compatible, as it falls back to the Chat Completions API when you provide audio.
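A rough sketch of the class shape the PR describes: `Chat` and a new `Response` both inheriting from a shared `Conversation` base, each targeting a different endpoint. Only the names `Conversation`, `Chat`, and `Response` come from the PR text; everything else here is a hypothetical stand-in, not RubyLLM's actual internals:

```ruby
# Hypothetical sketch of the hierarchy described in the PR; the real
# RubyLLM code will differ. Only Conversation, Chat, and Response are
# names taken from the PR description.
class Conversation
  # Shared plumbing (model selection, message history) would live here.
  def initialize(model:)
    @model = model
    @messages = []
  end

  attr_reader :model, :messages

  # Subclasses declare which OpenAI endpoint they target.
  def endpoint
    raise NotImplementedError
  end
end

class Chat < Conversation
  def endpoint
    "/v1/chat/completions"
  end
end

class Response < Conversation
  def endpoint
    "/v1/responses"
  end
end
```

Under this shape, `Response.new(model: "o4-mini-deep-research")` would route requests to `/v1/responses`, while existing `Chat` conversations keep using `/v1/chat/completions` untouched.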
Just a heads up: support for audio in the Responses API might be coming. I don't see it in the docs yet.
I'm interested in Responses API support so we can more easily connect to remote MCP servers.
While there seem to be very few technical reasons to switch to the Responses API, it looks like OpenAI is set on pushing it hard. Ignoring the reasons why they're doing this (this is not the place to discuss that), I'm worried that they will start releasing new features only on the Responses API in order to force people to switch. I doubt they will deprecate the Completions API, but I wouldn't be surprised if they decide to "leave it behind". In that case it would be good for RubyLLM to support the Responses API, at least as an optional non-default.

I realise it's a problem because RubyLLM was not designed to have multiple APIs per provider, but is there a direction with less friction? Perhaps "OpenAIResponses" could be introduced as a new, separate provider, with the OpenAI provider still defaulting to Completions. I know that has some potential for confusion, but at least it would make it very explicit that you need to intentionally switch to the Responses API.
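The separate-provider idea could be sketched as a registry where `:openai` keeps its Chat Completions default and an explicitly chosen `:openai_responses` provider opts into the Responses endpoint. The registry shape and the `:openai_responses` name are hypothetical illustrations of the proposal, not how RubyLLM actually wires providers:

```ruby
# Hypothetical provider registry illustrating the proposal: the existing
# :openai provider keeps the Chat Completions default, while a separate,
# intentionally selected :openai_responses provider targets the
# Responses API.
PROVIDERS = {
  openai:           { endpoint: "/v1/chat/completions" },
  openai_responses: { endpoint: "/v1/responses" }
}.freeze

def endpoint_for(provider)
  PROVIDERS.fetch(provider).fetch(:endpoint)
end
```

The design trade-off is exactly the one noted above: two provider names for one vendor is potentially confusing, but it makes opting into the Responses API an explicit, deliberate choice.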
What are the scenarios, aside from audio inputs, where the Completions API would provide value? I could change this to offer both options, but I'm just not sure why.

Kind of ridiculous that "Coming soon" has been so long.
I don't think there are any. But there are also very few technical reasons why you'd need to switch to the Responses API, other than OpenAI obviously being intent on pushing it hard as the default choice. Which I think is a good enough reason to add it to RubyLLM.

It's annoying that audio is not supported, because otherwise this could simply be a change switching the RubyLLM backend from Completions to Responses. And since dropping audio support in RubyLLM isn't an option, supporting both seems like the only way forward.

I'm wondering if the way forward would be to keep Responses vs. Completions as an implementation detail: use Responses by default and transparently fall back to Completions for audio? This would remove the need to refactor RubyLLM to support multiple implementations per provider. It would be just one implementation that happens to use both APIs under the hood.
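The "implementation detail" idea could look roughly like this: a single dispatch point that prefers the Responses endpoint and silently drops to Chat Completions when the conversation contains audio. A minimal sketch with a made-up message shape, not RubyLLM code:

```ruby
# Hypothetical dispatch: prefer the Responses API, transparently fall
# back to Chat Completions when any message carries audio content.
# The message shape ({ role:, content: [{ type: ... }] }) is a stand-in.
def pick_endpoint(messages)
  has_audio = messages.any? do |msg|
    Array(msg[:content]).any? { |part| part.is_a?(Hash) && part[:type] == :audio }
  end
  has_audio ? "/v1/chat/completions" : "/v1/responses"
end
```

The caller never chooses an API; the one implementation just happens to use both endpoints under the hood, which is exactly the refactor-avoiding property proposed above.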
That was my intent with the current state of this PR. Are you seeing something missing or different? |
Sorry! I noticed this PR some time ago and missed the development in the meantime. I see now the
I'm a little confused to hear that there aren't any compelling reasons to want to use the Responses API. Better support for reasoning, plus being able to use web search together with function calling, seem like big advantages.
@tpaulshippy Thank you for your work. To be honest, I don't understand why this hasn't been merged by @crmne. Our reason for switching to ruby_llm was mainly its better support for modern APIs, and I honestly can't quite understand why this isn't a priority.

So unless there is a reason I am not seeing, I strongly urge merging this PR and using the recommended API. It's just a question of time until a new model is no longer supported by the Chat Completions API.
@tpaulshippy Thank you for your work on this PR. Are you planning to rebase it and clean up the conflicts? It's a great addition to ruby_llm, and hopefully @crmne will merge it when ready! Thanks again to both.
Started taking a look. Need to work a bit on the reasoning part and get cassettes updated. Should be able to push something here soon. |
Did what I could. Had some malloc issues updating cassettes on a couple of the PDF/image specs. Also noticed that we aren't really streaming the reasoning information from OpenAI, but I don't see it happening on the main branch either; we're just getting a bunch of nils.
Here are the first features only available in the Responses API: https://x.com/openaidevs/status/2021286050623373500 Furthermore, I maintain that ruby_LLM ought to support the Responses API, @crmne
Super keen to see this shipped ❤️ |

What this does
As explained here, there are numerous reasons to use the newer Responses API instead of the Chat Completions API. Features we get by switching include:

- `o4-mini-deep-research`

There is one feature not yet available: audio inputs are not supported by the Responses API. So the library will detect any audio inputs and fall back to the Chat Completions API when they are present.
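The audio detection driving that fallback can be sketched as a small scan over message attachments. The attachment shape and the MIME-type check are illustrative assumptions, not the PR's actual implementation:

```ruby
# Hypothetical audio detection for the fallback described in this PR.
# Treats an attachment as audio by MIME type prefix; the real check in
# the PR may differ.
AUDIO_MIME_PREFIX = "audio/"

def audio_attachment?(attachment)
  attachment[:mime_type].to_s.start_with?(AUDIO_MIME_PREFIX)
end

# True when any attachment is audio, i.e. when the request must go to
# the Chat Completions API instead of the Responses API.
def fallback_to_chat_completions?(attachments)
  attachments.any? { |a| audio_attachment?(a) }
end
```

With a check like this, conversations without audio never touch the legacy path, which is what keeps the change backward compatible for existing users.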
Type of change
Scope check
Quality check
- `overcommit --install` and all hooks pass
- (`models.json`, `aliases.json`)

API changes
Related issues
Replaces #290
Should enable resolution of #213