
fix(core): replay aborted managed tool outputs #1191

Draft
MukundaKatta wants to merge 2 commits into openai:main from MukundaKatta:codex/agents-js-abort-tool-cleanup

Conversation

@MukundaKatta

Summary

  • queue synthetic function_call_result items when a streamed managed-conversation run aborts after OpenAI Responses function_call events have already been emitted
  • replay those synthetic outputs on the next conversationId turn so the server-side conversation store stays balanced
  • add a streaming regression test that aborts after a raw function_call event and verifies the next run sends the synthetic output
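The queue-and-replay flow described above can be sketched roughly as follows. The type shape and function names here are illustrative assumptions for this summary, not the actual agents-core internals:

```typescript
// Hypothetical sketch of the replay mechanism: queue synthetic
// function_call_result items on abort, drain them on the next turn.
type AgentInputItem = {
  type: 'function_call_result';
  callId: string;
  status: 'completed';
  output: { type: 'text'; text: string };
};

const pendingAbortItems = new Map<string, AgentInputItem[]>();

// Called when a streamed run aborts after function_call events were emitted.
function queueAbortOutputs(conversationId: string, callIds: string[]): void {
  const items: AgentInputItem[] = callIds.map((callId) => ({
    type: 'function_call_result',
    callId,
    status: 'completed',
    output: { type: 'text', text: 'aborted' },
  }));
  const existing = pendingAbortItems.get(conversationId) ?? [];
  pendingAbortItems.set(conversationId, existing.concat(items));
}

// Called at the start of the next turn for the same conversationId: drain
// the queued synthetic outputs so they are prepended to the model input.
function takeAbortOutputs(conversationId: string): AgentInputItem[] {
  const items = pendingAbortItems.get(conversationId) ?? [];
  pendingAbortItems.delete(conversationId);
  return items;
}
```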

Testing

  • git diff --check
  • npm test -- --run packages/agents-core/test/run.stream.test.ts (blocked locally because vitest is not installed in this clone)

@changeset-bot

changeset-bot (Bot) commented Apr 21, 2026

🦋 Changeset detected

Latest commit: 6e5df89

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 5 packages:

  • @openai/agents-core (Patch)
  • @openai/agents-extensions (Patch)
  • @openai/agents-openai (Patch)
  • @openai/agents-realtime (Patch)
  • @openai/agents (Patch)



@chatgpt-codex-connector (Bot) left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 081a5a386c

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

Comment on lines +1321 to +1324
if (serverConversationTracker?.conversationId && inputMarked) {
  clearManagedConversationSupplementalItems(
    serverConversationTracker.conversationId,
  );


P1: Clear pending abort outputs on every successful turn

clearManagedConversationSupplementalItems is called only in the streaming path, so abort outputs queued in pendingManagedConversationAbortItems survive successful non-streaming conversationId turns. Because getManagedConversationSupplementalItems always prepends those pending items, a later fresh runner.run(..., { conversationId }) can resend the same synthetic function_call_result again, which can rebalance the transcript incorrectly or trigger duplicate-call errors from the provider. Please clear the queue after successful non-streaming model calls as well.
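The suggested fix can be sketched as a shared helper that both the streaming and non-streaming paths invoke after a successful turn. The names below mirror the quoted snippet, but the surrounding wiring is hypothetical:

```typescript
// Illustrative sketch: one shared post-turn hook so a successful
// non-streaming turn also clears the pending abort queue, instead of
// only the streaming path doing so.
const pendingManagedConversationAbortItems = new Map<string, unknown[]>();

function clearManagedConversationSupplementalItems(conversationId: string): void {
  pendingManagedConversationAbortItems.delete(conversationId);
}

// Call this from both the streaming and non-streaming code paths after a
// successful model call for a managed (conversationId) turn.
function onTurnSucceeded(
  conversationId: string | undefined,
  inputMarked: boolean,
): void {
  if (conversationId && inputMarked) {
    clearManagedConversationSupplementalItems(conversationId);
  }
}
```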



@chatgpt-codex-connector (Bot) left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 6e5df89e5d


ProcessedResponse<any>,
AgentInputItem[]
>();
const pendingManagedConversationAbortItems = new Map<string, AgentInputItem[]>();


P2: Bound the pending abort cache to avoid unbounded growth

The new pendingManagedConversationAbortItems cache is a process-global Map keyed by conversationId, but this commit adds no general eviction path for entries that are never replayed successfully. Since queueManagedConversationSupplementalItems keeps inserting IDs and cleanup currently depends on later turn success, aborted conversations that are never resumed can accumulate indefinitely in long-lived workers, causing memory growth over time. Please add a bounded eviction strategy (for example TTL/size cap, and/or broader clear paths) so orphaned conversation IDs do not stay resident forever.
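A bounded cache of the kind this comment asks for could look roughly like the following: a size cap with insertion-order eviction plus a TTL. This is an illustration of the strategy, not the actual fix:

```typescript
// Illustrative bounded map: evicts the oldest entries past a size cap and
// lazily expires entries past a TTL, so orphaned conversation IDs cannot
// stay resident forever in a long-lived worker.
interface Entry<V> {
  value: V;
  expiresAt: number;
}

class BoundedTtlMap<V> {
  private entries = new Map<string, Entry<V>>();

  constructor(private maxSize: number, private ttlMs: number) {}

  set(key: string, value: V, now: number = Date.now()): void {
    this.entries.delete(key); // refresh insertion order for this key
    this.entries.set(key, { value, expiresAt: now + this.ttlMs });
    // Evict oldest-inserted entries beyond the size cap.
    while (this.entries.size > this.maxSize) {
      const oldest = this.entries.keys().next().value as string;
      this.entries.delete(oldest);
    }
  }

  get(key: string, now: number = Date.now()): V | undefined {
    const entry = this.entries.get(key);
    if (!entry) return undefined;
    if (entry.expiresAt <= now) {
      this.entries.delete(key); // lazily expire stale entries
      return undefined;
    }
    return entry.value;
  }

  delete(key: string): void {
    this.entries.delete(key);
  }
}
```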


@seratch marked this pull request as draft April 23, 2026 23:11
@wsk-builds (Contributor) left a comment


Thanks for working on this. I checked out the current draft locally and ran:

  • pnpm i
  • pnpm -F @openai/agents-core build-check
  • pnpm -F @openai/agents-core dist:check
  • CI=1 pnpm test -- packages/agents-core/test/run.stream.test.ts

All completed successfully; the Vitest command ended up running the full suite in this checkout and reported 111 test files / 1783 tests passing.

I agree this is an important edge case for conversationId + streaming aborts: if a Responses function call has already been persisted server-side, the next turn needs some way to avoid leaving the managed conversation in an unbalanced state.

I think the main risk is the current ownership/lifetime of the pending abort items. pendingManagedConversationAbortItems is a module-level Map keyed only by conversationId, so the cleanup state can outlive a Runner/request and can also be shared across unrelated callers in the same process. That makes a few cases hard to reason about: concurrent runs using the same conversation id, aborted runs that are never resumed, and long-lived server processes where pending entries may remain indefinitely. I think this state should either be tied to RunState / the runner/session lifecycle, or the PR should explicitly document why process-global conversation-id state is safe here.

I also think the synthesized output semantics need another look. The PR turns a raw function_call into a function_call_result with status: "completed" and output text "aborted". A user abort is not really a completed tool execution, and this overlaps with the status/history concerns discussed in #1104 / #1110. If completed is required for server-managed history compatibility, it would be good to call that out and add a test/probe explaining why incomplete or another representation is not viable. Otherwise the model-visible history may incorrectly imply the tool succeeded.

For tests, the current regression test covers the basic replay path, but I think this needs a broader matrix before merge:

  • multiple function calls emitted before abort
  • abort before a usable call_id is available
  • previousResponseId mode remains a no-op
  • pending items are cleared after a successful replay
  • pending items do not get lost or duplicated if the next run fails
  • concurrent runs with the same conversationId
  • interaction with existing managed conversation supplemental handoff items

One smaller thing: the changeset package looks right as @openai/agents-core, but the summary should probably use the repo's Conventional Commit style, e.g. fix(core): replay aborted managed conversation tool outputs.

Overall, I think the PR is pointed at a real bug, but I'd prefer to see the pending-item ownership and synthesized-result semantics clarified before merge.

