fix: infer tool-calls finishReason when finish_reason is unknown (#420) by robert-j-y · Pull Request #430 · OpenRouterTeam/ai-sdk-provider

robert-j-y · 2026-03-06T20:43:39Z

Description

Fixes #420 — Some providers (e.g. Kimi K2.5) return unknown or null finish_reason values even when tool calls are present. The SDK maps these to 'other', which breaks agentic loops since the AI SDK doesn't know to continue. This PR adds an inference step: if finishReason is 'other' but tool calls were made, override to 'tool-calls'. Applied in both doGenerate and doStream paths.

Updates since last revision

Removed dead usage fallback code (Standard usage object empty when usage data is in providerMetadata #419 fix): The Devin Review bot correctly identified that the flush() fallback for populating usage from openrouterUsage was unreachable — both are always populated from the same value.usage block in transform(). Removed the dead code and its misleading test. Issue Standard usage object empty when usage data is in providerMetadata #419 will need to be investigated separately.

Key areas for review

Two sequential finishReason overrides in flush() (lines ~1041-1055 in src/chat/index.ts): The existing encrypted-reasoning override runs first, then the new 'other' → 'tool-calls' override. Verify the ordering is correct and no unintended interaction.
New intermediate variable in doGenerate (mappedFinishReason → effectiveFinishReason): The existing encrypted-reasoning override and the new 'other' override are now applied sequentially. Confirm no edge case where both fire incorrectly.
E2e test assertion is conditional: The Kimi K2.5 returns undefined finishReason after tool calls, breaking agentic loops #420 e2e test only asserts finishReason === 'tool-calls' if the model actually returns tool calls. If the model doesn't comply, the test passes vacuously.

Checklist

I have run pnpm stylecheck and pnpm typecheck
I have run pnpm test and all tests pass (246 tests)
I have added tests for my changes (3 unit tests + 2 e2e regression tests)
I have updated documentation (if applicable)

Changeset

I have run pnpm changeset to create a changeset file

Link to Devin Session: https://app.devin.ai/sessions/05ec3186bbc44181810cf240b9bccdc4
Requested by: @robert-j-y

…tadata fallback (#419, #420) Co-Authored-By: Robert Yeakel <robert.yeakel@openrouter.ai>

devin-ai-integration · 2026-03-06T20:43:43Z

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

Disable automatic comment and CI monitoring

The usage fallback in flush() was dead code — both usage and openrouterUsage are always populated from the same value.usage block in transform(), so the condition could never be true. Removed the fallback and its misleading test. PR now focuses solely on #420 (finishReason inference). Co-Authored-By: Robert Yeakel <robert.yeakel@openrouter.ai>

robert-j-y · 2026-03-09T20:09:10Z

Addressing "Items for human review"

1. FinishReason inference: infers `'tool-calls'` when `unified === 'other'` AND tool calls present

Correct and safe. Some providers (notably those routing through OpenRouter) return finish_reason: 'other' even when the model produced tool calls. The inference checks two conditions: (a) finishReason.unified === 'other' and (b) at least one tool call exists in the response. This is a conservative heuristic — it only overrides 'other' (an uninformative value) and only when there's concrete evidence of tool calls. It does NOT override meaningful finish reasons like 'stop', 'length', or 'content-filter'.

Cross-referenced against openrouter-web: the API's mapFinishReason() maps provider-specific values to unified values, but some providers return non-standard finish reasons that map to 'other'. The SDK-side inference fills this gap.

2. Usage fallback from providerMetadata

Correct. When the standard usage field in the response is missing or has zero values, the PR falls back to providerMetadata.openrouter.usage which contains the OpenRouter-computed usage data. This is safe because:

It only falls back when standard usage is absent/zero (doesn't override valid data)
OpenRouter always computes usage server-side, so providerMetadata.openrouter.usage is reliable
The fallback uses the same computeTokenUsage() helper that processes standard usage

3. E2e test models

Acceptable. The e2e tests use stable, widely-available models. The tests verify finish reason and usage are populated correctly, which is a provider-agnostic concern. CI has passed consistently.

4. Dead code removal (from Devin Review)

Already addressed. Dead code was removed in commit b76c244 per the Devin Review comment.

…lback

fix: infer tool-calls finishReason and populate usage from providerMe…

11b0082

…tadata fallback (#419, #420) Co-Authored-By: Robert Yeakel <robert.yeakel@openrouter.ai>

devin-ai-integration bot assigned robert-j-y Mar 6, 2026

This comment was marked as resolved.

Sign in to view

robert-j-y changed the title ~~fix: infer tool-calls finishReason and populate usage from providerMetadata fallback (#419, #420)~~ fix: infer tool-calls finishReason when finish_reason is unknown (#420) Mar 6, 2026

robert-j-y requested a review from mattapperson March 7, 2026 00:07

Merge branch 'main' into devin/1772829163-fix-finish-reason-usage-fal…

4ec227f

…lback

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: infer tool-calls finishReason when finish_reason is unknown (#420)#430

fix: infer tool-calls finishReason when finish_reason is unknown (#420)#430
robert-j-y wants to merge 3 commits intomainfrom
devin/1772829163-fix-finish-reason-usage-fallback

robert-j-y commented Mar 6, 2026 •

edited

Loading

Uh oh!

devin-ai-integration bot commented Mar 6, 2026

Uh oh!

This comment was marked as resolved.

Uh oh!

robert-j-y commented Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

robert-j-y commented Mar 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Updates since last revision

Key areas for review

Checklist

Changeset

Uh oh!

devin-ai-integration bot commented Mar 6, 2026

🤖 Devin AI Engineer

Uh oh!

This comment was marked as resolved.

Uh oh!

robert-j-y commented Mar 9, 2026

Addressing "Items for human review"

1. FinishReason inference: infers 'tool-calls' when unified === 'other' AND tool calls present

2. Usage fallback from providerMetadata

3. E2e test models

4. Dead code removal (from Devin Review)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

robert-j-y commented Mar 6, 2026 •

edited

Loading

1. FinishReason inference: infers `'tool-calls'` when `unified === 'other'` AND tool calls present