fix: harden payment replay and purchase guardrails by batmn-dev · Pull Request #47 · batmn-dev/not-a-wrapper

batmn-dev · 2026-03-02T00:51:35Z

Summary

Harden replay normalization and compiler handling for payment-related assistant/tool message flows across OpenAI and Anthropic paths.
Add payment intent parsing and route guardrails to reduce accidental purchase execution and enforce safer tool-policy behavior.
Introduce persistent payment tool state support in Convex (schema/functions/backfill) and expand regression coverage for replay, intent, and policy integration.

Test plan

Existing automated tests updated/added for replay matrix behavior
Existing automated tests updated/added for payment intent parsing
Existing automated tests updated/added for tool platform and route-policy integration
Run full test suite locally before merge

Made with Cursor

Note

High Risk
Touches payment side-effect execution by adding intent-based tool suppression, execution-time blocking, and a new Convex chatToolState ledger with backfill; misclassification or state/version bugs could block valid purchases or fail to prevent side effects. Feature flags reduce blast radius but rollout/backfill paths still need careful review.

Overview
Prevents payment side effects from being triggered by replay/edit/model-switch flows by adding a canonical per-chat payment state ledger (chatToolState) with idempotent upserts, version-based staleness rejection, and best-effort cleanup on message truncation/clear.

Introduces server-side payment intent classification and request-scoped policy overrides to remove pay_purchase on status_check/active-job turns (plus a defense-in-depth isPurchaseBlocked execution guard), and adds optional lazy backfill from toolCallLog for legacy chats.

Hardens replay by marking payment tools as explicitly non-replayable, extracting jobId/url/status into platformToolContext, and compiling them into deterministic continuity text (not tool calls); expands regression tests and adds observability fields/flags (PAYMENT_*) including toolCallLog metadata (chatVersion, toolKey, stateMutationKey).

^{Written by Cursor Bugbot for commit f680add. This will update automatically on new commits. Configure here.}

Summary by cubic

Prevents accidental purchases during status checks and stops payment replay from causing side effects. Adds a canonical per-chat payment state synced to tool outputs, propagates chatVersion for safe edits/resends, and truncates stale state on edit and clear.

New Features
- Canonical Convex payment state (chatToolState) with idempotent upserts, chatVersion propagation, and server-side truncation on edits/resends and chat clear to drop stale branches.
- Intent classifier + policy deny pay_purchase on status/active job, plus an execution-time isPurchaseBlocked guard for defense-in-depth.
- Route integration with lazy backfill that correlates status to the latest purchase job; structured logs include chatVersion/toolKey/stateMutationKey; client and server pass chatVersion to keep state consistent.
- Replay compilers downgrade payment tools to text continuity with URL/jobId/status preserved and avoid inferring “in progress” when terminal is unknown.
- Keyword boundary matching avoids substring collisions (e.g., “checkout” vs “check”).
Migration
- Deploy Convex schema and run codegen.
- Enable flags as needed:
  - PAYMENT_STATUS_GUARDRAILS_V1, PAYMENT_CHAT_STATE_V1
  - PAYMENT_GUARDRAIL_MODE=observe or enforce
  - Optional: PAYMENT_CHAT_STATE_BACKFILL_V1
- No dependency changes; behavior is gated behind flags.

^{Written for commit f680add. Summary will update on new commits.}

Prevent payment tool parts from being replayed as executable calls, add intent/state-based policy overrides to block unsafe purchases, and persist payment tool state for safer status-first continuity across provider switches. Made-with: Cursor

vercel · 2026-03-02T00:51:40Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
not-a-wrapper	Ready	Preview, Comment	Mar 2, 2026 3:36am

cubic-dev-ai

3 issues found across 23 files

Prompt for AI agents (unresolved issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="app/api/chat/intent/payment-intent.ts">

<violation number="1" location="app/api/chat/intent/payment-intent.ts:55">
P1: Substring matching can misclassify intent (e.g., "goodbye" includes "buy") and accidentally treat non‑purchase messages as `new_purchase`. Use word/phrase boundaries to avoid false positives for safety-critical intent classification.</violation>
</file>

<file name="lib/config.ts">

<violation number="1" location="lib/config.ts:315">
P2: Validate PAYMENT_GUARDRAIL_MODE so only "observe" or "enforce" are accepted; otherwise fall back to "observe". The current type assertion doesn’t prevent invalid env values from propagating.</violation>
</file>

<file name="lib/tools/capability-policy.ts">

<violation number="1" location="lib/tools/capability-policy.ts:351">
P1: Bug: observe mode returns `null` instead of a loggable override, so this case is silently dropped. The route handler (`app/api/chat/route.ts:703`) gates logging on `if (policyOverrides)`, meaning the "unknown intent + active job" scenario in observe mode produces zero telemetry — contradicting the comment's stated intent. Return an override object without `denyTools` so the route can log it.</violation>
</file>

_{Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.}

app/api/chat/intent/payment-intent.ts

lib/tools/capability-policy.ts

lib/config.ts

app/api/chat/intent/payment-intent.ts

app/api/chat/replay/compilers/openai.ts

lib/config.ts

app/api/chat/replay/normalize.ts

convex/chatToolStateBackfill.ts

greptile-apps · 2026-03-02T01:03:48Z

Greptile Summary

This PR implements a comprehensive payment guardrail system to prevent accidental purchase re-execution during message replay and edit/resend operations.

Key Changes:

Payment intent classifier using keyword matching to distinguish status checks from new purchases
Canonical chatToolState ledger in Convex with OCC guards for payment lifecycle tracking
Route-level policy system that denies pay_purchase when status-check intent is detected
Defense-in-depth isPurchaseBlocked callback in platform tools as execution-time failsafe
Replay compiler enhancements to downgrade payment tools to text continuity summaries
Fail-safe backfill logic to hydrate legacy chats without enabling purchases on ambiguous data
Edit/resend truncation integrated with state cleanup via chatVersion tracking

Issues Found:

Race condition in upsertFromStatus where status updates for different jobs arriving out-of-order could incorrectly clear activePurchaseJobId

Architecture Notes:
The implementation follows a defense-in-depth strategy with three layers: (1) intent classification at the route level, (2) policy-based tool filtering, and (3) execution-time callback guard. The OCC pattern via chatVersion and lastMutationKey provides idempotency and stale-write protection.

Confidence Score: 4/5

Safe to merge with one race condition fix recommended for production
Comprehensive payment guardrail system with multi-layer defense, extensive test coverage, and fail-safe backfill logic. One race condition in status updates should be addressed before production use.
Pay close attention to convex/chatToolState.ts for the race condition fix

Important Files Changed

Filename	Overview
convex/chatToolState.ts	New payment state ledger with OCC guards. Race condition: `upsertFromStatus` clears `activePurchaseJobId` without verifying jobId match
app/api/chat/intent/payment-intent.ts	Deterministic keyword-based payment intent classifier with safety-first reclassification logic
app/api/chat/route.ts	Adds payment guardrails with intent classification, state-driven policy, and lazy backfill support
lib/tools/platform.ts	Adds defense-in-depth `isPurchaseBlocked` callback to prevent accidental purchase execution
convex/chatToolStateBackfill.ts	Fail-safe backfill logic that hydrates state from toolCallLog without enabling purchases on ambiguous data
app/api/chat/replay/normalize.ts	Extracts platform tool context from payment tools for replay continuity summaries

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[User Message] --> B[Route: Classify Intent]
    B --> C{Intent Class?}
    C -->|status_check| D[Policy: Deny pay_purchase]
    C -->|new_purchase| E{Has Active Job?}
    C -->|unknown| F{Has Active Job?}
    
    E -->|Yes| D
    E -->|No| G[Policy: Allow Both Tools]
    
    F -->|Yes| H{Guardrail Mode?}
    F -->|No| G
    
    H -->|enforce| D
    H -->|observe| G
    
    D --> I[Remove pay_purchase from ToolSet]
    G --> J[Keep All Tools]
    
    I --> K[Model Generation]
    J --> K
    
    K --> L{Tool Called?}
    L -->|pay_purchase| M{isPurchaseBlocked?}
    L -->|pay_status| N[Execute Status Check]
    
    M -->|true| O[Throw Error]
    M -->|false| P[Execute Purchase]
    
    P --> Q[Upsert chatToolState]
    N --> R[Update Status in chatToolState]
    
    Q --> S[Set activePurchaseJobId]
    R --> T{isTerminal?}
    T -->|Yes| U[Clear activePurchaseJobId]
    T -->|No| V[Keep activePurchaseJobId]
    
    style D fill:#ffcccc
    style O fill:#ffcccc
    style Q fill:#ccffcc
    style R fill:#ccffcc

_{Last reviewed commit: 35fd30b}

greptile-apps

_{23 files reviewed, 1 comment}

_{Edit Code Review Agent Settings | Greptile}

convex/chatToolState.ts

Improve payment intent keyword matching to avoid substring collisions, preserve payment context in replay fallback summaries, and harden canonical state/backfill handling to reduce cross-job and stale-state errors. Made-with: Cursor

app/api/chat/replay/compilers/platform-tool-fallback.ts

convex/chatToolState.ts

convex/messages.ts

Avoid forcing an in-progress status label when replay context lacks a terminal signal, and include chat version/state mutation metadata for payment tool observability. Also clear canonical chat tool state when clearing chat history and cover fallback behavior with a regression test. Made-with: Cursor

app/api/chat/route.ts

convex/messages.ts

Keep payment chat state synchronized with pay_purchase/pay_status tool outputs and clear stale state when edits truncate history, preventing outdated job context from surviving resend flows. Made-with: Cursor

cubic-dev-ai

1 issue found across 5 files (changes from recent commits).

Prompt for AI agents (unresolved issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="convex/messages.ts">

<violation number="1" location="convex/messages.ts:290">
P2: Backfilled payment state (chatVersion 0 with no sourceMessageTimestamp) will never be cleared by this truncation fallback, leaving stale payment state after edits. Consider treating chatVersion 0 as unknown and deleting it when any truncation occurs.</violation>
</file>

_{Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.}

cubic-dev-ai · 2026-03-02T03:42:04Z

convex/messages.ts

+        const versionWasTruncated =
+          toDelete.length > 0 && toolState.chatVersion >= truncationMinVersion


P2: Backfilled payment state (chatVersion 0 with no sourceMessageTimestamp) will never be cleared by this truncation fallback, leaving stale payment state after edits. Consider treating chatVersion 0 as unknown and deleting it when any truncation occurs.

Prompt for AI agents

Check if this issue is valid — if so, understand the root cause and fix it. At convex/messages.ts, line 290: <comment>Backfilled payment state (chatVersion 0 with no sourceMessageTimestamp) will never be cleared by this truncation fallback, leaving stale payment state after edits. Consider treating chatVersion 0 as unknown and deleting it when any truncation occurs.</comment> <file context> @@ -277,11 +281,16 @@ export const deleteFromTimestamp = mutation({ + typeof toolState.sourceMessageTimestamp === "number" && toolState.sourceMessageTimestamp >= timestamp - ) { + const versionWasTruncated = + toDelete.length > 0 && toolState.chatVersion >= truncationMinVersion + </file context>

Suggested change

const versionWasTruncated =

toDelete.length > 0 && toolState.chatVersion >= truncationMinVersion

const versionWasTruncated =

toDelete.length > 0 &&

(toolState.chatVersion === 0 ||

toolState.chatVersion >= truncationMinVersion)

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

cursor · 2026-03-02T03:47:01Z

convex/chatToolStateBackfill.ts

+    const rowId = await ctx.db.insert("chatToolState", {
+      chatId,
+      userId: user._id,
+      chatVersion: 0, // Backfilled, not from a real version


Backfilled chatToolState rows immune to edit truncation

Medium Severity

Backfilled chatToolState rows use chatVersion: 0 and sourceMessageTimestamp: undefined, making them immune to both truncation mechanisms during message edits. The server-side check toolState.chatVersion >= truncationMinVersion evaluates to 0 >= ≥1 (false), and sourceTimestampWasTruncated is false since the timestamp is undefined. The client-side truncateFromVersion has the same blind spot. This directly contradicts the code comment stating backfilled rows "don't survive an edit branch," and violates the plan invariant that edit truncation must not leave stale payment state active. A stale backfilled row with activePurchaseJobId set would incorrectly block new purchases after an edit.

Additional Locations (2)

convex/messages.ts#L289-L291

convex/chatToolState.ts#L257-L258

cursor · 2026-03-02T03:47:01Z

app/api/chat/intent/payment-intent.ts

+      confidence: "high",
+      reason: "explicit_purchase_intent",
+    }
+  }


Classifier case ordering shadows status-keyword-only fallback

Low Severity

The check at line 112 (hasPurchaseKeyword && !hasActiveJob) is evaluated before the check at line 121 (hasStatusKeyword && !hasActiveJob && !hasAnyJob), making the latter unreachable when both keyword types are present. A message like "What is the status of my order?" with no existing jobs matches both "status" and "order" keywords, so it's classified as new_purchase (high confidence) instead of unknown. The test suite avoids this case by using messages without overlapping keywords.

Additional Locations (1)

app/api/chat/intent/payment-intent.ts#L119-L127

cubic-dev-ai bot reviewed Mar 2, 2026

View reviewed changes

app/api/chat/intent/payment-intent.ts Outdated Show resolved Hide resolved

lib/tools/capability-policy.ts Outdated Show resolved Hide resolved

lib/config.ts Outdated Show resolved Hide resolved

cursor bot reviewed Mar 2, 2026

View reviewed changes

greptile-apps bot reviewed Mar 2, 2026

View reviewed changes

convex/chatToolState.ts Outdated Show resolved Hide resolved

fix: tighten payment guardrails and replay continuity

21fb21b

Improve payment intent keyword matching to avoid substring collisions, preserve payment context in replay fallback summaries, and harden canonical state/backfill handling to reduce cross-job and stale-state errors. Made-with: Cursor

vercel bot deployed to Preview March 2, 2026 01:32 View deployment

cursor bot reviewed Mar 2, 2026

View reviewed changes

app/api/chat/replay/compilers/platform-tool-fallback.ts Outdated Show resolved Hide resolved

convex/chatToolState.ts Show resolved Hide resolved

convex/messages.ts Show resolved Hide resolved

vercel bot deployed to Preview March 2, 2026 02:12 View deployment

cursor bot reviewed Mar 2, 2026

View reviewed changes

app/api/chat/route.ts Show resolved Hide resolved

convex/messages.ts Show resolved Hide resolved

fix: persist payment tool state and truncate stale branches

f680add

Keep payment chat state synchronized with pay_purchase/pay_status tool outputs and clear stale state when edits truncate history, preventing outdated job context from surviving resend flows. Made-with: Cursor

batmn-dev merged commit 86d5ddc into main Mar 2, 2026
5 of 6 checks passed

vercel bot deployed to Preview March 2, 2026 03:36 View deployment

cubic-dev-ai bot reviewed Mar 2, 2026

View reviewed changes

cursor bot reviewed Mar 2, 2026

View reviewed changes

		const versionWasTruncated =
		toDelete.length > 0 && toolState.chatVersion >= truncationMinVersion

Conversation

batmn-dev commented Mar 2, 2026 • edited by cubic-dev-ai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Summary by cubic

Uh oh!

vercel bot commented Mar 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

greptile-apps bot commented Mar 2, 2026

Greptile Summary

Confidence Score: 4/5

Important Files Changed

Flowchart

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot Mar 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor bot Mar 2, 2026

Choose a reason for hiding this comment

Backfilled chatToolState rows immune to edit truncation

Uh oh!

cursor bot Mar 2, 2026

Choose a reason for hiding this comment

Classifier case ordering shadows status-keyword-only fallback

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

batmn-dev commented Mar 2, 2026 •

edited by cubic-dev-ai bot

Loading

vercel bot commented Mar 2, 2026 •

edited

Loading

cubic-dev-ai bot Mar 2, 2026 •

edited

Loading