fix: improve diff parsing for Grok models with malformed content #7758

roomote · 2025-09-07T15:43:04Z

Summary

This PR addresses Issue #7750, in which Grok Coder models hit parsing errors with the apply_diff tool: the error messages claimed that files contained diff markers (such as =======) when they did not.

Problem

As reported by @mrbm, the system was showing error messages about escaping special markers like ======= even when these markers were not present in the actual file content. This suggests the issue is with how the diff parser handles malformed content generated by AI models, not with the file content itself.

Solution

This PR improves the diff parsing logic to:

1. Detect Grok-specific malformed patterns - Identifies common issues like:
   - Consecutive separators (======= appearing multiple times)
   - Separators appearing before SEARCH markers
   - Unbalanced markers (mismatched SEARCH/REPLACE blocks)
   - Too many separators relative to SEARCH blocks
2. Provide better error messages - When malformed diffs are detected:
   - Clearly explains it is likely an AI model issue, not file content
   - Provides debugging information showing marker counts
   - Offers actionable suggestions (use read_file first, use simpler diffs, etc.)
   - Shows the correct diff format
3. Add comprehensive test coverage - 27 new test cases covering various Grok-specific scenarios
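
For readers who want a concrete picture of the four patterns listed above, here is a minimal sketch of how such checks could be written. It is not the code from this PR: the function names (`classifyLine`, `findMalformedPatterns`), the problem strings, and the exact marker spellings (`<<<<<<< SEARCH`, `=======`, `>>>>>>> REPLACE`) are assumptions made for illustration. "Consecutive separators" is taken here to mean two separators with no SEARCH or REPLACE marker between them.

```typescript
// Hypothetical sketch; names and marker spellings are assumptions, not the PR's code.
const SEARCH_RE = /^<<<<<<< SEARCH/
const SEPARATOR_RE = /^=======$/
const REPLACE_RE = /^>>>>>>> REPLACE/

type Marker = "search" | "separator" | "replace"

interface MarkerCounts {
	search: number
	separator: number
	replace: number
}

// Classify one line as a marker, or null for ordinary content.
function classifyLine(rawLine: string): Marker | null {
	const line = rawLine.trim()
	if (SEARCH_RE.test(line)) return "search"
	if (SEPARATOR_RE.test(line)) return "separator"
	if (REPLACE_RE.test(line)) return "replace"
	return null
}

// Walk the diff once, counting markers and recording each problem at most once.
function findMalformedPatterns(diff: string): { problems: string[]; counts: MarkerCounts } {
	const counts: MarkerCounts = { search: 0, separator: 0, replace: 0 }
	const problems: string[] = []
	const report = (problem: string) => {
		if (problems.indexOf(problem) === -1) problems.push(problem)
	}

	let previous: Marker | null = null
	for (const rawLine of diff.split(/\r?\n/)) {
		const marker = classifyLine(rawLine)
		if (marker === null) continue
		counts[marker]++
		if (marker === "separator") {
			if (counts.search === 0) report("separator before SEARCH")
			if (previous === "separator") report("consecutive separators")
		}
		previous = marker
	}

	if (counts.search !== counts.replace) report("unbalanced SEARCH/REPLACE markers")
	if (counts.separator > counts.search) report("too many separators")
	return { problems, counts }
}
```

A well-formed block with one SEARCH, one separator, and one REPLACE yields an empty `problems` array. Note that a check like this cannot tell a stray separator from a file that legitimately contains a `=======` line, which is why the suggestions above point users toward escaping or simpler diffs.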

Changes

- Enhanced error detection in multi-search-replace.ts and multi-file-search-replace.ts
- Added helper methods detectGrokMalformedDiff() and analyzeDiffStructure()
- Added comprehensive test suite in grok-malformed-diff.spec.ts
- Updated existing tests to match new error message format

Testing

✅ All existing tests pass
✅ Added 27 new test cases for Grok-specific scenarios
✅ Linting and type checking pass

Note

While this implementation should help with many Grok-related diff issues, I have asked @mrbm for additional details about their specific case to ensure this fully resolves the issue. The fix focuses on better detection and messaging for malformed diffs, which should prevent the confusing error messages about markers that do not exist in files.

Fixes #7750

Important

Enhances diff parsing to detect Grok-specific malformed patterns, improves error messaging, and adds comprehensive test coverage.

Behavior:
- Improves diff parsing logic in multi-search-replace.ts and multi-file-search-replace.ts to detect Grok-specific malformed patterns like consecutive separators, separators before SEARCH markers, unbalanced markers, and excessive separators.
- Provides enhanced error messages for malformed diffs, indicating potential AI model issues and offering debugging info and suggestions.
Functions:
- Adds detectGrokMalformedDiff() and analyzeDiffStructure() to identify and analyze malformed diffs.
Testing:
- Introduces grok-malformed-diff.spec.ts with 27 new test cases for Grok-specific scenarios.
- Updates existing tests in multi-search-replace.spec.ts to match new error message format.
Misc:
- Adjusts error handling in validateMarkerSequencing() to incorporate Grok detection.
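
As a rough illustration of the error-handling change, the sketch below shows one way a validation result carrying marker counts and suggestions could be assembled. Only the result shape `{ success, error?, debugInfo? }` comes from the diff in this PR; the function name `buildValidationResult`, the `MarkerCounts` type, and all message wording are assumptions.

```typescript
// Hypothetical sketch; only the result shape is taken from the PR's diff.
interface ValidationResult {
	success: boolean
	error?: string
	debugInfo?: string
}

interface MarkerCounts {
	search: number
	separator: number
	replace: number
}

// Turn a list of detected problems into a user-facing result.
function buildValidationResult(problems: string[], counts: MarkerCounts): ValidationResult {
	if (problems.length === 0) return { success: true }

	// Marker counts are kept separate so callers can log or display them as they see fit.
	const debugInfo = `SEARCH=${counts.search}, separators=${counts.separator}, REPLACE=${counts.replace}`

	const error = [
		"The diff appears to be malformed. This is likely an AI model issue, not a problem with the file content.",
		"Detected: " + problems.join("; "),
		"Suggestions:",
		"- Use read_file first to get the current file content",
		"- Use smaller, simpler diffs",
		"Expected format:",
		"<<<<<<< SEARCH",
		"[content to find]",
		"=======",
		"[replacement content]",
		">>>>>>> REPLACE",
	].join("\n")

	return { success: false, error, debugInfo }
}
```

Keeping `debugInfo` as its own optional field, rather than folding it into `error`, leaves existing callers that only read `success` and `error` unaffected.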


- Add detection for Grok-specific malformed diff patterns
- Provide better error messages when AI models generate incorrect syntax
- Add debugging information to help users understand what went wrong
- Include actionable suggestions for fixing diff issues
- Add comprehensive test coverage for Grok scenarios

This addresses issue #7750 where Grok models were generating malformed diffs that caused confusing error messages about markers that were not actually present in the files being edited.

ellipsis-dev · 2025-09-07T15:44:52Z

src/core/diff/strategies/multi-file-search-replace.ts

 	}

-	private validateMarkerSequencing(diffContent: string): { success: boolean; error?: string } {
+	private validateMarkerSequencing(diffContent: string): { success: boolean; error?: string; debugInfo?: string } {


Both strategies (MultiFileSearchReplaceDiffStrategy and MultiSearchReplaceDiffStrategy) now include nearly identical diff validation methods. Consider extracting the common logic (detectGrokMalformedDiff, analyzeDiffStructure, and parts of validateMarkerSequencing) into a shared utility module to reduce duplication.
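
One possible shape for that extraction is sketched below. The shared function name `validateMarkerBalance`, the suggested module location, and the simplified check it performs are assumptions for illustration; the two classes are stripped-down stand-ins for the real strategies, and `validateMarkerSequencing` is left public here (it is private in the PR) only so the sketch can be exercised.

```typescript
// Hypothetical sketch of a shared validation helper; not the PR's code.
interface ValidationResult {
	success: boolean
	error?: string
	debugInfo?: string
}

// Would live in a shared module (for example, a file next to the two strategies).
function validateMarkerBalance(diffContent: string): ValidationResult {
	const count = (pattern: RegExp) => (diffContent.match(pattern) || []).length
	const search = count(/^<<<<<<< SEARCH/gm)
	const replace = count(/^>>>>>>> REPLACE/gm)
	if (search === replace) return { success: true }
	return {
		success: false,
		error: "Unbalanced SEARCH/REPLACE markers",
		debugInfo: `SEARCH=${search}, REPLACE=${replace}`,
	}
}

// Simplified stand-ins: each strategy delegates instead of carrying its own copy.
class MultiSearchReplaceDiffStrategy {
	validateMarkerSequencing(diffContent: string): ValidationResult {
		return validateMarkerBalance(diffContent)
	}
}

class MultiFileSearchReplaceDiffStrategy {
	validateMarkerSequencing(diffContent: string): ValidationResult {
		return validateMarkerBalance(diffContent)
	}
}
```

With this layout, a change to the detection rules or message format is made once, and the tests for the shared helper cover both strategies.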

This comment was generated because it violated a code review rule: irule_tTqpIuNs8DV0QFGj.

roomote

Reviewing my own code is like debugging in a mirror - everything looks backwards but the bugs are still mine.

roomote · 2025-09-07T15:47:32Z