Dual Storage Architecture #9

zaibkhan · 2025-09-09T17:51:26Z

Mirrors ai-code-review-evaluation#6 for like-for-like benchmarking.

Base: dual-storage-baseline
Head: dual-storage-enhanced

Original PR excerpt:

Test 6

* Dual writer: mode 3 * Add integration tests for playlits in mode 3 * Remove todo * Update pkg/apiserver/rest/dualwriter_mode3.go Co-authored-by: Arati R. <[email protected]> * Admin: Fixes an issue where user accounts could not be enabled (#88117) Fix: unable to enable user * [REVIEW] FInish mode 3 and add tests * Improve logging * Update dependencies * Update pkg/apiserver/rest/dualwriter_mode3_test.go Co-authored-by: maicon <[email protected]> * remove test assertion * Use mode log when dual writer is initiated --------- Co-authored-by: Arati R. <[email protected]> Co-authored-by: gonvee <[email protected]> Co-authored-by: maicon <[email protected]>

codoki-pr-intelligence · 2025-09-09T17:51:30Z

Codoki PR Review

Summary: Preserve update consistency, fix metrics/log context
What’s good: Mode 3 now reads from unified storage and writes asynchronously to legacy with timeouts; List is implemented and metrics/logging are added throughout.
Review Status: ❌ Requires changes
Overall Priority: High

Issues (Critical & High only)

Severity	Issue	Why it matters
High	Correctness — Legacy update may diverge from storage; wrong metrics rec...	`…/rest/dualwriter_mode3.go` Two issues in this Update path: (1) the error path records legacy metrics for a storage failure; (2) the legacy update reuses objInfo instead of the unified-store result, risking divergence between storages (e.g., different defaulting/resourceVersion). Wrap objInfo so legacy gets the exact object returned by storage, and fix the metrics recorder.

Showing top 1 issues. Critical: 0, High: 1. See inline suggestions for more.

Key Feedback (click to expand)

Needs improvement: Update no longer propagates the unified-store result to legacy, risking divergence; a few metrics calls use the wrong recorder/labels and one log context loses fields.
Testing: Consider adding a test that asserts Update writes the exact object returned by storage to legacy (e.g., wrap objInfo so legacy receives the unified-store result). Also add an assertion that metrics use kind (not name) for labels and that storage vs. legacy recorders are correctly selected.
Open questions: Should legacy writes proceed even if the request context is canceled? Today the goroutines inherit the request context and may abort immediately; if best-effort writes are desired, consider decoupling from the parent context.

Confidence: 3/5 — Needs work before merge (1 high · status: Requires changes)

Sequence Diagram

sequenceDiagram
    participant Caller
    participant DualWriterMode3
    participant Storage
    participant Legacy
    Caller->>DualWriterMode3: Update(name, objInfo,...)
    DualWriterMode3->>Storage: Update(...)
    alt storage error
        Storage-->>DualWriterMode3: err
        DualWriterMode3-->>Caller: error
    else success
        Storage-->>DualWriterMode3: res, async, nil
        DualWriterMode3->>Legacy: Update(..., async)
        Legacy-->>DualWriterMode3: (ignored in async)
        DualWriterMode3-->>Caller: res, async, nil
    end

React with 👍 or 👎 if you found this review useful.

codoki-pr-intelligence · 2025-09-09T17:54:16Z

pkg/apiserver/rest/dualwriter_mode3.go

-	}
-	return obj, created, err
+		startLegacy := time.Now()
+		defer cancel()


⚠️ High: Two issues in this Update path: (1) the error path records legacy metrics for a storage failure; (2) the legacy update reuses objInfo instead of the unified-store result, risking divergence between storages (e.g., different defaulting/resourceVersion). Wrap objInfo so legacy gets the exact object returned by storage, and fix the metrics recorder.

Suggested change

defer cancel()

```go

if err != nil {

log.Error(err, "unable to update in storage")

d.recordStorageDuration(true, mode3Str, options.Kind, method, startStorage)

return res, async, err

}

// Ensure legacy receives the same object returned by Storage to avoid divergence

_, _, errObjectSt := d.Legacy.Update(ctx, name, &updateWrapper{upstream: objInfo, updated: res}, createValidation, updateValidation, forceAllowCreate, options)

codoki-pr-intelligence · 2025-09-09T17:54:18Z

pkg/apiserver/rest/dualwriter_mode3.go

+
+func (d *DualWriterMode3) Delete(ctx context.Context, name string, deleteValidation rest.ValidateObjectFunc, options *metav1.DeleteOptions) (runtime.Object, bool, error) {
+	var method = "delete"
+	log := d.Log.WithValues("name", name, "kind", options.Kind, "method", method)


🔷 Medium: This loses the per-call fields (name/kind/method) by attaching the base logger instead of the enriched one; also the storage metrics label uses name instead of kind. Attach the enriched logger and use options.Kind for metrics consistency.

go
ctx = klog.NewContext(ctx, log)
// ...
d.recordStorageDuration(false, mode3Str, options.Kind, method, startStorage)

codoki-pr-intelligence · 2025-09-09T17:54:19Z

pkg/apiserver/rest/dualwriter_mode3.go

+		startLegacy := time.Now()
+		ctx, cancel := context.WithTimeoutCause(ctx, time.Second*10, errors.New("legacy deletecollection timeout"))
+		defer cancel()
+		_, err := d.Legacy.DeleteCollection(ctx, deleteValidation, options, listOptions)


🔷 Medium: This records legacy DeleteCollection timing using the storage metrics function; use the legacy recorder for correct attribution.

Suggested change

_, err := d.Legacy.DeleteCollection(ctx, deleteValidation, options, listOptions)

```go

d.recordLegacyDuration(err != nil, mode3Str, options.Kind, method, startLegacy)

codoki-pr-intelligence bot reviewed Sep 9, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Dual Storage Architecture #9

Dual Storage Architecture #9

Uh oh!

zaibkhan commented Sep 9, 2025

Uh oh!

codoki-pr-intelligence bot commented Sep 9, 2025 •

edited

Loading

Uh oh!

codoki-pr-intelligence bot Sep 9, 2025

Uh oh!

codoki-pr-intelligence bot Sep 9, 2025

Uh oh!

codoki-pr-intelligence bot Sep 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

-		defer cancel()
+```go
+        if err != nil {
+            log.Error(err, "unable to update in storage")
+            d.recordStorageDuration(true, mode3Str, options.Kind, method, startStorage)
+            return res, async, err
+        }
+        // Ensure legacy receives the same object returned by Storage to avoid divergence
+        _, _, errObjectSt := d.Legacy.Update(ctx, name, &updateWrapper{upstream: objInfo, updated: res}, createValidation, updateValidation, forceAllowCreate, options)

	_, err := d.Legacy.DeleteCollection(ctx, deleteValidation, options, listOptions)
	```go
	d.recordLegacyDuration(err != nil, mode3Str, options.Kind, method, startLegacy)

Dual Storage Architecture #9

Are you sure you want to change the base?

Dual Storage Architecture #9

Uh oh!

Conversation

zaibkhan commented Sep 9, 2025

Uh oh!

codoki-pr-intelligence bot commented Sep 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codoki PR Review

Sequence Diagram

Uh oh!

codoki-pr-intelligence bot Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

codoki-pr-intelligence bot Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

codoki-pr-intelligence bot Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codoki-pr-intelligence bot commented Sep 9, 2025 •

edited

Loading