fix: r_cte wrong/flaky results under concurrency by KKould · Pull Request #19439 · databendlabs/databend

KKould · 2026-02-10T11:59:48Z

I hereby agree to the terms of the CLA available at: https://docs.databend.com/dev/policies/cla/

Summary

fixes: Tracking: Recursive CTE result may be random #19398
Fix flaky / non-deterministic results in WITH RECURSIVE ... UNION ALL ... caused by cross-query interference on recursive CTE internal MEMORY tables.
Make the recursive CTE internal table names query-unique (prefix with query id) so concurrent queries (sqllogictest parallelism in CI) can no longer create/drop/recreate the same internal table name and corrupt each other’s recursion state.
Add a deterministic regression test hook and a stable repro test that simulates the interference between step=0 and step=1.

Root Cause

Recursive CTE is executed in steps. In step=0, Databend creates internal Engine=Memory tables (one per RecursiveCteScan) and writes prepared blocks keyed by exec_id. In step>=1 it reads from the same internal MEMORY table to continue recursion.

Previously, those internal MEMORY tables were created in the current database using stable names taken directly from the recursive scan name / CTE alias (e.g. lines, paths). This makes the internal tables globally visible by (tenant, catalog, database, table_name) and not query-private.

In CI, sqllogictests frequently runs with --parallel > 1 against a single databend-query instance. Multiple concurrent sqllogictest queries can therefore execute recursive CTEs that share common aliases like lines. Because each recursive CTE run also drops its internal tables at the end, one
query can drop/recreate the same internal table name while another query is between step=0 and step=1, causing step=1 to see an empty/replaced table and terminate early (e.g. returning seed-only results such as 1 instead of the correct 1000). This appears as a rare, timing-dependent “random
result” in CI.

CI Execution Chain

scripts/ci/ci-run-sqllogic-tests.sh
- TEST_PARALLEL=${TEST_PARALLEL:-8} (default parallelism)
- runs databend-sqllogictests ... --enable_sandbox --parallel ${TEST_PARALLEL}
.github/actions/test_sqllogic_standalone_linux/action.yml
- env: TEST_PARALLEL: ${{ inputs.parallel }} (workflow overrides the script default)
- run: bash ./scripts/ci/ci-run-sqllogic-tests.sh ${{ inputs.dirs }}
.github/workflows/reuse.sqllogic.yml
- standalone job passes parallel: ${{ matrix.tests.parallel }} (empty => falls back to script default)
- cluster job has matrix entries that explicitly set parallel: "2" for some suites
scripts/ci/ci-run-sqllogic-tests-without-sandbox.sh
- runs databend-sqllogictests ... --parallel 1 (no --enable_sandbox, parallel fixed at 1)
.github/actions/test_sqllogic_iceberg_tpch/action.yml
- uses bash ./scripts/ci/ci-run-sqllogic-tests-without-sandbox.sh ... (parallel=1, no sandbox)
.github/actions/test_sqllogic_stage/action.yml
- uses bash ./scripts/ci/ci-run-sqllogic-tests-without-sandbox.sh ... (parallel=1, no sandbox)
scripts/ci/deploy/databend-query-standalone.sh
- starts databend-query ... --internal-enable-sandbox-tenant (enables SET sandbox_tenant=... used by sqllogictests)

Tests

Unit Test
Logic Test
Benchmark Test
No Test - Explain why

Type of change

Bug Fix (non-breaking change which fixes an issue)
New Feature (non-breaking change which adds functionality)
Breaking Change (fix or feature that could cause existing functionality not to work as expected)
Documentation Update
Refactoring
Performance Improvement
Other (please describe):

This change is

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f6e0c6ce82

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/query/service/src/pipelines/processors/transforms/transform_recursive_cte_source.rs

src/query/service/src/test_kits/rcte_hooks.rs

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 24cf459611

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-02-13T17:41:00Z

src/query/service/src/pipelines/processors/transforms/transform_recursive_cte_source.rs

+        if short.len() >= 32 {
+            break;


Preserve full query-id entropy in RCTE table prefix

make_rcte_prefix stops after collecting 32 alphanumeric characters, so two concurrent queries whose IDs share the same first 32 alnum chars will still map to the same internal __rcte_* table names. That reintroduces the same cross-query interference this patch is trying to eliminate (wrong/flaky recursive CTE results) for clients that provide custom/long query IDs. Generate the prefix from the full query ID (e.g., full sanitized ID or a hash of it) instead of truncating here.

Useful? React with 👍 / 👎.

fix: r_cte wrong/flaky results under concurrency

1676798

KKould requested a review from SkyFan2002 February 10, 2026 11:59

KKould self-assigned this Feb 10, 2026

github-actions bot added the pr-bugfix this PR patches a bug in codebase label Feb 10, 2026

KKould and others added 2 commits February 11, 2026 10:17

chore: codefmt

2d01ad3

Merge branch 'main' into fix/r_cte_random_result

f6e0c6c

KKould marked this pull request as ready for review February 11, 2026 08:47

chatgpt-codex-connector bot reviewed Feb 11, 2026

View reviewed changes

src/query/service/src/pipelines/processors/transforms/transform_recursive_cte_source.rs Outdated Show resolved Hide resolved

src/query/service/src/test_kits/rcte_hooks.rs Outdated Show resolved Hide resolved

chore: codex review comment

3cf409a

KKould marked this pull request as draft February 11, 2026 12:18

KKould added 3 commits February 11, 2026 20:34

fix: cte repeating prefix

406b4f9

fix: scope recursive CTE rewrite to local scan names

a75dbe6

fix: typo?

24cf459

SkyFan2002 approved these changes Feb 12, 2026

View reviewed changes

KKould marked this pull request as ready for review February 13, 2026 17:33

chatgpt-codex-connector bot reviewed Feb 13, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: r_cte wrong/flaky results under concurrency#19439

fix: r_cte wrong/flaky results under concurrency#19439
KKould wants to merge 7 commits intodatabendlabs:mainfrom
KKould:fix/r_cte_random_result

KKould commented Feb 10, 2026 •

edited by drmingdrmer

Loading

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Feb 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

KKould commented Feb 10, 2026 • edited by drmingdrmer Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Root Cause

CI Execution Chain

Tests

Type of change

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

KKould commented Feb 10, 2026 •

edited by drmingdrmer

Loading