Fix row estimation for parallel subquery paths. #1284

avamingli · 2025-08-01T09:22:34Z

In CBDB, a path's row estimate is derived from the relation's rows and the number of cluster segments.
However, when there is a parallel subquery scan path, each worker processes fewer rows (the count is further divided by parallel_workers), and the existing estimate does not reflect this.

set enable_parallel = off;
explain SELECT e.name
FROM employees e
WHERE e.salary > (
    SELECT AVG(salary)
    FROM employees
    WHERE department_id = e.department_id);
                                                         QUERY PLAN                                                         
----------------------------------------------------------------------------------------------------------------------------
 Gather Motion 3:1  (slice1; segments: 3)  (cost=163.42..307.76 rows=3767 width=218)
   ->  Hash Join  (cost=163.42..257.54 rows=1256 width=218)
         Hash Cond: (e.department_id = "Expr_SUBQUERY".csq_c0)
         Join Filter: (e.salary > "Expr_SUBQUERY".csq_c1)
         ->  Seq Scan on employees e  (cost=0.00..71.67 rows=3767 width=254)
         ->  Hash  (cost=150.92..150.92 rows=1000 width=36)
               ->  Broadcast Motion 3:3  (slice2; segments: 3)  (cost=130.09..150.92 rows=1000 width=36)
                     ->  Subquery Scan on "Expr_SUBQUERY"  (cost=130.09..137.59 rows=333 width=36)
                           ->  Finalize HashAggregate  (cost=130.09..134.26 rows=333 width=36)
                                 Group Key: employees.department_id
                                 ->  Redistribute Motion 3:3  (slice3; segments: 3)  (cost=90.50..122.67 rows=990 width=36)
                                       Hash Key: employees.department_id
                                       ->  Partial HashAggregate  (cost=90.50..102.87 rows=990 width=36)
                                             Group Key: employees.department_id
                                             ->  Seq Scan on employees  (cost=0.00..71.67 rows=3767 width=36)
 Optimizer: Postgres query optimizer
(16 rows)

With parallelism disabled, the subquery scan is estimated as:
Subquery Scan on "Expr_SUBQUERY" (cost=130.09..137.59 rows=333 width=36)
With parallelism enabled, the Subquery Scan still reports the same row count (rows=333), although its cost is lower:

set enable_parallel = on;
set min_parallel_table_scan_size = 0;
explain SELECT e.name
FROM employees e
WHERE e.salary > (
    SELECT AVG(salary)
    FROM employees
    WHERE department_id = e.department_id);
                                                        QUERY PLAN                                                         
---------------------------------------------------------------------------------------------------------------------------
 Gather Motion 6:1  (slice1; segments: 6)  (cost=131.17..245.45 rows=3767 width=218)
   ->  Parallel Hash Join  (cost=131.17..201.50 rows=628 width=218)
         Hash Cond: (e.department_id = "Expr_SUBQUERY".csq_c0)
         Join Filter: (e.salary > "Expr_SUBQUERY".csq_c1)
         ->  Parallel Seq Scan on employees e  (cost=0.00..52.83 rows=1883 width=254)
         ->  Parallel Hash  (cost=118.67..118.67 rows=1000 width=36)
               ->  Broadcast Workers Motion 6:6  (slice2; segments: 6)  (cost=99.92..118.67 rows=1000 width=36)
                     ->  Subquery Scan on "Expr_SUBQUERY"  (cost=99.92..105.33 rows=333 width=36)
                           ->  HashAggregate  (cost=99.92..102.00 rows=167 width=36)
                                 Group Key: employees.department_id
                                 ->  Redistribute Motion 6:6  (slice3; segments: 6)  (cost=0.00..90.50 rows=1883 width=36)
                                       Hash Key: employees.department_id
                                       Hash Module: 3
                                       ->  Parallel Seq Scan on employees  (cost=0.00..52.83 rows=1883 width=36)
 Optimizer: Postgres query optimizer
(15 rows)

This commit fixes that issue.

The correction not only makes parallel subquery estimation more accurate, but also enables the entire plan to be as parallel as possible, particularly for subqueries in complex queries.
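The mismatch can be checked against the row counts in the two plans above. The following is only an illustrative sketch, not the planner's code: `per_process_rows` is a hypothetical helper, and the figure of 2 workers per segment is an assumption inferred from the plan going from 3 to 6 processes ("Gather Motion 3:1" versus "Gather Motion 6:1").

```python
def per_process_rows(total_rows: float, segments: int, workers_per_segment: int = 1) -> float:
    """Hypothetical helper: spread a cluster-wide row estimate evenly
    over every process (segments * workers per segment)."""
    if segments < 1 or workers_per_segment < 1:
        raise ValueError("segments and workers_per_segment must be >= 1")
    return total_rows / (segments * workers_per_segment)


# Row counts taken from the EXPLAIN output above: the Broadcast Motion
# above the Subquery Scan carries rows=1000 in both plans.
serial = per_process_rows(1000, segments=3)
parallel = per_process_rows(1000, segments=3, workers_per_segment=2)

print(round(serial))    # 333, matches the non-parallel Subquery Scan
print(round(parallel))  # 167, matches the parallel HashAggregate, not the Subquery Scan (333)
```

Under these assumptions the parallel Subquery Scan would be expected to show roughly rows=167, the same as the HashAggregate beneath it, rather than the rows=333 carried over from the non-parallel estimate. The same division reproduces the Hash Join estimates in both plans (3767 rows over 3 processes gives about 1256, over 6 gives about 628).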

Authored-by: Zhang Mingli [email protected]

Fixes #ISSUE_Number

What does this PR do?

Type of Change

Bug fix (non-breaking change)
New feature (non-breaking change)
Breaking change (fix or feature with breaking changes)
Documentation update

Breaking Changes

Test Plan

Unit tests added/updated
Integration tests added/updated
Passed make installcheck
Passed make -C src/test installcheck-cbdb-parallel

Impact

Performance:

User-facing changes:

Dependencies:

Checklist

Followed contribution guide
Added/updated documentation
Reviewed code for security implications
Requested review from cloudberry committers

Additional Context

CI Skip Instructions

avamingli · 2025-08-05T06:13:35Z

Previously, we attempted to disable window functions inside CASE WHEN
expressions due to concerns about unstable parallel results. However,
this was a misunderstanding. All expressions from the subquery are Var
columns, not the original expressions.

This issue was uncovered when we fixed the subquery row count
estimation, causing the cost to change in the upper plan.

Fixed in the commit "Correct parallel window function in CASE WHEN".

EXPLAIN(COSTS OFF)
SELECT empno, depname, salary, bonus, depadj, MIN(bonus) OVER (ORDER BY empno), MAX(depadj) OVER () FROM(
	SELECT *,
		CASE WHEN enroll_date < '2008-01-01' THEN 2008 - extract(YEAR FROM enroll_date) END * 500 AS bonus,
		CASE WHEN
			AVG(salary) OVER (PARTITION BY depname) < salary
		THEN 200 END AS depadj FROM empsalary
)s;
                                        QUERY PLAN
--------------------------------------------------------------------------
 WindowAgg
   ->  WindowAgg
         Order By: s.empno
         ->  Gather Motion 6:1  (slice1; segments: 6)
               Merge Key: s.empno
               ->  Sort
                     Sort Key: s.empno
                     ->  Subquery Scan on s
                           ->  WindowAgg
                                 Partition By: empsalary.depname
                                 ->  Sort
                                       Sort Key: empsalary.depname
                                       ->  Redistribute Motion 6:6  (slice2; segments: 6)
                                             Hash Key: empsalary.depname
                                             Hash Module: 3
                                             ->  Parallel Seq Scan on empsalary
 Optimizer: Postgres query optimizer
(17 rows)

my-ship-it

LGTM

avamingli requested review from gfphoenix78, my-ship-it, weinan003 and yjhjstz August 4, 2025 03:38

avamingli added the planner label Aug 4, 2025

my-ship-it force-pushed the fix_subquery_rows branch from 445eb65 to e572023 Compare August 4, 2025 09:20

avamingli force-pushed the fix_subquery_rows branch 2 times, most recently from 414c999 to 0791ff6 Compare August 5, 2025 05:04

avamingli force-pushed the fix_subquery_rows branch 2 times, most recently from 519c515 to 06ee760 Compare August 8, 2025 08:53

avamingli force-pushed the fix_subquery_rows branch from 06ee760 to 5a938a9 Compare August 13, 2025 07:27

avamingli added 2 commits August 14, 2025 13:09

avamingli force-pushed the fix_subquery_rows branch from 5a938a9 to 75b1744 Compare August 14, 2025 05:09

avamingli mentioned this pull request Aug 14, 2025

Give UNION ALL more opportunities for parallel plans in MPP. #1311

Merged

12 tasks

my-ship-it approved these changes Aug 14, 2025

weinan003 approved these changes Aug 14, 2025

avamingli merged commit 8b01eaf into apache:main Aug 14, 2025
27 checks passed

avamingli deleted the fix_subquery_rows branch August 14, 2025 08:26
