ORCA: create windows hash aggregation physical operator when vectorization enabled #1258

jiaqizho · 2025-07-25T08:07:37Z

Fixes #ISSUE_Number

What does this PR do?

Type of Change

Bug fix (non-breaking change)
New feature (non-breaking change)
Breaking change (fix or feature with breaking changes)
Documentation update

Breaking Changes

Test Plan

Unit tests added/updated
Integration tests added/updated
Passed make installcheck
Passed make -C src/test installcheck-cbdb-parallel

Impact

Performance:

User-facing changes:

Dependencies:

Checklist

Followed contribution guide
Added/updated documentation
Reviewed code for security implications
Requested review from cloudberry committers

Additional Context

CI Skip Instructions

src/backend/utils/misc/guc_gp.c

zhangyue-hashdata · 2025-07-29T10:13:13Z

Is it better to add Assert(!node->isWindowHashAgg); in ExecInitWindowAgg()?

zhangyue-hashdata · 2025-07-29T11:40:45Z

Is it better to add code below in explain.c?

WindowAgg *wagg = castNode(WindowAgg, plan);
pname = sname = wagg->isWindowHashAgg ? "WindowHashAgg" : "WindowAgg";

zhangyue-hashdata · 2025-07-29T11:41:50Z

Is it better to add code below in explain.c?

WindowAgg *wagg = castNode(WindowAgg, plan);
pname = sname = wagg->isWindowHashAgg ? "WindowHashAgg" : "WindowAgg";

jiaqizho · 2025-08-06T06:54:59Z

Is it better to add code below in explain.c?

WindowAgg *wagg = castNode(WindowAgg, plan);
pname = sname = wagg->isWindowHashAgg ? "WindowHashAgg" : "WindowAgg";

no need, cause the the row executor won't get the WindowHashAgg node. Only need add the logical in vectorization/***/explain.c

src/backend/gpopt/CGPOptimizer.cpp

my-ship-it

LGTM in general except one comment

…xector In this PR, ORCA now supports generating `WindowHashAgg` plans which already have implementation in the vectorization executor. However, the CBDB row executor currently lacks implementation for the WindowHashAgg operator. To prevent ORCA from generating this operator in the row executor, I've added an struct which named `OptimizerOptions` to control the plan for row executor or vectorization executor. (By the way, ORCA may later generate plans specifically for the vectorization executor). The `WindowAgg` operator implemention in the vectorization execution is: 1. First, sorting the input rows by `ORDER BY` keys 2. Then do the `PARTITION` by `PARTITION BY` keys 3. Finally do the window function. Since step1 must be globally sorted, it cannot be parallelized in the vectorization executor. This results in poor performance of the `WindowAgg` operator. By contrast, `WindowHashAgg` employs a more efficient approach: 1. First hashes input data into buckets based on `PARTITION BY` keys 2. Then sorts data `within each bucket` according to `ORDER BY` keys 3. Finally computes window functions on the sorted bucket data For the row engine, `WindowHashAgg` operators will not be generated. Also current commit introduces a new GUC named `optimizer_force_window_hash_agg` to force generate plans with `WindowHashAgg` (Don't used this GUC expect debug ORCA). Co-Author-By: zhangyue <[email protected]>

jiaqizho changed the title ~~ORCA: create windows hash aggregation physical operator when vectorization enabled~~ [DNM]ORCA: create windows hash aggregation physical operator when vectorization enabled Jul 25, 2025

jiaqizho commented Jul 25, 2025

View reviewed changes

src/backend/utils/misc/guc_gp.c Outdated Show resolved Hide resolved

my-ship-it self-requested a review July 25, 2025 08:19

jiaqizho force-pushed the orca-support-hash-windowagg branch 3 times, most recently from 4714e80 to 3c9d812 Compare July 29, 2025 03:08

jiaqizho force-pushed the orca-support-hash-windowagg branch from 3c9d812 to 8374618 Compare July 30, 2025 07:28

jiaqizho force-pushed the orca-support-hash-windowagg branch from 8374618 to 4856e06 Compare August 6, 2025 06:55

jiaqizho changed the title ~~[DNM]ORCA: create windows hash aggregation physical operator when vectorization enabled~~ ORCA: create windows hash aggregation physical operator when vectorization enabled Aug 6, 2025

jiaqizho force-pushed the orca-support-hash-windowagg branch from 4856e06 to 3ad1cd4 Compare August 12, 2025 08:32

my-ship-it reviewed Aug 14, 2025

View reviewed changes

src/backend/gpopt/CGPOptimizer.cpp Show resolved Hide resolved

my-ship-it approved these changes Aug 14, 2025

View reviewed changes

jiaqizho force-pushed the orca-support-hash-windowagg branch from 3ad1cd4 to 2ac1f69 Compare August 14, 2025 07:27

gongxun0928 approved these changes Aug 14, 2025

View reviewed changes

jiaqizho force-pushed the orca-support-hash-windowagg branch from 2ac1f69 to 1b1675c Compare August 15, 2025 01:47

jiaqizho force-pushed the orca-support-hash-windowagg branch from 1b1675c to e9a0f5f Compare August 15, 2025 03:52

jiaqizho merged commit a840049 into apache:main Aug 15, 2025
27 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ORCA: create windows hash aggregation physical operator when vectorization enabled #1258

ORCA: create windows hash aggregation physical operator when vectorization enabled #1258

Uh oh!

jiaqizho commented Jul 25, 2025

Uh oh!

Uh oh!

zhangyue-hashdata commented Jul 29, 2025

Uh oh!

zhangyue-hashdata commented Jul 29, 2025

Uh oh!

zhangyue-hashdata commented Jul 29, 2025

Uh oh!

jiaqizho commented Aug 6, 2025

Uh oh!

Uh oh!

my-ship-it left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ORCA: create windows hash aggregation physical operator when vectorization enabled #1258

ORCA: create windows hash aggregation physical operator when vectorization enabled #1258

Uh oh!

Conversation

jiaqizho commented Jul 25, 2025

What does this PR do?

Type of Change

Breaking Changes

Test Plan

Impact

Checklist

Additional Context

CI Skip Instructions

Uh oh!

Uh oh!

zhangyue-hashdata commented Jul 29, 2025

Uh oh!

zhangyue-hashdata commented Jul 29, 2025

Uh oh!

zhangyue-hashdata commented Jul 29, 2025

Uh oh!

jiaqizho commented Aug 6, 2025

Uh oh!

Uh oh!

my-ship-it left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants