[spark] Fix merge into update columns detection by leaves12138 · Pull Request #7868 · apache/paimon

leaves12138 · 2026-05-15T09:52:57Z

What changed

This updates Spark merge-into data evolution update column detection so target self-assignments are not treated as modified columns when Spark adds or changes qualifiers. The logic now treats matching AttributeReference exprIds as the same target column, while still including source-side assignments such as target_col = source_col.

Why

Spark AttributeReference.equals also compares qualifiers, so the same target field can compare unequal as file_name#2 versus t.file_name#2. That made updateColumns include unchanged fields and caused partial updates to rewrite more columns than expected.

Validation

mvn -pl paimon-spark/paimon-spark-common -DskipTests -Dcheckstyle.skip -Drat.skip -Dspotless.check.skip=true compile
mvn -pl paimon-spark/paimon-spark-common -DskipTests -Dcheckstyle.skip -Drat.skip spotless:check

JingsongLi

+1

leaves12138 marked this pull request as ready for review May 15, 2026 09:53

leaves12138 force-pushed the codex/spark-merge-update-columns-master branch from d17a057 to 081fe31 Compare May 15, 2026 09:57

Fix spark merge into update columns incorrect

2214b94

leaves12138 force-pushed the codex/spark-merge-update-columns-master branch from 081fe31 to 2214b94 Compare May 15, 2026 09:59

leaves12138 added 4 commits May 15, 2026 18:38

Add tests for Spark merge update column detection

d5bff08

Simplify spark merge update column detection

6a421cc

Guard spark merge update assignment keys

4dbed4e

Stabilize raw blob split file test

6bb450b

JingsongLi approved these changes May 18, 2026

View reviewed changes

JingsongLi merged commit 65b0589 into apache:master May 18, 2026
12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[spark] Fix merge into update columns detection#7868

[spark] Fix merge into update columns detection#7868
JingsongLi merged 5 commits into
apache:masterfrom
leaves12138:codex/spark-merge-update-columns-master

leaves12138 commented May 15, 2026 •

edited

Loading

Uh oh!

JingsongLi left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

leaves12138 commented May 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changed

Why

Validation

Uh oh!

JingsongLi left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

leaves12138 commented May 15, 2026 •

edited

Loading