[SPARK-53786][SQL] Default value with special column name should not conflict with real column #52504

szehon-ho · 2025-10-02T06:09:39Z

What changes were proposed in this pull request?

Fix the analysis of default value expression to not include column names

Why are the changes needed?

The following query:

CREATE TABLE t (current_timestamp DEFAULT current_timestamp)

fails with an exception:

[INVALID_DEFAULT_VALUE.NOT_CONSTANT] Failed to execute CREATE TABLE command because the destination column or variable `current_timestamp` has a DEFAULT value CURRENT_TIMESTAMP, which is not a constant expression whose equivalent value is known at query planning time. SQLSTATE: 42623;

This is because , to create a default value DSV2 expression, the code now uses the main analyzer to analyze the default value, which resolves it to the column current_timestamp. However, analyzer should not try to resolve default value to other columns.

Does this PR introduce any user-facing change?

Should fix a regression

How was this patch tested?

Add new unit test in DataSourceV2SQLSuite

Was this patch authored or co-authored using generative AI tooling?

No

szehon-ho · 2025-10-02T06:10:13Z

sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ColumnResolutionHelper.scala

+            u.copy(child = newChild)
          }

+        case d @ DefaultValueExpression(u: UnresolvedAttribute, _, _) =>


Note: before this fix, Default value expression would fall to UnresolvedAttribute above. It would then think the default value refers to the conflicting column name and fail.

case u @ UnresolvedAttribute(nameParts) => val result = withPosition(u) { resolveColumnByName(nameParts) .orElse(LiteralFunctionResolution.resolve(nameParts)) .map { // We trim unnecessary alias here. Note that, we cannot trim the alias at top-level, // as we should resolve `UnresolvedAttribute` to a named expression. The caller side // can trim the top-level alias if it's safe to do so. Since we will call // CleanupAliases later in Analyzer, trim non top-level unnecessary alias is safe. case Alias(child, _) if !isTopLevel => child case other => other } .getOrElse(u) } logDebug(s"Resolving $u to $result") result

…umn name

szehon-ho · 2025-10-02T06:22:08Z

sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ColumnResolutionHelper.scala

+        case d @ DefaultValueExpression(u: UnresolvedAttribute, _, _) =>
+          d.copy(child = LiteralFunctionResolution.resolve(u.nameParts)
+            .map {
+              case Alias(child, _) if !isTopLevel => child


I just copied this from the other code, we dont need it right?

szehon-ho · 2025-10-02T16:06:32Z

failure may not be related:

[error] org.apache.spark.sql.kafka010.KafkaMicroBatchV1SourceWithConsumerSuite, rerunning to verify

szehon-ho · 2025-10-06T21:08:30Z

a better approach is here: #52530

github-actions bot added the SQL label Oct 2, 2025

szehon-ho commented Oct 2, 2025

View reviewed changes

[SPARK-53786][SQL] Default value should not conflict with special col…

da190b3

…umn name

szehon-ho force-pushed the default_value_conflict branch from b0350a5 to da190b3 Compare October 2, 2025 06:19

szehon-ho commented Oct 2, 2025

View reviewed changes

szehon-ho changed the title ~~[SPARK-53786][SQL] Default value should not conflict with special column name~~ [SPARK-53786][SQL] Default value with special column name should not conflict with real column Oct 2, 2025

szehon-ho added 2 commits October 2, 2025 14:16

Add more tests

ad8f55e

Fix another case where special column is in the function

5bb76b4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-53786][SQL] Default value with special column name should not conflict with real column #52504

[SPARK-53786][SQL] Default value with special column name should not conflict with real column #52504

szehon-ho commented Oct 2, 2025 •

edited

Loading

Uh oh!

szehon-ho Oct 2, 2025 •

edited

Loading

Uh oh!

szehon-ho Oct 2, 2025

Uh oh!

szehon-ho commented Oct 2, 2025

Uh oh!

szehon-ho commented Oct 6, 2025

Uh oh!

Uh oh!

[SPARK-53786][SQL] Default value with special column name should not conflict with real column #52504

Are you sure you want to change the base?

[SPARK-53786][SQL] Default value with special column name should not conflict with real column #52504

Conversation

szehon-ho commented Oct 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

szehon-ho Oct 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

szehon-ho Oct 2, 2025

Choose a reason for hiding this comment

Uh oh!

szehon-ho commented Oct 2, 2025

Uh oh!

szehon-ho commented Oct 6, 2025

Uh oh!

Uh oh!

szehon-ho commented Oct 2, 2025 •

edited

Loading

szehon-ho Oct 2, 2025 •

edited

Loading