Spark date part #19823
base: main
Conversation
need to merge #19821 first
```rust
}
_ => {
    return internal_err!(
        "First argument of `DATE_PART` must be non-null scalar Utf8"
```
Same as DF `date_part`: `part` is a literal.
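The literal-only restriction fits how Spark handles the field name: `extract`/`date_part` accepts many aliases that are resolved at plan time, so the first argument has to be a known string before execution. A minimal stdlib-only sketch of that normalization step (the function name and the alias subset shown here are illustrative, not the PR's actual code; the full alias list is linked in the PR description):

```rust
/// Normalize a Spark-style date_part field name to a canonical form.
/// Only a small, illustrative subset of Spark's aliases is shown here.
fn normalize_part(part: &str) -> Option<&'static str> {
    match part.to_ascii_lowercase().as_str() {
        "year" | "years" | "yr" | "yrs" | "y" => Some("year"),
        "month" | "months" | "mon" | "mons" => Some("month"),
        "day" | "days" | "d" => Some("day"),
        "dayofweek" | "dow" => Some("dow"),
        // Unknown field names would surface as the internal error above.
        _ => None,
    }
}
```

Because the lookup happens on a plan-time literal, a non-literal first argument has no string to resolve, which is exactly why the `internal_err!` branch rejects it.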
```rust
use datafusion_expr::planner::{ExprPlanner, PlannerResult};

#[derive(Default, Debug)]
pub struct SparkFunctionPlanner;
```
If we're including this planner now, I feel we should update the lib docs with an example of using it:
https://github.com/apache/datafusion/blob/main/datafusion/spark/src/lib.rs
Yes, I can do that. I also think it would be nice to provide a way to register the expr planner and the UDFs at the same time, with something like
```rust
pub fn with_default_features(mut self) -> Self {
```
we could add a `with_spark_features`? Could track that in a separate issue/PR.
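The idea above is a chained builder method that registers the Spark UDFs and the expression planner in one call. A stdlib-only sketch of the shape being proposed (the struct and field types here are stand-ins, not DataFusion's actual `SessionStateBuilder` API):

```rust
// Stand-in builder; the real one would hold ScalarUDFs and ExprPlanners.
#[derive(Default, Debug)]
struct SessionBuilder {
    udfs: Vec<String>,          // stand-in for registered ScalarUDFs
    expr_planners: Vec<String>, // stand-in for registered ExprPlanners
}

impl SessionBuilder {
    /// Hypothetical `with_spark_features`: register the Spark UDFs and the
    /// Spark expression planner together, so callers get both in one call.
    fn with_spark_features(mut self) -> Self {
        self.udfs.push("spark_date_part".to_string());
        self.expr_planners.push("SparkFunctionPlanner".to_string());
        self
    }
}
```

The benefit is symmetry with `with_default_features`: a caller opts into the whole Spark-compatibility surface at once instead of wiring the planner and the UDFs separately.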
```rust
    internal_err!("spark date_part should have been simplified to standard date_part")
}

fn simplify(
```
I like that we're using simplify here 👍
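For readers unfamiliar with the pattern: `simplify` lets the UDF rewrite itself into another expression during optimization, so the Spark wrapper is replaced by the standard `date_part` before execution (which is why the `invoke` path can just return the internal error above). A toy stdlib-only illustration of the rewrite idea (this enum is hypothetical, not DataFusion's `Expr`):

```rust
// Toy expression tree; the real PR operates on DataFusion's `Expr`.
#[derive(Debug, PartialEq, Clone)]
enum Expr {
    Call { name: String, args: Vec<Expr> },
    Literal(String),
}

/// Rewrite `spark_date_part(part, date)` into the standard
/// `date_part(part, date)`, mirroring the PR's delegate-via-simplify approach.
fn simplify(expr: Expr) -> Expr {
    match expr {
        Expr::Call { name, args } if name == "spark_date_part" => Expr::Call {
            name: "date_part".to_string(),
            args,
        },
        other => other,
    }
}
```

After this rewrite runs, the plan only contains the built-in function, so the Spark wrapper never needs its own execution kernel.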
```rust
let date_part_expr = Expr::ScalarFunction(ScalarFunction::new_udf(
    datafusion_functions::datetime::date_part(),
    vec![part_expr, date_expr],
));
```
One concern: will the nullability of the output field match here?
Ah, you're right. Should we update:

```rust
fn return_field_from_args(&self, args: ReturnFieldArgs) -> Result<FieldRef> {
```
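The concern matters because a `simplify` rewrite must not change the plan's schema: the wrapper's declared output field has to agree with what the inner `date_part` would report, including nullability. A stdlib-only sketch of the usual propagation rule (the `Field` struct here is a minimal stand-in for an Arrow field, and the rule shown is the common convention, not necessarily the PR's exact choice):

```rust
// Minimal stand-in for an Arrow Field; only nullability matters here.
#[derive(Debug, PartialEq)]
struct Field {
    name: String,
    nullable: bool,
}

/// A common rule for scalar functions: the output is nullable if any input is.
/// `return_field_from_args` would compute something equivalent so the wrapper's
/// declared schema matches the inner `date_part` after simplification.
fn return_field(inputs: &[Field]) -> Field {
    Field {
        name: "date_part".to_string(),
        nullable: inputs.iter().any(|f| f.nullable),
    }
}
```

If the wrapper instead declared a fixed nullability, the optimizer could observe a schema mismatch once `simplify` swaps in the built-in function.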
Which issue does this PR close?
Part of datafusion-spark: Spark Compatible Functions #15914

Rationale for this change
The current `date_part` function in DataFusion has a few differences from the Spark implementation.
Full list of Spark-supported aliases: https://github.com/apache/spark/blob/a03bedb6c1281c5263a42bfd20608d2ee005ab05/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/datetimeExpressions.scala#L3356-L3371
What changes are included in this PR?
A new `date_part` function in the `spark` crate.
Are these changes tested?
Yes, with SLT (sqllogictest) tests.
Are there any user-facing changes?
Yes.