[EPIC] Refactor all expression serde logic out of `QueryPlanSerde` #2019

### What is the problem the feature request solves?

The `QueryPlanSerde.exprToProtoInternal` method both serializes Spark expressions to protocol buffer format and checks whether Comet supports each expression. The file has grown very large and is hard to navigate, so we would like to move the per-expression logic into separate classes.

As an example, here is the original approach for handling the `Add` expression:

```scala
      case add @ Add(left, right, _) if supportedDataType(left.dataType) =>
        createMathExpression(
          expr,
          left,
          right,
          inputs,
          binding,
          add.dataType,
          add.evalMode == EvalMode.ANSI,
          (builder, mathExpr) => builder.setAdd(mathExpr))

      case add @ Add(left, _, _) if !supportedDataType(left.dataType) =>
        withInfo(add, s"Unsupported datatype ${left.dataType}")
        None
```

The new approach is to move this logic into a separate file, as its own object:

```scala
object CometAdd extends CometExpressionSerde with MathBase {
  override def convert(
      expr: Expression,
      inputs: Seq[Attribute],
      binding: Boolean): Option[ExprOuterClass.Expr] = {
    val add = expr.asInstanceOf[Add]
    if (!supportedDataType(add.left.dataType)) {
      withInfo(add, s"Unsupported datatype ${add.left.dataType}")
      return None
    }
    createMathExpression(
      expr,
      add.left,
      add.right,
      inputs,
      binding,
      add.dataType,
      add.evalMode == EvalMode.ANSI,
      (builder, mathExpr) => builder.setAdd(mathExpr))
  }
}
```

These objects are then registered in a map in `QueryPlanSerde`, keyed by expression class:

```scala
  private val exprSerdeMap: Map[Class[_], CometExpressionSerde] = Map(
    classOf[Add] -> CometAdd,
    classOf[Subtract] -> CometSubtract,
    classOf[Multiply] -> CometMultiply,
    ...
```
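To make the dispatch step concrete, here is a minimal, self-contained sketch of how a class-keyed map can route an expression to its serde object. It is not Comet's actual code: `Expr`, `Lit`, `ExprSerde`, `AddSerde`, `SubtractSerde`, and `Dispatch` are hypothetical stand-ins, and the serdes return a `String` rather than an `ExprOuterClass.Expr` so that the example runs without Spark or protobuf on the classpath.

```scala
// Hypothetical stand-ins for Spark expression classes (illustration only).
sealed trait Expr
final case class Lit(value: Int) extends Expr
final case class Add(left: Expr, right: Expr) extends Expr
final case class Subtract(left: Expr, right: Expr) extends Expr
final case class Unsupported(name: String) extends Expr

// Stand-in for CometExpressionSerde; returns a String instead of a protobuf message.
trait ExprSerde {
  def convert(expr: Expr): Option[String]
}

object AddSerde extends ExprSerde {
  override def convert(expr: Expr): Option[String] = {
    val add = expr.asInstanceOf[Add]
    // Both children must serialize, otherwise the whole expression is unsupported.
    for {
      l <- Dispatch.toProto(add.left)
      r <- Dispatch.toProto(add.right)
    } yield s"add($l, $r)"
  }
}

object SubtractSerde extends ExprSerde {
  override def convert(expr: Expr): Option[String] = {
    val sub = expr.asInstanceOf[Subtract]
    for {
      l <- Dispatch.toProto(sub.left)
      r <- Dispatch.toProto(sub.right)
    } yield s"sub($l, $r)"
  }
}

object Dispatch {
  // Class-keyed registry, mirroring the shape of the map shown above.
  private val serdeMap: Map[Class[_], ExprSerde] = Map(
    classOf[Add] -> AddSerde,
    classOf[Subtract] -> SubtractSerde)

  def toProto(expr: Expr): Option[String] = expr match {
    case Lit(v) => Some(v.toString)
    // Look up the handler by runtime class; None means "not supported".
    case other => serdeMap.get(other.getClass).flatMap(_.convert(other))
  }
}
```

In this sketch, an expression whose class has no entry in the map yields `None`, as does any expression with a child that cannot be serialized.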

This approach has several benefits:

- Expressions no longer share a single piece of logic for determining which data types are supported (different expressions support different types)
- Unit tests become easier to write (https://github.com/apache/datafusion-comet/issues/2020)
- Once all expressions have migrated to the new pattern, it will be easier to automate generating documentation about supported expressions
- Common patterns are likely to emerge, allowing us to refactor the code and reduce boilerplate
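As a rough illustration of the first and third benefits, the sketch below assumes each serde object declares its own set of supported types, which a documentation generator can then iterate over. Everything here is hypothetical: the `DataType` stand-ins, the `supportedTypes` member, and the example type sets are invented for the example and do not describe Comet's API or its actual type support.

```scala
// Hypothetical stand-ins; none of these names or type sets come from Comet or Spark.
sealed trait DataType
case object IntType extends DataType
case object StringType extends DataType
case object DecimalType extends DataType

trait SerdeSketch {
  def name: String
  // Each expression declares its own supported types instead of sharing one check.
  def supportedTypes: Set[DataType]
  def isSupported(dt: DataType): Boolean = supportedTypes.contains(dt)
}

object AddSketch extends SerdeSketch {
  val name = "Add"
  val supportedTypes: Set[DataType] = Set(IntType, DecimalType)
}

object UpperSketch extends SerdeSketch {
  val name = "Upper"
  val supportedTypes: Set[DataType] = Set(StringType)
}

object DocsSketch {
  val all: Seq[SerdeSketch] = Seq(AddSketch, UpperSketch)

  // Render one markdown table row per registered expression.
  def markdownTable: String = {
    val header = "| Expression | Supported types |\n| --- | --- |"
    val rows = all.map { s =>
      val types = s.supportedTypes.toSeq.map(_.toString).sorted.mkString(", ")
      s"| ${s.name} | $types |"
    }
    (header +: rows).mkString("\n")
  }
}
```

Because each object is self-contained, a unit test can exercise it directly (for example `AddSketch.isSupported(StringType)`) without going through one large `match` expression.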

### Describe the potential solution

Convert the following expressions:

- [x] Add
- [x] Subtract
- [x] Multiply
- [x] Divide
- [x] IntegralDivide
- [x] Remainder
- [x] ArrayAppend
- [x] ArrayContains
- [x] ArrayDistinct
- [x] ArrayExcept
- [x] ArrayInsert
- [x] ArrayIntersect
- [x] ArrayJoin
- [x] ArrayMax
- [x] ArrayRemove
- [x] ArrayRepeat
- [x] ArraysOverlap
- [x] ArrayUnion
- [x] CreateArray
- [x] Ascii
- [x] ConcatWs
- [x] Chr
- [x] InitCap
- [x] BitwiseCount
- [x] BitwiseGet
- [x] BitwiseNot
- [x] BitwiseOr
- [x] BitwiseXor
- [x] BitLength
- [x] FromUnixTime
- [x] Length
- [x] Acos
- [x] Cos
- [x] Asin
- [x] Sin
- [x] Atan
- [x] Tan
- [x] Exp
- [x] Expm1
- [x] Sqrt
- [x] Signum
- [x] Md5
- [x] ShiftLeft
- [x] ShiftRight
- [x] StringInstr
- [x] StringRepeat
- [x] StringReplace
- [x] StringTranslate
- [x] StringTrim
- [x] StringTrimLeft
- [x] StringTrimRight
- [x] StringTrimBoth
- [x] Upper
- [x] Lower
- [x] Murmur3Hash
- [x] XxHash64
- [x] MapKeys
- [x] MapValues
- [x] MapFromArrays
- [x] GetMapValue
- [x] GreaterThan 
- [x] GreaterThanOrEqual 
- [x] LessThan 
- [x] LessThanOrEqual 
- [x] Substring 
- [x] Like 
- [x] RLike 
- [x] StartsWith 
- [x] EndsWith 
- [x] Contains 
- [x] StringSpace 
- [x] Hour 
- [x] Minute 
- [x] DateAdd 
- [x] DateSub 
- [x] TruncDate 
- [x] TruncTimestamp 
- [x] Second 
- [x] Year 
- [x] IsNull 
- [x] IsNotNull 
- [x] IsNaN
- [x] Atan2 
- [x] Ceil 
- [x] Floor 
- [x] Log 
- [x] Log10 
- [x] Log2 
- [x] Pow 
- [x] Round 
- [x] StringDecode 
- [x] OctetLength 
- [x] Reverse 
- [x] BitwiseAnd 
- [x] In 
- [x] InSet 
- [x] StringRPad 
- [x] Sha2 
- [x] CreateNamedStruct - https://github.com/apache/datafusion-comet/pull/2257
- [x] GetStructField - https://github.com/apache/datafusion-comet/pull/2257
- [x] GetArrayItem - https://github.com/apache/datafusion-comet/pull/2257
- [x] ElementAt - https://github.com/apache/datafusion-comet/pull/2257
- [x] GetArrayStructFields - https://github.com/apache/datafusion-comet/pull/2257
- [x] StructsToJson - https://github.com/apache/datafusion-comet/pull/2257
- [x] ArrayFilter 
- [x] Rand 
- [x] Randn 
- [x] And - https://github.com/apache/datafusion-comet/pull/2265
- [x] Or - https://github.com/apache/datafusion-comet/pull/2265
- [x] Not(In) - https://github.com/apache/datafusion-comet/pull/2265
- [x] Not - https://github.com/apache/datafusion-comet/pull/2265
- [x] EqualTo - https://github.com/apache/datafusion-comet/pull/2265
- [x] Not(EqualTo) - https://github.com/apache/datafusion-comet/pull/2265
- [x] EqualNullSafe - https://github.com/apache/datafusion-comet/pull/2265
- [x] Not(EqualNullSafe) - https://github.com/apache/datafusion-comet/pull/2265
- [x] If - https://github.com/apache/datafusion-comet/pull/2266
- [x] CaseWhen - https://github.com/apache/datafusion-comet/pull/2266
- [x] Alias 
- [x] AttributeReference
- [ ] TryCast - https://github.com/apache/datafusion-comet/pull/2242
- [ ] Cast - https://github.com/apache/datafusion-comet/pull/2242
- [ ] Literal
- [ ] Hex 
- [ ] Unhex 
- [ ] SortOrder 
- [ ] PromotePrecision 
- [ ] CheckOverflow 
- [ ] UnaryMinus 
- [ ] KnownFloatingPointNormalized 
- [ ] ScalarSubquery 
- [ ] UnscaledValue 
- [ ] MakeDecimal 
- [ ] BloomFilterMightContain 
- [ ] RegExpReplace 

### Additional context

_No response_
