@dongjoon-hyun commented May 4, 2025

What changes were proposed in this pull request?

This PR aims to support the following APIs in DataFrame.

  • toDF
  • distinct
  • dropDuplicates
  • dropDuplicatesWithinWatermark
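
For readers unfamiliar with these operations, the following is a plain-Python model of the batch semantics that `toDF`, `distinct`, and `dropDuplicates` have in Apache Spark's DataFrame API. It is only an illustrative sketch, not the code added by this PR: the helper names (`to_df`, `distinct`, `drop_duplicates`) and the list-of-dicts row representation are assumptions made for the example. `dropDuplicatesWithinWatermark` applies to streaming data with an event-time watermark and is not modeled here.

```python
from typing import Dict, Hashable, List, Optional, Sequence

# Hypothetical row representation: one dict per row, column name -> value.
Row = Dict[str, Hashable]


def to_df(rows: List[Row], old_names: Sequence[str], new_names: Sequence[str]) -> List[Row]:
    """Model of toDF: rename columns positionally, keeping the values."""
    if len(old_names) != len(new_names):
        raise ValueError("toDF needs exactly one new name per existing column")
    return [{new: row[old] for old, new in zip(old_names, new_names)} for row in rows]


def drop_duplicates(rows: List[Row], subset: Optional[Sequence[str]] = None) -> List[Row]:
    """Model of dropDuplicates: keep one row per distinct key.

    The key is built from `subset`, or from all columns when `subset` is None.
    This model keeps the first occurrence; that ordering is an assumption of
    the sketch, not a guarantee made by Spark.
    """
    seen = set()
    kept = []
    for row in rows:
        cols = subset if subset is not None else sorted(row)
        key = tuple((c, row[c]) for c in cols)
        if key not in seen:
            seen.add(key)
            kept.append(row)
    return kept


def distinct(rows: List[Row]) -> List[Row]:
    """Model of distinct: dropDuplicates over every column."""
    return drop_duplicates(rows)


rows = [{"_1": 1, "_2": "a"}, {"_1": 1, "_2": "a"}, {"_1": 1, "_2": "b"}]
df = to_df(rows, ["_1", "_2"], ["id", "name"])
all_columns = distinct(df)            # two rows remain: (1, "a") and (1, "b")
by_id = drop_duplicates(df, ["id"])   # one row remains, since every id is 1
```

The difference between the last two calls is the point of the example: `distinct` compares whole rows, while `dropDuplicates` with a column subset compares only the named columns.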

Why are the changes needed?

For feature parity.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Pass the CIs.

Was this patch authored or co-authored using generative AI tooling?

No.

@dongjoon-hyun changed the title from "[SPARK-51995] Support distinct and dropDuplicates(WithinWatermark)?" to "[SPARK-51995] Support toDF, distinct and dropDuplicates(WithinWatermark)? in DataFrame" on May 4, 2025
@dongjoon-hyun
Member Author

Let me merge this to proceed to the next steps.

@dongjoon-hyun deleted the SPARK-51995 branch May 4, 2025 02:35