feat: delimiter for agg_concat
#6099
base: main
Greptile Overview
Greptile Summary
This PR adds an optional
Key Changes:
Implementation Details:
Test Coverage:
Confidence Score: 5/5
Important Files Changed
Sequence Diagram
sequenceDiagram
participant User
participant Python API
participant PyExpr
participant Rust DSL
participant RecordBatch
participant Series
User->>Python API: df.agg(col("a").string_agg(delimiter=" "))
Python API->>PyExpr: agg_concat(delimiter=" ")
PyExpr->>Rust DSL: Expr::agg_concat(delimiter=Some(" "))
Note over Rust DSL: Filter empty delimiter strings<br/>Store as AggExpr::Concat(expr, delimiter)
Rust DSL->>RecordBatch: eval_agg_expr(AggExpr::Concat)
RecordBatch->>Series: agg_concat(groups, delimiter)
alt Delimiter provided for Utf8
Series->>Series: join_with_delimiter(iter, delimiter)
Note over Series: Custom implementation:<br/>- Filter nulls<br/>- Join with delimiter
else No delimiter or List type
Series->>Series: DaftConcatAggable::concat()
Note over Series: Use existing trait implementation
end
Series-->>RecordBatch: Concatenated result
RecordBatch-->>Rust DSL: Series
Rust DSL-->>Python API: PyExpr
Python API-->>User: DataFrame with aggregated column
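The delimiter path in the diagram above can be illustrated with a small pure-Python model. This is only a sketch of the semantics the diagram describes (empty delimiters filtered out, nulls skipped, remaining strings joined); it does not call Daft and is not the PR's Rust implementation. The names `normalize_delimiter` and `agg_concat_utf8` are hypothetical, and two behaviors are assumptions not confirmed by the source: a group with no non-null values yields `None`, and the no-delimiter path is modeled as plain concatenation.

```python
from typing import List, Optional, Sequence


def normalize_delimiter(delimiter: Optional[str]) -> Optional[str]:
    # Per the diagram note, an empty delimiter string is filtered out,
    # so "" is treated the same as "no delimiter given".
    return delimiter if delimiter else None


def join_with_delimiter(values: Sequence[Optional[str]], delimiter: str) -> Optional[str]:
    # Filter nulls, then join what is left with the delimiter.
    non_null = [v for v in values if v is not None]
    if not non_null:
        # Assumption: an all-null (or empty) group aggregates to null.
        return None
    return delimiter.join(non_null)


def agg_concat_utf8(
    values: Sequence[Optional[str]],
    groups: Sequence[Sequence[int]],
    delimiter: Optional[str] = None,
) -> List[Optional[str]]:
    # `groups` holds row indices per group, loosely mirroring
    # agg_concat(groups, delimiter) in the diagram.
    delim = normalize_delimiter(delimiter)
    # Assumption: without a delimiter, strings are simply concatenated.
    sep = delim if delim is not None else ""
    return [join_with_delimiter([values[i] for i in idx], sep) for idx in groups]


if __name__ == "__main__":
    column = ["a", None, "b", "c"]
    row_groups = [[0, 1, 2], [3]]
    print(agg_concat_utf8(column, row_groups, delimiter=" "))
```

Normalizing the delimiter once, before any group is processed, mirrors the diagram's placement of the empty-string filter in the DSL layer rather than in the per-group join.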
4 files reviewed, no comments
13ca252 to 0cd35f2
Codecov Report
❌ Patch coverage is
Additional details and impacted files
@@             Coverage Diff              @@
##             main    #6099        +/-   ##
===========================================
- Coverage   72.91%   43.39%    -29.52%
===========================================
  Files         973      909        -64
  Lines      126196   112787     -13409
===========================================
- Hits        92016    48947     -43069
- Misses      34180    63840     +29660
0cd35f2 to 61e333a
Changes Made
Related Issues
Closes #2959.