-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Open
Labels
enhancementNew feature or requestNew feature or request
Description
Is your feature request related to a problem or challenge?
Tracking ticket for next release, also a place to track desired inclusions
Previous release will be #17558 (Nov, 2025) so next major release would be approximately late Dec 2025 / early Jan 2026
Steps:
- Create a
branch-52branch: - Update version and changelog:
- Test with DataFusion Python
- Test with DataFusion Comet
- Test with DataFusion Ballista
- Test with delta.rs:
- Test vortex:
- Test with iceberg-rust:
- Test with LakeSail:
- Test with parquet viewer
- Test with datafusion-materialized-views
- Voting Thread:
- Publish to crates.io
- Create ticket for next release:
- Create blog post:
Features to mention in the blog (if they make it)
- Support Arrow IPC Stream Files #18457
- [EPIC] A collection of items to improve CASE performance #18075
- Examples of extending SQL syntax #17824 from the @theirix / Add relation planner extension support to customize SQL planning #17843 from @geoffreyclaude
- [EPIC] ListingTable object store usage improvements #17214
- [EPIC] Statistics Cache Improvements #19051
- feat: change Expr OuterReferenceColumn and Alias to Box type for reducing expr struct size #16771
- Support Push down expression evaluation in
TableProviders#14993 / Plan to replaceSchemaAdapterwithPhysicalExprAdapter#16800 from @adriangb - Reworked examples from @cj-zhukov (highlight their existence)
- Push down entire hash table from HashJoinExec into scans #17171 (comment)
- Add push down sort to the source (table provider) #10433 / Establish the high level API for sort pushdown and the optimizer rule and support reverse files and row groups #19064 from @zhuqi-lucas
- feat: Add DELETE/UPDATE hooks to TableProvider trait and to MemTable implementation #19142
- [EPIC] Remove
CoalesceBatchesExecoperator #18779
Bugs that should be fixed
- Regression: error planning TPC-DS query: input schema nullability mismatch #17801
- Add upgrade guide for PhysicalOptimizerRule::optimize_plan #19029
- Add
force_filter_selectionsto restorepushdown_filtersbehavior prior to parquet 57.1.0 upgrade #19003 (review) - Restore IN_LIST performance -- Implement specialized
StaticFiltersfor different data types #18824 - Intermittent CI failures in
cache::list_files_cache::tests::test_cache_with_ttl_and_lru( seems flaky) #19114 - Allow log/pow on negative scale decimals #19250
- Confusing behavior now required to to refresh the files of a listing table #19573
Community Wishlist
- Streaming Aggregate operator not being used in deduplication of pre-sorted Parquet files #16919
- Regression:
DataFrameWriteOptions::with_single_file_outputproduces a directory #13323 - fix(parquet): converting parquet schema with backward compatible repeated struct/primitive with provided arrow schema arrow-rs#8496
- Respect execution timezone in
to_timestampand related functions #18025 - Row group limit pruning #18868
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request