Conversation
Signed-off-by: TheR1sing3un <chaoyang@apache.org> Co-authored-by: Y Ethan Guo <ethan.guoyihua@gmail.com>
Signed-off-by: TheR1sing3un <chaoyang@apache.org>
* perf: reduce unnecessary row group metadata loading --------- Signed-off-by: TheR1sing3un <chaoyang@apache.org>
…nt deadlocks (apache#14225) * Move hudi split loaders to resumable tasks architecture to prevent deadlocks. * Address review comments: Add javadoc and debug logging * Add testcases to verify the fix
…ed and add unpersist (apache#14069) Co-authored-by: Lokesh Jain <ljain@Lokeshs-MacBook-Pro.local>
…e should the meta fields be eliminated (apache#14230) Signed-off-by: TheR1sing3un <chaoyang@apache.org>
…-30065 (apache#13358) Co-authored-by: sumi <sumi.mathew@ibm.com>
1. Claim RFC-81: Introduce Primary Key Sorted Table Signed-off-by: TheR1sing3un <chaoyang@apache.org>
…4161) Co-authored-by: Jonathan Vexler <=> Co-authored-by: sivabalan <n.siva.b@gmail.com> Co-authored-by: Vamsi <vamsi@onehouse.ai> Co-authored-by: Y Ethan Guo <ethan.guoyihua@gmail.com> Co-authored-by: Lin Liu <linliu.code@gmail.com>
1. introduce pk filter to log file Signed-off-by: TheR1sing3un <chaoyang@apache.org>
…ark test to avoid flaky tests (apache#14198) Signed-off-by: TheR1sing3un <chaoyang@apache.org>
…dd new APIs based on current usage of Avro schema (apache#14265) Co-authored-by: Balaji Varadarajan <balaji@Balajis-Laptop.local> Co-authored-by: Balaji Varadarajan <balaji@Balajis-Laptop.attlocal.net> Co-authored-by: Timothy Brown <tim@onehouse.ai>
… use HoodieSchema (apache#14313)
…ache#14287) --------- Co-authored-by: Pavithran Ravichandiran <pavithran@Pavithrans-MacBook-Pro.local>
…pache#14060) Co-authored-by: Jonathan Vexler <=> Co-authored-by: Y Ethan Guo <ethan.guoyihua@gmail.com>
…a subtypes not returned (apache#14346)
…rPushDown` (apache#14332) 1. push down pk filters to log file when spark enable `parquetFilterPushDown` The previous judgment logic was a typo error. Whether to push down depends on the `parquetFilterPushDown`, while whether to perform a record filter at the parquet level depends on the `parquetRecordFilterEnabled` Signed-off-by: TheR1sing3un <chaoyang@apache.org>
…ion in the compaction plan (apache#14362) fix the metrics for file slice with filtered log files. --------- Signed-off-by: TheR1sing3un <chaoyang@apache.org> Co-authored-by: danny0405 <yuzhao.cyz@gmail.com>
the corner case: the load instant range is contained within one of the archived file instant range. --------- Signed-off-by: TheR1sing3un <chaoyang@apache.org> Co-authored-by: danny0405 <yuzhao.cyz@gmail.com>
Signed-off-by: TheR1sing3un <chaoyang@apache.org>
…pache#13792) Co-authored-by: Y Ethan Guo <ethan.guoyihua@gmail.com>
…14309) * feat: Support read virutal metadata columns for Flink reader
…pache#17456) * refactor: Add helper to get HoodieSchema in TableSchemaResolver * address tim comment
…e start and end time for both active and archive timelines (apache#14261) This PR introduces a new comprehensive show_timeline procedure for Hudi Spark SQL that provides detailed timeline information for all table operations. The procedure displays timeline instants including commits, deltacommits, compactions, clustering, cleaning, and rollback operations with support for both active and archived timelines and completed/pending state instants. Features added: Comprehensive timeline view: Shows all timeline instants with detailed metadata including state transitions (REQUESTED, INFLIGHT, COMPLETED) Time-based filtering: Support for startTime and endTime parameters to filter results within specific time ranges Archive timeline support: showArchived parameter to include archived timeline data for complete historical view Generic SQL filtering: filter parameter supporting SQL expressions for flexible result filtering Rich metadata output: Includes formatted timestamps, rollback information, and table type details --------- Co-authored-by: vamshikrishnakyatham <vamshikrishna.kyatham.22@gmail.com> Co-authored-by: Pavithran Ravichandiran <pavithran@Pavithrans-MBP.attlocal.net> Co-authored-by: Pavithran Ravichandiran <pavithran@Pavithrans-MacBook-Pro.local>
…pache#14061) Co-authored-by: Jonathan Vexler <=> Co-authored-by: Y Ethan Guo <ethan.guoyihua@gmail.com> Co-authored-by: Timothy Brown <tim@onehouse.ai>
…pache#14311) * apache#14267 - phase 2: Perform Column Statistics Schema Migration * Change method parameters in HoodieTableMetadataUtil to HoodieSchema * Fix type erasure issue due collector + stream usage * Address comments * Account for decimal being a bytes type * Remove formatting to reduce delta 1 * Remove formatting to reduce delta 2 * Remove formatting to reduce delta 3 * Remove formatting to reduce delta 4 * Remove formatting to reduce delta 5 * Address comments in TestHoodieTableMetadataUtil * Address comments in TestHoodieTableMetadataUtil (hudi-common) * Address comments again * Fix tests * Address comments * Fix checkstyle errors * Use getTableSchema instead of getTableAvroSchema
7717a17 to
57b4de1
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Describe the issue this Pull Request addresses
Summary and Changelog
Impact
Risk Level
Documentation Update
Contributor's checklist