-
Notifications
You must be signed in to change notification settings - Fork 1k
[thrift-remodel] Incorporate changes made to geospatial statistics #8528
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
etseidl
merged 221 commits into
apache:gh5854_thrift_remodel
from
etseidl:merge_geo_spatial
Oct 2, 2025
Merged
Changes from all commits
Commits
Show all changes
221 commits
Select commit
Hold shift + click to select a range
e3a0b50
custom PageLocation decoder for speed
etseidl 71d3859
fix recently added test
etseidl ff42e5a
clippy
etseidl 1f2c216
experimental new form for column index
etseidl 37f3b20
fix for test added in main
etseidl 3d4e28e
refactor new column index
etseidl 2b85b89
checkpoint...everything but stats converter
etseidl 5ee1b8f
fix bug found in testing
etseidl 624b88b
Merge branch 'new_col_idx' into new_col_idx_full
etseidl d99a06a
stats converter works
etseidl 79a6917
get rid of import
etseidl 878d460
get parquet-index working
etseidl 009632a
doc fixes
etseidl 998ac6c
Merge branch 'offset_idx_speedup' into new_col_idx_full
etseidl a822dfd
move column index to its own module
etseidl 20df075
add ColumnIndexIterators trait, simplify stats converter a little
etseidl 7755b7b
restore comment
etseidl 66ed8bc
Merge branch 'new_col_idx' into new_col_idx_full
etseidl f6c5738
further rework...allow for fallback to slow decoder
etseidl 3733b86
Merge branch 'offset_idx_speedup' into new_col_idx_full
etseidl 09d71e1
refactor a bit
etseidl 1ddaa35
simplify reading of int array
etseidl 006d59d
Merge branch 'offset_idx_speedup' into new_col_idx_full
etseidl c271085
get write working for enum and some unions
etseidl 34cdaf2
make test_roundtrip visible
etseidl c9be570
add test for converted_type, start on logical_type
etseidl a9cd09d
checkpoint struct field writing
etseidl ae65167
get some struct examples and lists working
etseidl 272a013
get rid of copied allow
etseidl 632e171
get writer macros for structs working
etseidl 9f01b60
fix bug in struct macro
etseidl 2511f8f
make Repetition public
etseidl 61e9e07
get union working for writes
etseidl e39f119
add some tests
etseidl def3d07
redo OrderedF64 initialization
etseidl 386f222
unused import
etseidl 7ae2304
Merge branch 'gh5854_thrift_remodel' into write_thrift
etseidl 6beb79d
get decryption working
etseidl 1eaa17b
refactor and clippy fixes
etseidl 713e38a
add page header defs
etseidl 79e8f85
totally rework the input side
etseidl b31c9e6
rework struct field reading
etseidl 8c4e49d
fix skipping bool fields
etseidl e0e1852
remove cruft
etseidl 1ebfdf2
Merge branch 'gh5854_thrift_remodel' into write_thrift
etseidl 366326a
Merge branch 'write_thrift' into read_and_crypto
etseidl 7b8777a
Merge branch 'read_and_crypto' into rework_thrift_reader
etseidl d8081a9
fix clippy issues
etseidl 5d6c8b1
allow unused page header structs
etseidl 709e813
remove Write from WriteThrift
etseidl def1d68
Merge branch 'write_thrift' into read_and_crypto
etseidl 0579456
finish merge
etseidl c1587c4
Merge branch 'read_and_crypto' into rework_thrift_reader
etseidl 04b74f5
stats
etseidl 2250e18
get new page headers working for read and write
etseidl 6af8631
rename page header structs
etseidl 3775222
add some fixmes
etseidl 85f44a5
formatting
etseidl f0e538f
test results differ depending on features
etseidl 763ecd7
error rather than panic on missing required fields
etseidl 734ee9b
add option to read page stats
etseidl 5569757
add comments
etseidl 23636c9
clippy
etseidl 179bb21
switch page header bench to new code
etseidl 4f7bd62
add comment
etseidl 51cf33a
benchmark changes
etseidl b4ca56e
update benchmarks to match thrift-remodel feature branch
etseidl c702a44
add encoding_stats to wide data set
etseidl 0893ec7
clippy
etseidl 689297c
Merge branch 'gh5854_thrift_remodel' into write_thrift
etseidl 7d47857
Merge branch 'write_thrift' into read_and_crypto
etseidl b543838
Merge branch 'read_and_crypto' into rework_thrift_reader
etseidl 99ee049
Merge branch 'rework_thrift_reader' into read_page_header
etseidl f158d72
Merge branch 'update_metadata_bench' into read_page_header
etseidl 56f5c5d
remove dup from merge
etseidl b37029e
checkpoint offset index
etseidl 086d04c
write path for column index
etseidl ecd24de
copy over tests from index
etseidl 1e510bc
remove index module
etseidl 138b0d5
Merge branch 'gh5854_thrift_remodel' into write_thrift
etseidl 5b6c177
Merge branch 'write_thrift' into read_and_crypto
etseidl 88959be
Merge branch 'read_and_crypto' into rework_thrift_reader
etseidl 9fe5a9a
Merge branch 'rework_thrift_reader' into read_page_header
etseidl 52d73e9
Merge branch 'read_page_header' into write_page_indexes
etseidl 29091cd
refactor column index building
etseidl d13463a
checkpoint
etseidl ee810e1
checkpoint encrypt column meta
etseidl 3afcfac
checkpoint...write code finished
etseidl 486d851
checkpoint...almost works
etseidl 3092ede
some test fixes and cleanup
etseidl da66845
more fixes and cleanup
etseidl 9ab7bb0
clippy fixes
etseidl 544eca0
start removing references to format
etseidl 0b33d25
more format cleanup
etseidl 39a9169
remove format references from docs
etseidl 8de96ce
remove format conversion functions
etseidl 683d4e4
remove format::CompressionCodec
etseidl c729d22
Merge remote-tracking branch 'origin/gh5854_thrift_remodel' into writ…
etseidl 96419c4
Merge branch 'write_thrift' into read_and_crypto
etseidl 6ec102f
Merge branch 'read_and_crypto' into rework_thrift_reader
etseidl 976b36d
Merge branch 'rework_thrift_reader' into read_page_header
etseidl ceac418
Merge branch 'read_page_header' into write_page_indexes
etseidl e10e274
Merge branch 'write_page_indexes' into write_file_meta
etseidl ffeaa7a
Merge branch 'write_thrift' into remove_format
etseidl e73a922
remove format from statistics
etseidl f81a732
get a start on some documentation and add some TODOs
etseidl be58ea6
Merge branch 'write_thrift' into read_and_crypto
etseidl 02e5e16
Merge branch 'read_and_crypto' into rework_thrift_reader
etseidl 61aa392
Merge branch 'rework_thrift_reader' into read_page_header
etseidl 428e84c
Merge branch 'read_page_header' into write_page_indexes
etseidl 751b0f1
Merge branch 'write_page_indexes' into write_file_meta
etseidl a1cfbec
Merge branch 'write_file_meta' into remove_format
etseidl 7268dd3
fix docs
etseidl 8305915
Merge branch 'write_thrift' into read_and_crypto
etseidl 4221646
Merge branch 'read_and_crypto' into rework_thrift_reader
etseidl 4342cb5
Merge branch 'rework_thrift_reader' into read_page_header
etseidl ddbeb55
Merge branch 'read_page_header' into write_page_indexes
etseidl 8919c82
Merge branch 'write_page_indexes' into write_file_meta
etseidl 7112088
Merge branch 'write_file_meta' into remove_format
etseidl f0beb0b
Merge branch 'gh5854_thrift_remodel' into read_and_crypto
etseidl b303e52
Merge branch 'read_and_crypto' into rework_thrift_reader
etseidl 2955b85
Merge branch 'rework_thrift_reader' into read_page_header
etseidl 3d33707
Merge branch 'read_page_header' into write_page_indexes
etseidl 45fa0f4
Merge branch 'write_page_indexes' into write_file_meta
etseidl 07157ec
Merge branch 'write_file_meta' into remove_format
etseidl cfa6740
backport fix for tests without encryption
etseidl 6c82028
Merge branch 'read_and_crypto' into rework_thrift_reader
etseidl b16e118
Merge branch 'rework_thrift_reader' into read_page_header
etseidl 1afd866
Merge branch 'read_page_header' into write_page_indexes
etseidl 29ddbc7
Merge branch 'write_page_indexes' into write_file_meta
etseidl 65e42d7
Merge branch 'write_file_meta' into remove_format
etseidl 82f31a4
add documentation
etseidl 608c0f3
Merge branch 'rework_thrift_reader' into read_page_header
etseidl 237ca3d
add docs for ThriftReadInputProtocol
etseidl bdb9aa9
Merge branch 'read_page_header' into write_page_indexes
etseidl 15ed645
Merge branch 'write_page_indexes' into write_file_meta
etseidl 2dcd8d0
Merge branch 'write_file_meta' into remove_format
etseidl 2091e49
move PageEncodingStats to thrift_gen
etseidl 4da5d9e
Merge branch 'gh5854_thrift_remodel' into rework_thrift_reader
etseidl afb4adf
Merge branch 'rework_thrift_reader' into read_page_header
etseidl 9909d0c
Merge branch 'read_page_header' into write_page_indexes
etseidl 418e45c
Merge branch 'write_page_indexes' into write_file_meta
etseidl 023f5d7
Merge branch 'write_file_meta' into remove_format
etseidl e0deed9
move PageEncodingStats to mod.rs since it needs to be public
etseidl 218b42b
cleanup and add some documentation
etseidl 67a82f4
start on documentation of thrift processing
etseidl 1c71b42
more docs
etseidl 49813ea
finish first cut of THRIFT.md
etseidl 5298257
clean up some stale documentation references
etseidl fd63d32
add a todo
etseidl 72ea850
more doc cleanup
etseidl ebae0af
Merge branch 'gh5854_thrift_remodel' into read_page_header
etseidl 7560e70
fix typo
etseidl e94a2de
fix typo
etseidl 1ff8b88
Merge branch 'read_page_header' into write_page_indexes
etseidl b7c64ca
Merge branch 'write_page_indexes' into write_file_meta
etseidl 8d19468
Merge branch 'write_file_meta' into remove_format
etseidl 56a75d6
clean up some imports
etseidl b7a135b
increment shift after test
etseidl 7b549f9
update docs for PageStatistics
etseidl a6ca284
Merge branch 'read_page_header' into write_page_indexes
etseidl dde0770
Merge branch 'write_page_indexes' into write_file_meta
etseidl 344ad12
Merge branch 'write_file_meta' into remove_format
etseidl 943c674
Merge remote-tracking branch 'origin/gh5854_thrift_remodel' into writ…
etseidl b9e97c5
Merge branch 'gh5854_thrift_remodel' into write_page_indexes
etseidl db2115a
Merge branch 'write_page_indexes' into write_file_meta
etseidl c95ff97
Merge branch 'write_file_meta' into remove_format
etseidl 0701d60
backport some doc fixes
etseidl 7fb0e13
Merge branch 'write_page_indexes' into write_file_meta
etseidl e8dde76
Merge branch 'write_file_meta' into remove_format
etseidl 9ebb8b1
fix recently added test
etseidl 4977f2f
fix recently added test
etseidl 2238925
Merge branch 'write_file_meta' into remove_format
etseidl cbf1624
add TODO
etseidl 7ec64a9
Merge branch 'gh5854_thrift_remodel' into write_file_meta
etseidl a87b0a2
forgot to check this in during merge
etseidl 5fd7a8f
Merge branch 'write_file_meta' into remove_format
etseidl 1334370
remove TODO
etseidl 5c5c826
add HeapSize for crypto fields on chunk metadata
etseidl 737f653
Merge branch 'write_file_meta' into remove_format
etseidl facd852
Merge branch 'gh5854_thrift_remodel' into write_file_meta
etseidl 2d789fd
Merge branch 'write_file_meta' into remove_format
etseidl f82fd45
Merge branch 'gh5854_thrift_remodel' into write_file_meta
etseidl 1374686
Merge branch 'write_file_meta' into remove_format
etseidl bd682d1
Merge branch 'gh5854_thrift_remodel' into write_file_meta
etseidl 3b8de59
Merge branch 'write_file_meta' into remove_format
etseidl 1bca0a0
remove unnecessary checks
etseidl 01dc4f3
Merge branch 'write_file_meta' into remove_format
etseidl c3907dc
implement suggestions from review
etseidl 9045533
Merge branch 'gh5854_thrift_remodel' into write_file_meta
etseidl 10427c8
Merge branch 'write_file_meta' into remove_format
etseidl 653fa1a
remove TODO
etseidl 91e3df7
more todos
etseidl 7f03758
Merge branch 'gh5854_thrift_remodel' into remove_format
etseidl 4b8c68b
variant logical type fixes
etseidl 80fc032
remove lint
etseidl 61773a0
remove private APIs from metadata benchmark
etseidl e34d362
merge in changes to benchmark
etseidl a496854
Merge branch 'gh5854_thrift_remodel' into remove_format
etseidl a6a6326
Apply suggestions from code review
etseidl b5651e5
add test of invalid converted type
etseidl ec73f7a
Merge branch 'remove_format' of github.com:etseidl/arrow-rs into remo…
etseidl 282a925
use raw identifier for 'type' in SchemaElement
etseidl 36c1dc1
Merge branch 'gh5854_thrift_remodel' into remove_format
etseidl e623a56
finish merge
etseidl f6be170
fix test
etseidl 8454d50
some doc changes from review comments
etseidl 605292b
Merge branch 'gh5854_thrift_remodel' into remove_format
etseidl cba5d3d
rename more type_ fields as r#type
etseidl e58c955
clean up parquet_thrift
etseidl e6d80f7
make file_path match with/without encryption
etseidl ef5ef6d
clean up some docs
etseidl 0ba2bcb
refactor parser to cluster more encryption specific code
etseidl 70efc43
remove a few allocations
etseidl 26108c0
Merge branch 'gh5854_thrift_remodel' into remove_format
etseidl bb5b688
remove TODO
etseidl 6febae0
merge in changes to geo spatial stats
etseidl 49f3957
Merge branch 'gh5854_thrift_remodel' into merge_geo_spatial
etseidl 793db5b
allow for unknown variants and some doc cleanups
etseidl d8076c2
Merge remote-tracking branch 'origin/gh5854_thrift_remodel' into merg…
etseidl c37bce2
clean up leftover #allow
etseidl File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just pointing out that the order of formatting was switched from crs-algorithm to algorithm-crs. I think this is a beneficial change because the CRS could be a long string.