-
Notifications
You must be signed in to change notification settings - Fork 4k
GH-44345: [C++][Parquet] Fully support arrow decimal32/64 in Parquet #45351
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from 16 commits
cba31c7
a9398a2
e1dc023
6032b02
290de24
e5b996e
44f1adc
c017323
63d307b
77dd7d3
d81cf13
424472f
d1687a7
1f0fb7b
52711d5
f64d6d9
3fb307e
f279349
8a78c72
29e98ff
d2e1ffa
a8304f3
de295e3
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Large diffs are not rendered by default.
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -303,6 +303,9 @@ class PARQUET_EXPORT FileReader { | |
| /// Set number of records to read per batch for the RecordBatchReader. | ||
| virtual void set_batch_size(int64_t batch_size) = 0; | ||
|
|
||
| /// Set whether to enable smallest decimal arrow type | ||
| virtual void set_smallest_decimal_enabled(bool smallest_decimal_enabled) = 0; | ||
|
||
|
|
||
| virtual const ArrowReaderProperties& properties() const = 0; | ||
|
|
||
| virtual const SchemaManifest& manifest() const = 0; | ||
|
|
@@ -403,7 +406,8 @@ ::arrow::Status OpenFile(std::shared_ptr<::arrow::io::RandomAccessFile>, | |
| /// Advanced settings are supported through the FileReaderBuilder class. | ||
| PARQUET_EXPORT | ||
| ::arrow::Result<std::unique_ptr<FileReader>> OpenFile( | ||
| std::shared_ptr<::arrow::io::RandomAccessFile>, ::arrow::MemoryPool* allocator); | ||
| std::shared_ptr<::arrow::io::RandomAccessFile>, ::arrow::MemoryPool* pool, | ||
|
||
| const ArrowReaderProperties& reader_properties = default_arrow_reader_properties()); | ||
|
|
||
| /// @} | ||
|
|
||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Are the changes required to the compute kernels required to support Parquet? I can't see why but I might be missing something. Otherwise, we should move adding support for decimal32 and decimal64 to those compute kernels on a different PR and leave this one only with the required parquet changes.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok, I see now, on the description says this is required for some tests:
Allow decimal32/64 in Arrow compute vector hash which is needed for some of the existing Parquet testsThere was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm down to split this change to another PR which can cover this support with more tests on the arrow compute side. But yes, there are a few tests in Parquet that hit arrow vector kernel code path