Skip to content

Commit fa31e38

Browse files
committed
Add Parquet dataset metadata management with serialization and filtering capabilities
- Implemented functions for serializing and deserializing Parquet metadata. - Added methods to collect and manage metadata for Parquet files, including filtering and row group processing. - Introduced a new class `ParquetDatasetMetadata` to encapsulate metadata handling for Parquet datasets. - Enhanced file metadata management with JSON serialization for improved compatibility. - Added support for scanning datasets with filter expressions and generating metadata tables. - Implemented caching mechanisms to optimize metadata access and updates.
1 parent 9106071 commit fa31e38

File tree

4 files changed

+4556
-0
lines changed

4 files changed

+4556
-0
lines changed

0 commit comments

Comments
 (0)