Skip to content

Conversation

@xbattlax
Copy link
Contributor

@xbattlax xbattlax commented Jan 6, 2026

Summary

Respect ArrowReaderOptions in ArrowFileReader::get_metadata() by extracting metadata_options and passing it to ParquetMetaDataReader via with_metadata_options().

Changes

  • Update arrow and parquet dependencies from 57.0 to 57.1
  • Update bindings/python/Cargo.lock for the new versions
  • Extract metadata_options from the ArrowReaderOptions parameter
  • Pass the options to ParquetMetaDataReader::with_metadata_options()
  • Remove the TODO comment since the required API is now available

Notes

The ParquetMetaDataReader::with_metadata_options() API was added in parquet 57.1.0.

Security Audit

The security audit failure (RUSTSEC-2026-0001) is a pre-existing issue unrelated to this PR. It affects the rkyv dependency and is being addressed in #1994.

Closes #1934

@xbattlax xbattlax force-pushed the support-arrow-reader-options branch from 32f3eca to 126011a Compare January 6, 2026 10:52
Comment on lines +45 to +52
arrow-arith = "57.1"
arrow-array = "57.1"
arrow-buffer = "57.1"
arrow-cast = "57.1"
arrow-ord = "57.1"
arrow-schema = "57.1"
arrow-select = "57.1"
arrow-string = "57.1"

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In the associated linked issue: #1934
I see @lgingerich mentioned
so there is no crate dependency upgrade work that needs done

do we still need this pckage upgrade to be done as a part of this change

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, the upgrade to 57.1 is required. The ArrowReaderOptions::metadata_options() method used in this PR was only exposed starting in arrow 57.1 (see apache/arrow-rs#7393). Without the upgrade, the code won't compile.

Respect ArrowReaderOptions by extracting metadata_options and passing
it to ParquetMetaDataReader via with_metadata_options(). This allows
callers to configure metadata decoding behavior.

This change requires parquet 57.1.0 which added the
ParquetMetaDataReader::with_metadata_options() API.

Changes:
- Update arrow and parquet dependencies from 57.0 to 57.1
- Update bindings/python/Cargo.lock for the new versions
- Use metadata_options from ArrowReaderOptions in get_metadata

Closes apache#1934
@xbattlax xbattlax force-pushed the support-arrow-reader-options branch from 65c2f99 to 1f0add9 Compare January 12, 2026 08:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Support ArrowReaderOptions

2 participants