Skip to content

Conversation

@borchero
Copy link
Member

Motivation

In #57, we added support for serializing schemas to JSON. This PR now adds utility functions to serialize the schema as parquet metadata and leverage that schema when reading files.

Changes

  • Add {read,write,scan,sink}_parquet methods to the Schema class

@borchero borchero self-assigned this Jun 17, 2025
@github-actions github-actions bot added the enhancement New feature or request label Jun 17, 2025
@codecov
Copy link

codecov bot commented Jun 17, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (74eebee) to head (02b6c16).

Additional details and impacted files
@@            Coverage Diff            @@
##              main       #66   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           41        41           
  Lines         2216      2255   +39     
=========================================
+ Hits          2216      2255   +39     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Copy link
Collaborator

@AndreasAlbertQC AndreasAlbertQC left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Super nice and clear, thanks @borchero !

Copy link
Member

@delsner delsner left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice!

@borchero borchero marked this pull request as draft July 3, 2025 22:01
@borchero
Copy link
Member Author

borchero commented Jul 3, 2025

Sorry, I didn't get to work on this again but I still meant to add proper support for partitioned datasets which also introduce some additional challenges at read-time 👀 I'll mark this as draft for now

@borchero borchero marked this pull request as ready for review July 6, 2025 10:27
@borchero borchero requested a review from AndreasAlbertQC July 6, 2025 10:27
@borchero borchero enabled auto-merge (squash) July 8, 2025 14:17
@borchero borchero merged commit f3cb607 into main Jul 8, 2025
18 checks passed
@borchero borchero deleted the read-write branch July 8, 2025 14:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants