fix: storage options handling in open_table #3807
ACTION NEEDED delta-rs follows the Conventional Commits specification for release automation. The PR title and description are used as the merge commit message. Please update your PR title and description to match the specification. |
Codecov Report
❌ Patch coverage is
Additional details and impacted files
@@             Coverage Diff              @@
##             main    #3807       +/-   ##
===========================================
+ Coverage   18.79%   76.08%   +57.29%
===========================================
  Files          74      145       +71
  Lines       12036    45327    +33291
  Branches    12036    45327    +33291
===========================================
+ Hits         2262    34487    +32225
+ Misses       9588     9148      -440
- Partials      186     1692     +1506
|
Refactor open_table_with_storage_options to remove 'aws.' prefixes from options. Signed-off-by: Dan-J-D <[email protected]>
Signed-off-by: Dan-J-D <[email protected]>
Signed-off-by: Dan-J-D <[email protected]>
This reverts commit 422916f. Signed-off-by: Dan-J-D <[email protected]>
This reverts commit 4794f74. Signed-off-by: Dan-J-D <[email protected]>
I'm a bit confused here, why can't the user just call OPTIONS with something like |
When you just put |
Hey @Dan-J-D - thanks for opening this! As I am right now working on the datafusion table provider, I noticed that we may need to rethink a bit how we integrate with datafusion's resources, specifically object stores. Recently I opened #3810 specifically for the I.e. looking through databricks and datafusion documentation for the CREATE TABLE command, I think this property bag might not be the way we want to integrate credentials. Rather, we need to make the table provider work with the datafusion sessions and have it leverage whatever stores are available on that. Right now there is unfortunately no good way to achieve what we are trying to do here, and it is definitely a more than valid use case! There is more ... the There is some good news - once we leverage the datafusion session to get resources and do some smaller tweaks on the factory, things should just start to work as intended 🤞. |
Yeah, sounds good, I've been expanding it in my local project. Are y'all planning on supporting datafusion-ballista? I have been spending some time getting them to work together properly. |
Some work has been done in the past to support this. IIRC, it's mainly the codecs required to (de-)serialize our scans as datafusion protos. The hope is that a great integration with datafusion is enough to also support ballista. (which might not be totally true). Right now though - unless someone from the community steps up - there is not much capacity to make this work a priority. We would certainly do our best though to address any (specific / scoped) issues raised on this repo that help with that integration. |
Alright, sounds good! I'll close this PR, seeing as there needs to be a bigger rewrite. |
Refactor open_table_with_storage_options to remove 'aws.' prefixes from options.
Description
Fixes an issue where all S3 options are prefixed with 'aws.', so the settings passed via OPTIONS are not applied correctly.
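As a rough illustration of the prefix stripping described above, here is a minimal sketch. It is not the code from this PR: `strip_aws_prefix` is a hypothetical helper, and the option keys in the usage note are only illustrative. It assumes the options arrive as a plain string map and that any key starting with `aws.` should have that prefix removed, while other keys pass through unchanged.

```rust
use std::collections::HashMap;

/// Hypothetical helper (an assumption, not part of delta-rs): returns a copy
/// of `options` in which a leading `aws.` has been removed from every key.
/// Keys without the prefix are copied as-is.
///
/// Note: if both `aws.region` and `region` are present, they collapse to the
/// same key and which value wins is unspecified, because `HashMap` iteration
/// order is not defined.
pub fn strip_aws_prefix(options: &HashMap<String, String>) -> HashMap<String, String> {
    options
        .iter()
        .map(|(key, value)| {
            // `strip_prefix` yields `None` when the key does not start with
            // `aws.`, in which case the original key is kept.
            let stripped = key.strip_prefix("aws.").unwrap_or(key);
            (stripped.to_string(), value.clone())
        })
        .collect()
}
```

With this sketch, a map containing `aws.region` and `endpoint` would come back with the keys `region` and `endpoint`, ready to be handed to whatever function consumes the storage options.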
Ex.
Related Issue(s)
None that I'm aware of.
Documentation
https://datafusion.apache.org/user-guide/cli/datasources.html#s3