Best Practices for Optimizing BigQuery Partitions via Airbyte Sync #37336
Unanswered
itailulu
asked this question in
Connector Questions
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I'm using Airbyte to sync data daily from Klaviyo to BigQuery. The current setup partitions data by the
airbyte_extracted_at
column. Given that most of our queries are time-specific, this partitioning strategy leads to suboptimal performance and higher costs due to the necessity of reading the entire table.I've been considering manually creating a table with partitioning configured on event timestamps to better align with our querying needs. However, I'm concerned about the potential impacts this might have on the efficiency and cost of the sync process.
Has anyone in the community dealt with similar challenges? Are there recommended approaches or alternative methods, such as moving data to a correctly partitioned table post-sync?
I’m eager to hear about any best practices or insights you might have.
Thanks in advance!
Beta Was this translation helpful? Give feedback.
All reactions