I'm running snowplow-snowflake-loader 0.9.0 over a few months of data to load it into snowflake.
Unfortunately, dynamodb is throttling a lot of the requests, whether due to autoscaling lag or hitting the max autoscaling configuration. Now I have a lot of events in my "bad" folder with the message:
The level of configured provisioned throughput for the table was exceeded. Consider increasing your provisioning level with the UpdateTable API(Service: AmazonDynamoDBv2; Status Code: 400; Error Code: ProvisionedThroughputExceededException;...
Here I have two questions:
- Can the cross-batch-dedup function retry with a small backoff when receiving this specific error
- How can I recover these events without deleting the destination data and running the whole transform again.