Skip to content
This repository was archived by the owner on Sep 26, 2023. It is now read-only.

cross-batch-dedup does not handle Dynamodb rate-limit errors #164

@m-wynn

Description

@m-wynn

I'm running snowplow-snowflake-loader 0.9.0 over a few months of data to load it into snowflake.

Unfortunately, dynamodb is throttling a lot of the requests, whether due to autoscaling lag or hitting the max autoscaling configuration. Now I have a lot of events in my "bad" folder with the message:

The level of configured provisioned throughput for the table was exceeded. Consider increasing your provisioning level with the UpdateTable API(Service: AmazonDynamoDBv2; Status Code: 400; Error Code: ProvisionedThroughputExceededException;...

Here I have two questions:

  1. Can the cross-batch-dedup function retry with a small backoff when receiving this specific error
  2. How can I recover these events without deleting the destination data and running the whole transform again.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions