You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Context
We use object storage for runs with large payloads or outputs. Currently we don't retry failures when importing payload/output packets from the storage bucket, i.e., transient issues in the object storage service can cause entire runs to fail.
Proposed solution
Add a simple retry mechanism to avoid run disruptions from transient network and object storage errors.