Description
Describe the bug
I schedule about 150 EC2 instances up and down every day in one VPC in one AWS account. Roughly every 30 days we get this error:
ERROR   : Error handling scheduling request  <...> botocore.exceptions.ClientError: An error occurred (ValidationException) when calling the PutItem operation: Item size has exceeded the maximum allowed size
see scheduler-error.json for a complete log entry.
This stops the scheduler from working for this AWS account. The whole environment that should have been started is not started, which results in an outage.
To fix this I have to manually delete the DynamoDB table item for this AWS account ID in the solution's StateTable. The scheduler then starts working again.
To Reproduce
I have seen this behavior only in two of our AWS accounts where Kubernetes Clusters with a highly volatile number of EC2 instances are deployed. So I am guessing it might have something to do with the constantly changing number of instances and states the scheduler has to track.
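To illustrate the guess above, here is a rough sketch of how an item that keeps one entry per instance ever seen could grow past DynamoDB's 400 KB item limit. The item layout (`instances` as a map of instance ID to state) and the helper names are assumptions made for illustration only; I have not checked how the solution actually stores its state, and the size rules are an approximation of DynamoDB's accounting.

```python
from decimal import Decimal

# DynamoDB's documented maximum item size is 400 KB.
DYNAMODB_MAX_ITEM_BYTES = 400 * 1024


def approx_size(value) -> int:
    """Approximate byte size of one attribute value (not exact accounting)."""
    if isinstance(value, str):
        return len(value.encode("utf-8"))
    if isinstance(value, bool) or value is None:
        return 1
    if isinstance(value, (int, float, Decimal)):
        return 21  # upper bound used here for any number
    if isinstance(value, bytes):
        return len(value)
    if isinstance(value, dict):
        # Assumed overhead: 3 bytes per map plus 1 byte per element.
        return 3 + sum(
            len(k.encode("utf-8")) + approx_size(v) + 1 for k, v in value.items()
        )
    if isinstance(value, (list, set, tuple)):
        return 3 + sum(approx_size(v) + 1 for v in value)
    raise TypeError(f"unsupported type: {type(value).__name__}")


def approx_item_size(item: dict) -> int:
    """Approximate item size: attribute names plus attribute values."""
    return sum(len(name.encode("utf-8")) + approx_size(v) for name, v in item.items())


def fake_state_item(instance_count: int) -> dict:
    """Hypothetical state item: one 'stopped' entry per instance ID."""
    return {"instances": {f"i-{i:017x}": "stopped" for i in range(instance_count)}}


if __name__ == "__main__":
    for count in (150, 5_000, 20_000):
        size = approx_item_size(fake_state_item(count))
        print(count, size, size > DYNAMODB_MAX_ITEM_BYTES)
```

If the item really does accumulate entries for terminated instances, a cluster that cycles through a few hundred new instance IDs per day could plausibly reach the limit after some weeks, which would match the roughly 30-day interval we see. This is only a hypothesis until someone confirms what the state item contains.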
Please complete the following information about the solution:
- Version: v3.0.6
- Region: eu-central-1
- Was the solution modified from the version published on this repository? No
- Have you checked your service quotas for the services this solution uses? Yes. I know DynamoDB limits the item size for PutItem, but the solution should account for that.
- Were there any errors in the CloudWatch Logs? Yes, errors occur in the solution's CloudWatch logs (see attached file).
I would appreciate any help on this. We saw this issue with versions < 3 of the scheduler solution as well, and I had hoped it would be solved in v3, but clearly it is not.