Help wanted: explosion of containers after upgrade to 7.9.1 #8780
-
|
Hi! I'll keep this brief. Any help is appreciated. We're running Concourse in Kubernetes with the official Helm chart. It's a small installation, at the moment just two workers. The workers are run as a StatefulSet with persistence using Kubernetes local storage. The worker use the containerd runtime with the overlay driver. In addition to to this brief description, I'm pasting some of (what I believe) to be relevant environment variables from the web pod and a worker pod. After upgrading from 7.5 to 7.9.1 we're seeing much more containers on the workers. On 7.5 we peaked at about 350 containers, but on 7.9.1 it keeps going and is currently hitting the configured maximum of 1000 containers after a couple of hours. As far as I can tell, these are 90 % resource check containers that hang around. Prior to upgrading, the number and frequency of resource checks were similar. So, the obvious question is if this is at all expected. I'm not seeing anything obvious in the logs. I'd be happy to complement with additional information. Here are a few (possibly) relevant environement variables. From the worker: And here are some from the web component: Cheers! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
|
This seems to have been caused by invalid data in the database, which we managed to create as part of the upgrade process. (There were a few backups and restores of the database as part of our upgrade.) Since we're on a tiny installation, starting from scratch was an option for us, and it helped with the explosion of containers. I guess the lesson learned is that one must take care to not mess with Concourse's data. Thank you for a great product, we appreciate your hard work and dedication! |
Beta Was this translation helpful? Give feedback.
This seems to have been caused by invalid data in the database, which we managed to create as part of the upgrade process. (There were a few backups and restores of the database as part of our upgrade.) Since we're on a tiny installation, starting from scratch was an option for us, and it helped with the explosion of containers.
I guess the lesson learned is that one must take care to not mess with Concourse's data.
Thank you for a great product, we appreciate your hard work and dedication!