Skip to content

Dynamic sidecar takes sometimes 30 seconds until it is healthy #1293

@sanderegg

Description

@sanderegg

After a long testing session with @YuryHrytsuk we found out that the dynamic-sidecar sometimes takes way longer than it should to be in a healthy state.

On AWS-master this seems to happen every single time:

  • right after an autoscaled node is made ready,
  • dynamic-sidecar starts,
  • fails to connect to the otel-collector due to DNS resolution failing,
  • fails to connect to the storage micro-service due to DNS resolution failing, and this can take up to 30 seconds

In order to circumvent the issue, we need to access storage via internal IP/port so we do not wait for docker network DNS being ready

an issue is that the dy-sidecar fails after 30 attempts to connect to storage, even though there is already a docker healthcheck that allows for 180s startup time (will need to be fixed @GitHK )

Metadata

Metadata

Assignees

Labels

p:high-priot:infra-opsAdjustments to the way or resources with that microservices are run

Type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions