-
Notifications
You must be signed in to change notification settings - Fork 5
Open
Description
It looks like we drop a message in the logs:
Line 340 in 515edd0
| fmt.Println("AMQP channel closed - has the connection dropped?") |
This resulted in two instances of cloudops-jenkins not responding to hg.m.o events, which halts rolling out changes to the FirefoxCI tc cluster.
We were wondering if we could either
- add louder notifications: a slack alert, email, ?
- auto-recover, whether that's killing pulse-go for a restart, killing the container for a restart, reconnecting to amqp, ? I'm not sure if this would be on the first failure or after
ttime ornfailed attempts or what.
or both.
@petemoore any thoughts?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels