@dforste the message you describe as "doesn't seem unusual" tells you that the inter-node communication link on that node was continuously overloaded for some period of time.

That can directly affect the metrics that are aggregated across all nodes: some responses do not arrive within the short timeout these operations use, so the UI shows underreported metrics.
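To illustrate the effect, here is a minimal sketch of a scatter-gather aggregation with a timeout. It is not how the management UI is implemented: the `NodeReply` type, the `aggregate` function, the node names, the rates, and the timeout value are all hypothetical and chosen only to show how one late reply lowers the reported total.

```python
from dataclasses import dataclass


@dataclass
class NodeReply:
    node: str             # hypothetical node name
    message_rate: float   # metric value the node would report
    latency_ms: float     # how long its reply takes to arrive


def aggregate(replies, timeout_ms):
    """Sum the metric over replies that arrive within the timeout.

    Returns the aggregated total and the names of nodes whose
    replies were too late to be counted.
    """
    on_time = [r for r in replies if r.latency_ms <= timeout_ms]
    missing = [r.node for r in replies if r.latency_ms > timeout_ms]
    return sum(r.message_rate for r in on_time), missing


# Assumed values for illustration only.
replies = [
    NodeReply("node-a", 1200.0, 15.0),
    NodeReply("node-b", 900.0, 20.0),
    NodeReply("node-c", 1500.0, 4000.0),  # overloaded link, reply is late
]

total, missing = aggregate(replies, timeout_ms=1000.0)
true_total = sum(r.message_rate for r in replies)
print(f"reported={total} actual={true_total} missing={missing}")
```

With these made-up numbers the aggregate reports 2100 while the cluster is actually doing 3600, because the reply from `node-c` misses the timeout and is simply left out of the sum.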

Absent clear evidence for another explanation, that's my conclusion. Perhaps you have periodic processes that run at midnight and publish large messages, or something like that.

Answer selected by michaelklishin