(Exchange federation) Intermittent shutdown of federation links #15352

selim1965 · 2026-01-27T08:43:06Z

selim1965
Jan 27, 2026

Describe the bug

Hi, we use celery in a distributed environment with federated RMQ exchanges.

We recently upgraded RabbitMQ from 3.10.25 to 4.2.1 (Erlang 27.3.4.6) and almost every day one particular federated exchange link gets shutdown:

The only way to fix it to restart the downstream RMQ instance.

When I look at the RMQ logs I see this error.

Heartbeat timeout

2026-01-26 12:28:54.581554+00:00 [error] <0.669.0> ** Generic server <0.669.0> terminating
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0> ** Last message in was heartbeat_timeout
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0> ** When Server state == {state,amqp_network_connection,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                             {state,#Port<0.52>,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                 <<"client 10.10.0.5:45012 -> 10.144.136.83:5672">>,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                 10,<0.706.0>,131072,<0.668.0>,undefined,false},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                             <0.705.0>,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                             {amqp_params_network,<<"iris">>,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                 {encrypted,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                     <<"w4mP6JLlcSnkoR03tWKCMog8A1izdqi8YsJ6ASC8cLUG9gkkqLmn59m9cYfi4/OMPvVGA6Aw7CZ0yTkK1g1kxZpA9CSyC2flA8aSM6SgChI=">>},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                 <<"/">>,"pe-catalog-sf-02v",5672,2047,0,10,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                 10000,none,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                 [#Fun<amqp_uri.9.132594875>,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                  #Fun<amqp_uri.9.132594875>],
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                 [{<<"connection_name">>,longstr,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                   <<"Federation link (upstream: pe-catalog-sf-02v, policy: federated-celery-exchanges)">>}],
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                 []},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                             2047,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                             [{<<"capabilities">>,table,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                               [{<<"publisher_confirms">>,bool,true},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                {<<"exchange_exchange_bindings">>,bool,true},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                {<<"basic.nack">>,bool,true},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                {<<"consumer_cancel_notify">>,bool,true},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                {<<"connection.blocked">>,bool,true},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                {<<"consumer_priorities">>,bool,true},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                {<<"authentication_failure_close">>,bool,true},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                {<<"per_consumer_qos">>,bool,true},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                                {<<"direct_reply_to">>,bool,true}]},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                              {<<"cluster_name">>,longstr,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                               <<"catalog_rabbitmq_prod@ilm-sf">>},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                              {<<"copyright">>,longstr,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                               <<"Copyright (c) 2007-2025 Broadcom Inc and/or its subsidiaries">>},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                              {<<"information">>,longstr,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                               <<"Licensed under the MPL 2.0. Website: https://rabbitmq.com">>},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                              {<<"platform">>,longstr,
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                               <<"Erlang/OTP 27.3.4.6">>},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                              {<<"product">>,longstr,<<"RabbitMQ">>},
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                              {<<"version">>,longstr,<<"4.2.1">>}],
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0>                             none,#{},false}
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0> ** Reason for termination ==
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0> ** heartbeat_timeout
2026-01-26 12:28:54.581554+00:00 [error] <0.669.0> 
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>   crasher:
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>     initial call: amqp_gen_connection:init/1
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>     pid: <0.669.0>
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>     registered_name: []
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>     exception exit: heartbeat_timeout
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>       in function  gen_server:handle_common_reply/8 (gen_server.erl, line 2476)
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>     ancestors: [<0.667.0>,amqp_sup,<0.463.0>]
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>     message_queue_len: 0
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>     messages: []
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>     links: [<0.667.0>]
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>     dictionary: [{gen_server_call_timeout,130000},
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>                   {process_name,
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>                       {amqp_gen_connection,
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>                           <<"client 10.10.0.5:45012 -> 10.144.136.83:5672">>}}]
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>     trap_exit: true
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>     status: running
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>     heap_size: 17731
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>     stack_size: 29
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>     reductions: 46326
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0>   neighbours:
2026-01-26 12:28:54.582356+00:00 [error] <0.669.0> 
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>     supervisor: {<0.667.0>,amqp_connection_sup}
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>     errorContext: child_terminated
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>     reason: heartbeat_timeout
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>     offender: [{pid,<0.669.0>},
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                {id,connection},
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                {mfargs,
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                    {amqp_gen_connection,start_link,
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                        [<0.668.0>,
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                         {amqp_params_network,<<"iris">>,
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                             {encrypted,
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                                 <<"w4mP6JLlcSnkoR03tWKCMog8A1izdqi8YsJ6ASC8cLUG9gkkqLmn59m9cYfi4/OMPvVGA6Aw7CZ0yTkK1g1kxZpA9CSyC2flA8aSM6SgChI=">>},
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                             <<"/">>,"pe-catalog-sf-02v",5672,2047,0,10,10000,
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                             none,
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                             [#Fun<amqp_uri.9.132594875>,
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                              #Fun<amqp_uri.9.132594875>],
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                             [{<<"connection_name">>,longstr,
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                               <<"Federation link (upstream: pe-catalog-sf-02v, policy: federated-celery-exchanges)">>}],
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                             []}]}},
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                {restart_type,transient},
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                {significant,true},
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                {shutdown,brutal_kill},
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0>                {child_type,worker}]
2026-01-26 12:28:54.592472+00:00 [error] <0.667.0> 
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>     supervisor: {<0.667.0>,amqp_connection_sup}
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>     errorContext: shutdown
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>     reason: reached_max_restart_intensity
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>     offender: [{pid,<0.669.0>},
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                {id,connection},
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                {mfargs,
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                    {amqp_gen_connection,start_link,
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                        [<0.668.0>,
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                         {amqp_params_network,<<"iris">>,
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                             {encrypted,
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                                 <<"w4mP6JLlcSnkoR03tWKCMog8A1izdqi8YsJ6ASC8cLUG9gkkqLmn59m9cYfi4/OMPvVGA6Aw7CZ0yTkK1g1kxZpA9CSyC2flA8aSM6SgChI=">>},
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                             <<"/">>,"pe-catalog-sf-02v",5672,2047,0,10,10000,
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                             none,
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                             [#Fun<amqp_uri.9.132594875>,
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                              #Fun<amqp_uri.9.132594875>],
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                             [{<<"connection_name">>,longstr,
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                               <<"Federation link (upstream: pe-catalog-sf-02v, policy: federated-celery-exchanges)">>}],
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                             []}]}},
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                {restart_type,transient},
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                {significant,true},
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                {shutdown,brutal_kill},
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0>                {child_type,worker}]
2026-01-26 12:28:54.592814+00:00 [error] <0.667.0> 
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0> ** Generic server <0.703.0> terminating
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0> ** Last message in was heartbeat_timeout
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0> ** When Server state == {state,amqp_network_connection,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                             {state,#Port<0.54>,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                 <<"client 10.10.0.5:45024 -> 10.144.136.83:5672">>,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                 10,<0.710.0>,131072,<0.702.0>,undefined,false},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                             <0.709.0>,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                             {amqp_params_network,<<"iris">>,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                 {encrypted,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                     <<"Qze37mi5xHmDCey+i02mMWrpRDP5JHZeamTqkPnrcsToaLdGTzIUBUGw18VGtRTxrE7NslrPAuy+rpnBVKGsBN3+w7sfj1+t3zWK7AZd+ro=">>},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                 <<"/">>,"pe-catalog-sf-02v",5672,2047,0,10,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                 10000,none,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                 [#Fun<amqp_uri.9.132594875>,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                  #Fun<amqp_uri.9.132594875>],
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                 [{<<"connection_name">>,longstr,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                   <<"Federation link (upstream: pe-catalog-sf-02v, policy: federated-celery-exchanges)">>}],
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                 []},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                             2047,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                             [{<<"capabilities">>,table,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                               [{<<"publisher_confirms">>,bool,true},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                {<<"exchange_exchange_bindings">>,bool,true},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                {<<"basic.nack">>,bool,true},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                {<<"consumer_cancel_notify">>,bool,true},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                {<<"connection.blocked">>,bool,true},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                {<<"consumer_priorities">>,bool,true},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                {<<"authentication_failure_close">>,bool,true},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                {<<"per_consumer_qos">>,bool,true},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                                {<<"direct_reply_to">>,bool,true}]},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                              {<<"cluster_name">>,longstr,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                               <<"catalog_rabbitmq_prod@ilm-sf">>},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                              {<<"copyright">>,longstr,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                               <<"Copyright (c) 2007-2025 Broadcom Inc and/or its subsidiaries">>},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                              {<<"information">>,longstr,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                               <<"Licensed under the MPL 2.0. Website: https://rabbitmq.com">>},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                              {<<"platform">>,longstr,
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                               <<"Erlang/OTP 27.3.4.6">>},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                              {<<"product">>,longstr,<<"RabbitMQ">>},
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                              {<<"version">>,longstr,<<"4.2.1">>}],
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0>                             none,#{},false}
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0> ** Reason for termination ==
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0> ** heartbeat_timeout
2026-01-26 12:28:54.607493+00:00 [error] <0.703.0> 
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>   crasher:
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>     initial call: amqp_gen_connection:init/1
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>     pid: <0.703.0>
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>     registered_name: []
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>     exception exit: heartbeat_timeout
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>       in function  gen_server:handle_common_reply/8 (gen_server.erl, line 2476)
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>     ancestors: [<0.701.0>,amqp_sup,<0.463.0>]
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>     message_queue_len: 0
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>     messages: []
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>     links: [<0.701.0>]
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>     dictionary: [{gen_server_call_timeout,130000},
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>                   {process_name,
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>                       {amqp_gen_connection,
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>                           <<"client 10.10.0.5:45024 -> 10.144.136.83:5672">>}}]
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>     trap_exit: true
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>     status: running
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>     heap_size: 17731
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>     stack_size: 29
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>     reductions: 44957
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0>   neighbours:
2026-01-26 12:28:54.608183+00:00 [error] <0.703.0> 
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>     supervisor: {<0.701.0>,amqp_connection_sup}
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>     errorContext: child_terminated
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>     reason: heartbeat_timeout
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>     offender: [{pid,<0.703.0>},
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                {id,connection},
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                {mfargs,
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                    {amqp_gen_connection,start_link,
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                        [<0.702.0>,
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                         {amqp_params_network,<<"iris">>,
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                             {encrypted,
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                                 <<"Qze37mi5xHmDCey+i02mMWrpRDP5JHZeamTqkPnrcsToaLdGTzIUBUGw18VGtRTxrE7NslrPAuy+rpnBVKGsBN3+w7sfj1+t3zWK7AZd+ro=">>},
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                             <<"/">>,"pe-catalog-sf-02v",5672,2047,0,10,10000,
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                             none,
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                             [#Fun<amqp_uri.9.132594875>,
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                              #Fun<amqp_uri.9.132594875>],
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                             [{<<"connection_name">>,longstr,
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                               <<"Federation link (upstream: pe-catalog-sf-02v, policy: federated-celery-exchanges)">>}],
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                             []}]}},
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                {restart_type,transient},
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                {significant,true},
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                {shutdown,brutal_kill},
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0>                {child_type,worker}]
2026-01-26 12:28:54.608644+00:00 [error] <0.701.0> 
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>     supervisor: {<0.701.0>,amqp_connection_sup}
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>     errorContext: shutdown
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>     reason: reached_max_restart_intensity
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>     offender: [{pid,<0.703.0>},
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                {id,connection},
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                {mfargs,
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                    {amqp_gen_connection,start_link,
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                        [<0.702.0>,
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                         {amqp_params_network,<<"iris">>,
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                             {encrypted,
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                                 <<"Qze37mi5xHmDCey+i02mMWrpRDP5JHZeamTqkPnrcsToaLdGTzIUBUGw18VGtRTxrE7NslrPAuy+rpnBVKGsBN3+w7sfj1+t3zWK7AZd+ro=">>},
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                             <<"/">>,"pe-catalog-sf-02v",5672,2047,0,10,10000,
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                             none,
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                             [#Fun<amqp_uri.9.132594875>,
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                              #Fun<amqp_uri.9.132594875>],
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                             [{<<"connection_name">>,longstr,
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                               <<"Federation link (upstream: pe-catalog-sf-02v, policy: federated-celery-exchanges)">>}],
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                             []}]}},
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                {restart_type,transient},
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                {significant,true},
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                {shutdown,brutal_kill},
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0>                {child_type,worker}]
2026-01-26 12:28:54.608970+00:00 [error] <0.701.0> 
2026-01-26 12:28:54.609333+00:00 [info] <0.684.0> Federation exchange 'celery.pidbox' in vhost '/' disconnected from exchange 'celery.pidbox' in vhost '/' on amqp://pe-catalog-sf-02v
2026-01-26 12:28:54.609333+00:00 [info] <0.684.0> {upstream_channel_down,shutdown}
2026-01-26 12:28:54.609739+00:00 [error] <0.684.0> Federation link could not create a disposable (one-off) channel due to an error error: {badmatch,
2026-01-26 12:28:54.609739+00:00 [error] <0.684.0>                                                                                         {error,
2026-01-26 12:28:54.609739+00:00 [error] <0.684.0>                                                                                          {noproc,
2026-01-26 12:28:54.609739+00:00 [error] <0.684.0>                                                                                           {gen_server,
2026-01-26 12:28:54.609739+00:00 [error] <0.684.0>                                                                                            call,
2026-01-26 12:28:54.609739+00:00 [error] <0.684.0>                                                                                            [<0.703.0>,
2026-01-26 12:28:54.609739+00:00 [error] <0.684.0>                                                                                             {command,
2026-01-26 12:28:54.609739+00:00 [error] <0.684.0>                                                                                              {open_channel,
2026-01-26 12:28:54.609739+00:00 [error] <0.684.0>                                                                                               none,
2026-01-26 12:28:54.609739+00:00 [error] <0.684.0>                                                                                               {amqp_selective_consumer,
2026-01-26 12:28:54.609739+00:00 [error] <0.684.0>                                                                                                []}}},
2026-01-26 12:28:54.609739+00:00 [error] <0.684.0>                                                                                             130000]}}}}
2026-01-26 12:28:54.610022+00:00 [error] <0.684.0> Federation link could not create a disposable (one-off) channel due to an error error: {badmatch,
2026-01-26 12:28:54.610022+00:00 [error] <0.684.0>                                                                                         {error,
2026-01-26 12:28:54.610022+00:00 [error] <0.684.0>                                                                                          {noproc,
2026-01-26 12:28:54.610022+00:00 [error] <0.684.0>                                                                                           {gen_server,
2026-01-26 12:28:54.610022+00:00 [error] <0.684.0>                                                                                            call,
2026-01-26 12:28:54.610022+00:00 [error] <0.684.0>                                                                                            [<0.703.0>,
2026-01-26 12:28:54.610022+00:00 [error] <0.684.0>                                                                                             {command,
2026-01-26 12:28:54.610022+00:00 [error] <0.684.0>                                                                                              {open_channel,
2026-01-26 12:28:54.610022+00:00 [error] <0.684.0>                                                                                               none,
2026-01-26 12:28:54.610022+00:00 [error] <0.684.0>                                                                                               {amqp_selective_consumer,
2026-01-26 12:28:54.610022+00:00 [error] <0.684.0>                                                                                                []}}},
2026-01-26 12:28:54.610022+00:00 [error] <0.684.0>                                                                                             130000]}}}}
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0> ** Generic server <0.752.0> terminating
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0> ** Last message in was heartbeat_timeout
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0> ** When Server state == {state,amqp_network_connection,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                             {state,#Port<0.55>,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                 <<"client 10.10.0.5:45026 -> 10.144.136.83:5672">>,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                 10,<0.824.0>,131072,<0.751.0>,undefined,false},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                             <0.823.0>,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                             {amqp_params_network,<<"iris">>,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                 {encrypted,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                     <<"tpo9epaT8T3cvOYVCBkID7+V/quPPiIPvSau1jkooecwxKYZAgJMAAy0KCZnVsbfXhssKrTEnu3TZFo/Cz0km/DqKnxtBDNTb5DhTQgchK4=">>},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                 <<"/">>,"pe-catalog-sf-02v",5672,2047,0,10,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                 10000,none,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                 [#Fun<amqp_uri.9.132594875>,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                  #Fun<amqp_uri.9.132594875>],
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                 [{<<"connection_name">>,longstr,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                   <<"Federation link (upstream: pe-catalog-sf-02v, policy: federated-celery-exchanges)">>}],
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                 []},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                             2047,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                             [{<<"capabilities">>,table,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                               [{<<"publisher_confirms">>,bool,true},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                {<<"exchange_exchange_bindings">>,bool,true},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                {<<"basic.nack">>,bool,true},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                {<<"consumer_cancel_notify">>,bool,true},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                {<<"connection.blocked">>,bool,true},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                {<<"consumer_priorities">>,bool,true},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                {<<"authentication_failure_close">>,bool,true},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                {<<"per_consumer_qos">>,bool,true},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                                {<<"direct_reply_to">>,bool,true}]},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                              {<<"cluster_name">>,longstr,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                               <<"catalog_rabbitmq_prod@ilm-sf">>},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                              {<<"copyright">>,longstr,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                               <<"Copyright (c) 2007-2025 Broadcom Inc and/or its subsidiaries">>},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                              {<<"information">>,longstr,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                               <<"Licensed under the MPL 2.0. Website: https://rabbitmq.com">>},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                              {<<"platform">>,longstr,
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                               <<"Erlang/OTP 27.3.4.6">>},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                              {<<"product">>,longstr,<<"RabbitMQ">>},
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                              {<<"version">>,longstr,<<"4.2.1">>}],
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0>                             none,#{},false}
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0> ** Reason for termination ==
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0> ** heartbeat_timeout
2026-01-26 12:28:55.599542+00:00 [error] <0.752.0> 
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>   crasher:
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>     initial call: amqp_gen_connection:init/1
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>     pid: <0.752.0>
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>     registered_name: []
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>     exception exit: heartbeat_timeout
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>       in function  gen_server:handle_common_reply/8 (gen_server.erl, line 2476)
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>     ancestors: [<0.750.0>,amqp_sup,<0.463.0>]
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>     message_queue_len: 0
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>     messages: []
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>     links: [<0.750.0>]
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>     dictionary: [{gen_server_call_timeout,130000},
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>                   {process_name,
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>                       {amqp_gen_connection,
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>                           <<"client 10.10.0.5:45026 -> 10.144.136.83:5672">>}}]
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>     trap_exit: true
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>     status: running
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>     heap_size: 17731
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>     stack_size: 29
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>     reductions: 44957
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0>   neighbours:
2026-01-26 12:28:55.600350+00:00 [error] <0.752.0> 
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>     supervisor: {<0.750.0>,amqp_connection_sup}
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>     errorContext: child_terminated
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>     reason: heartbeat_timeout
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>     offender: [{pid,<0.752.0>},
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                {id,connection},
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                {mfargs,
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                    {amqp_gen_connection,start_link,
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                        [<0.751.0>,
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                         {amqp_params_network,<<"iris">>,
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                             {encrypted,
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                                 <<"tpo9epaT8T3cvOYVCBkID7+V/quPPiIPvSau1jkooecwxKYZAgJMAAy0KCZnVsbfXhssKrTEnu3TZFo/Cz0km/DqKnxtBDNTb5DhTQgchK4=">>},
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                             <<"/">>,"pe-catalog-sf-02v",5672,2047,0,10,10000,
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                             none,
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                             [#Fun<amqp_uri.9.132594875>,
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                              #Fun<amqp_uri.9.132594875>],
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                             [{<<"connection_name">>,longstr,
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                               <<"Federation link (upstream: pe-catalog-sf-02v, policy: federated-celery-exchanges)">>}],
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                             []}]}},
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                {restart_type,transient},
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                {significant,true},
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                {shutdown,brutal_kill},
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0>                {child_type,worker}]
2026-01-26 12:28:55.600865+00:00 [error] <0.750.0> 
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>     supervisor: {<0.750.0>,amqp_connection_sup}
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>     errorContext: shutdown
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>     reason: reached_max_restart_intensity
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>     offender: [{pid,<0.752.0>},
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                {id,connection},
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                {mfargs,
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                    {amqp_gen_connection,start_link,
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                        [<0.751.0>,
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                         {amqp_params_network,<<"iris">>,
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                             {encrypted,
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                                 <<"tpo9epaT8T3cvOYVCBkID7+V/quPPiIPvSau1jkooecwxKYZAgJMAAy0KCZnVsbfXhssKrTEnu3TZFo/Cz0km/DqKnxtBDNTb5DhTQgchK4=">>},
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                             <<"/">>,"pe-catalog-sf-02v",5672,2047,0,10,10000,
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                             none,
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                             [#Fun<amqp_uri.9.132594875>,
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                              #Fun<amqp_uri.9.132594875>],
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                             [{<<"connection_name">>,longstr,
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                               <<"Federation link (upstream: pe-catalog-sf-02v, policy: federated-celery-exchanges)">>}],
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                             []}]}},
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                {restart_type,transient},
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                {significant,true},
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                {shutdown,brutal_kill},
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0>                {child_type,worker}]
2026-01-26 12:28:55.601219+00:00 [error] <0.750.0> 
2026-01-26 12:28:55.601763+00:00 [info] <0.730.0> Federation exchange 'celeryev' in vhost '/' disconnected from exchange 'celeryev' in vhost '/' on amqp://pe-catalog-sf-02v
2026-01-26 12:28:55.601763+00:00 [info] <0.730.0> {upstream_channel_down,shutdown}
2026-01-26 12:28:55.602271+00:00 [error] <0.730.0> Federation link could not create a disposable (one-off) channel due to an error error: {badmatch,
2026-01-26 12:28:55.602271+00:00 [error] <0.730.0>                                                                                         {error,
2026-01-26 12:28:55.602271+00:00 [error] <0.730.0>                                                                                          {noproc,
2026-01-26 12:28:55.602271+00:00 [error] <0.730.0>                                                                                           {gen_server,
2026-01-26 12:28:55.602271+00:00 [error] <0.730.0>                                                                                            call,
2026-01-26 12:28:55.602271+00:00 [error] <0.730.0>                                                                                            [<0.752.0>,
2026-01-26 12:28:55.602271+00:00 [error] <0.730.0>                                                                                             {command,
2026-01-26 12:28:55.602271+00:00 [error] <0.730.0>                                                                                              {open_channel,
2026-01-26 12:28:55.602271+00:00 [error] <0.730.0>                                                                                               none,
2026-01-26 12:28:55.602271+00:00 [error] <0.730.0>                                                                                               {amqp_selective_consumer,
2026-01-26 12:28:55.602271+00:00 [error] <0.730.0>                                                                                                []}}},
2026-01-26 12:28:55.602271+00:00 [error] <0.730.0>                                                                                             130000]}}}}
2026-01-26 12:28:55.602506+00:00 [error] <0.730.0> Federation link could not create a disposable (one-off) channel due to an error error: {badmatch,
2026-01-26 12:28:55.602506+00:00 [error] <0.730.0>                                                                                         {error,
2026-01-26 12:28:55.602506+00:00 [error] <0.730.0>                                                                                          {noproc,
2026-01-26 12:28:55.602506+00:00 [error] <0.730.0>                                                                                           {gen_server,
2026-01-26 12:28:55.602506+00:00 [error] <0.730.0>                                                                                            call,
2026-01-26 12:28:55.602506+00:00 [error] <0.730.0>                                                                                            [<0.752.0>,
2026-01-26 12:28:55.602506+00:00 [error] <0.730.0>                                                                                             {command,
2026-01-26 12:28:55.602506+00:00 [error] <0.730.0>                                                                                              {open_channel,
2026-01-26 12:28:55.602506+00:00 [error] <0.730.0>                                                                                               none,
2026-01-26 12:28:55.602506+00:00 [error] <0.730.0>                                                                                               {amqp_selective_consumer,
2026-01-26 12:28:55.602506+00:00 [error] <0.730.0>                                                                                                []}}},
2026-01-26 12:28:55.602506+00:00 [error] <0.730.0>                                                                                             130000]}}}}
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0> ** Generic server <0.777.0> terminating
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0> ** Last message in was heartbeat_timeout
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0> ** When Server state == {state,amqp_network_connection,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                             {state,#Port<0.56>,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                 <<"client 10.10.0.5:45028 -> 10.144.136.83:5672">>,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                 10,<0.828.0>,131072,<0.776.0>,undefined,false},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                             <0.827.0>,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                             {amqp_params_network,<<"iris">>,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                 {encrypted,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                     <<"PtCdf4sObwTEkmenIkjXOqQdrbwl6wSu4XW4YL887M5Lt6UDX/Tn0oS6Fnhll9dtc0b/9hVw1MgZ6XR14F16dHJuxJMP4npYfc0dF1L10ns=">>},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                 <<"/">>,"pe-catalog-sf-02v",5672,2047,0,10,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                 10000,none,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                 [#Fun<amqp_uri.9.132594875>,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                  #Fun<amqp_uri.9.132594875>],
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                 [{<<"connection_name">>,longstr,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                   <<"Federation link (upstream: pe-catalog-sf-02v, policy: federated-catalog-exchanges)">>}],
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                 []},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                             2047,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                             [{<<"capabilities">>,table,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                               [{<<"publisher_confirms">>,bool,true},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                {<<"exchange_exchange_bindings">>,bool,true},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                {<<"basic.nack">>,bool,true},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                {<<"consumer_cancel_notify">>,bool,true},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                {<<"connection.blocked">>,bool,true},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                {<<"consumer_priorities">>,bool,true},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                {<<"authentication_failure_close">>,bool,true},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                {<<"per_consumer_qos">>,bool,true},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                                {<<"direct_reply_to">>,bool,true}]},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                              {<<"cluster_name">>,longstr,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                               <<"catalog_rabbitmq_prod@ilm-sf">>},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                              {<<"copyright">>,longstr,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                               <<"Copyright (c) 2007-2025 Broadcom Inc and/or its subsidiaries">>},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                              {<<"information">>,longstr,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                               <<"Licensed under the MPL 2.0. Website: https://rabbitmq.com">>},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                              {<<"platform">>,longstr,
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                               <<"Erlang/OTP 27.3.4.6">>},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                              {<<"product">>,longstr,<<"RabbitMQ">>},
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                              {<<"version">>,longstr,<<"4.2.1">>}],
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0>                             none,#{},false}
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0> ** Reason for termination ==
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0> ** heartbeat_timeout
2026-01-26 12:28:55.606359+00:00 [error] <0.777.0> 
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>   crasher:
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>     initial call: amqp_gen_connection:init/1
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>     pid: <0.777.0>
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>     registered_name: []
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>     exception exit: heartbeat_timeout
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>       in function  gen_server:handle_common_reply/8 (gen_server.erl, line 2476)
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>     ancestors: [<0.775.0>,amqp_sup,<0.463.0>]
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>     message_queue_len: 0
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>     messages: []
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>     links: [<0.775.0>]
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>     dictionary: [{gen_server_call_timeout,130000},
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>                   {process_name,
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>                       {amqp_gen_connection,
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>                           <<"client 10.10.0.5:45028 -> 10.144.136.83:5672">>}}]
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>     trap_exit: true
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>     status: running
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>     heap_size: 17731
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>     stack_size: 29
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>     reductions: 44966
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0>   neighbours:
2026-01-26 12:28:55.607030+00:00 [error] <0.777.0> 
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>     supervisor: {<0.775.0>,amqp_connection_sup}
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>     errorContext: child_terminated
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>     reason: heartbeat_timeout
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>     offender: [{pid,<0.777.0>},
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                {id,connection},
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                {mfargs,
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                    {amqp_gen_connection,start_link,
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                        [<0.776.0>,
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                         {amqp_params_network,<<"iris">>,
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                             {encrypted,
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                                 <<"PtCdf4sObwTEkmenIkjXOqQdrbwl6wSu4XW4YL887M5Lt6UDX/Tn0oS6Fnhll9dtc0b/9hVw1MgZ6XR14F16dHJuxJMP4npYfc0dF1L10ns=">>},
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                             <<"/">>,"pe-catalog-sf-02v",5672,2047,0,10,10000,
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                             none,
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                             [#Fun<amqp_uri.9.132594875>,
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                              #Fun<amqp_uri.9.132594875>],
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                             [{<<"connection_name">>,longstr,
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                               <<"Federation link (upstream: pe-catalog-sf-02v, policy: federated-catalog-exchanges)">>}],
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                             []}]}},
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                {restart_type,transient},
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                {significant,true},
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                {shutdown,brutal_kill},
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0>                {child_type,worker}]
2026-01-26 12:28:55.607455+00:00 [error] <0.775.0> 
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>     supervisor: {<0.775.0>,amqp_connection_sup}
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>     errorContext: shutdown
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>     reason: reached_max_restart_intensity
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>     offender: [{pid,<0.777.0>},
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                {id,connection},
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                {mfargs,
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                    {amqp_gen_connection,start_link,
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                        [<0.776.0>,
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                         {amqp_params_network,<<"iris">>,
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                             {encrypted,
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                                 <<"PtCdf4sObwTEkmenIkjXOqQdrbwl6wSu4XW4YL887M5Lt6UDX/Tn0oS6Fnhll9dtc0b/9hVw1MgZ6XR14F16dHJuxJMP4npYfc0dF1L10ns=">>},
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                             <<"/">>,"pe-catalog-sf-02v",5672,2047,0,10,10000,
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                             none,
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                             [#Fun<amqp_uri.9.132594875>,
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                              #Fun<amqp_uri.9.132594875>],
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                             [{<<"connection_name">>,longstr,
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                               <<"Federation link (upstream: pe-catalog-sf-02v, policy: federated-catalog-exchanges)">>}],
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                             []}]}},
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                {restart_type,transient},
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                {significant,true},
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                {shutdown,brutal_kill},
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0>                {child_type,worker}]
2026-01-26 12:28:55.607791+00:00 [error] <0.775.0> 
2026-01-26 12:28:55.608322+00:00 [info] <0.754.0> Federation exchange 'catalog' in vhost '/' disconnected from exchange 'catalog' in vhost '/' on amqp://pe-catalog-sf-02v
2026-01-26 12:28:55.608322+00:00 [info] <0.754.0> {upstream_channel_down,shutdown}
2026-01-26 12:28:55.608523+00:00 [error] <0.754.0> Federation link could not create a disposable (one-off) channel due to an error error: {badmatch,
2026-01-26 12:28:55.608523+00:00 [error] <0.754.0>                                                                                         {error,
2026-01-26 12:28:55.608523+00:00 [error] <0.754.0>                                                                                          {noproc,
2026-01-26 12:28:55.608523+00:00 [error] <0.754.0>                                                                                           {gen_server,
2026-01-26 12:28:55.608523+00:00 [error] <0.754.0>                                                                                            call,
2026-01-26 12:28:55.608523+00:00 [error] <0.754.0>                                                                                            [<0.777.0>,
2026-01-26 12:28:55.608523+00:00 [error] <0.754.0>                                                                                             {command,
2026-01-26 12:28:55.608523+00:00 [error] <0.754.0>                                                                                              {open_channel,
2026-01-26 12:28:55.608523+00:00 [error] <0.754.0>                                                                                               none,
2026-01-26 12:28:55.608523+00:00 [error] <0.754.0>                                                                                               {amqp_selective_consumer,
2026-01-26 12:28:55.608523+00:00 [error] <0.754.0>                                                                                                []}}},
2026-01-26 12:28:55.608523+00:00 [error] <0.754.0>                                                                                             130000]}}}}
2026-01-26 12:28:55.608772+00:00 [error] <0.754.0> Federation link could not create a disposable (one-off) channel due to an error error: {badmatch,
2026-01-26 12:28:55.608772+00:00 [error] <0.754.0>                                                                                         {error,
2026-01-26 12:28:55.608772+00:00 [error] <0.754.0>                                                                                          {noproc,
2026-01-26 12:28:55.608772+00:00 [error] <0.754.0>                                                                                           {gen_server,
2026-01-26 12:28:55.608772+00:00 [error] <0.754.0>                                                                                            call,
2026-01-26 12:28:55.608772+00:00 [error] <0.754.0>                                                                                            [<0.777.0>,
2026-01-26 12:28:55.608772+00:00 [error] <0.754.0>                                                                                             {command,
2026-01-26 12:28:55.608772+00:00 [error] <0.754.0>                                                                                              {open_channel,
2026-01-26 12:28:55.608772+00:00 [error] <0.754.0>                                                                                               none,
2026-01-26 12:28:55.608772+00:00 [error] <0.754.0>                                                                                               {amqp_selective_consumer,
2026-01-26 12:28:55.608772+00:00 [error] <0.754.0>                                                                                                []}}},
2026-01-26 12:28:55.608772+00:00 [error] <0.754.0>                                                                                             130000]}}}}

Our code is mostly the same and the federation configuration has not changed.

Answered by michaelklishin

Feb 3, 2026

@selim1965 my earlier recommendation quite explicitly referred to Heartbeats and TCP Proxies.

Per your own words, the hypothesis was correct. If so, what exactly would lead you to believe that going back to higher heartbeat values is a good idea? I won't take "I just want to clarify" for an answer, we are not your "free RabbitMQ DevOps on the Internet".

A heartbeat frame takes less than 20 bytes and with a 10 second timeout, happens every 5 seconds (with correctly implemented clients, or every 10 seconds with others). Both sound acceptable to me.

I cannot rule out other changes or factors but the doc section describes a very specific scenario.

Please take it from here.

View full answer

lukebakken · 2026-01-27T15:21:34Z

lukebakken
Jan 27, 2026
Maintainer

Please familiarize yourself with GitHub's features for formatting comments when you have to provide a lot of text:

Pasting a wall of text, as you did, is lazy. I re-formatted your text for you.

With regard to the federation error, it's pretty clear: heartbeat_timeout

You've provided only this information for us to work with:

You recently upgraded RabbitMQ.
Your code is "mostly the same", whatever that means.
Federation configuration has not changed.

Heartbeat timeouts are usually due to network devices interfering between an AMQP client (your downstream broker) and AMQP server (the upstream).

You should take a look here at our general guidelines for how to report RabbitMQ issues:

https://github.com/rabbitmq/support-tools/blob/main/docs/Reporting_RabbitMQ_Issues.md#general-questions

Hopefully those questions will lead you to a root cause.

0 replies

michaelklishin · 2026-01-27T16:37:29Z

michaelklishin
Jan 27, 2026
Maintainer

@selim1965 those links run into missed heartbeats. Our team cannot help you with those.

I don't see why that failure scenario would be handled differently from the rest. If you can reproduce this behavior with ToxiProxy, we'd be interested in learning more. We cannot tell you what may be preventing that link from restarting and reconnecting, see Troubleshooting Network Connectivity.

False positives from heartbeats are very rare assuming a reasonably high value is used (e.g. not 1-2 seconds, those are guaranteed to produce false positives).

Restarting nodes should not be necessary. To restart federation links, you can (pick one or more options, they do not depend on one another):

Remove or update the policy that enabled federation for the object in question
Disable the federation plugin on the node hosting the links and re-enable it after, say, 20-30 seconds

0 replies

selim1965 · 2026-01-27T18:31:36Z

selim1965
Jan 27, 2026
Author

Thank you @michaelklishin, we'll continue troubleshooting and take your suggestions into account.

The errors indicate a heartbeat_timeout of 130000 was reached. We don't explicitly set that in our code and I haven't seen any reference to that either in RMQ documentation or in Celery documentation. If you have any insight as to where that number is coming from, it will be greatly appreciated.

2 replies

michaelklishin Jan 27, 2026
Maintainer

@selim1965 the Heartbeats guide has been around since 2008 or so. In RabbitMQ and most clients the default is 60 seconds, the rest is explained in the docs.

Celery's underlying client, Kombu, is a client historically plagued by very unorthodox (that's as polite as I can be) decisions so I don't know what kind of a mad heartbeat default it might have.

michaelklishin Jan 27, 2026
Maintainer

If you mean that the value of 130s is non-standard, then I'd agree.

If your links often go for over two minutes without any traffic, my best guess is that an intermediary can be closing those links as "stale", see Heartbeats and TCP Proxies.

130s is so high that I'd argue it no longer serves the purpose of the heartbeats feature. Give 20-30 seconds a try (that means a tiny frame exchange every 10-15 seconds) to prove this hypothesis right or wrong.

selim1965 · 2026-01-28T00:36:02Z

selim1965
Jan 28, 2026
Author

We have our heartbeat set to 10 seconds currently:

amqp://iris:[redacted]@pe-catalog-sf-02v?heartbeat=10&connection_timeout=10000

0 replies

selim1965 · 2026-02-03T01:49:33Z

selim1965
Feb 3, 2026
Author

Just wanted to give an update. We have not seen this happen for the last week so.

Maybe it was an isolated incident since we do have remote studios where we lose connection once in a while.

Do you recommend that we increase the heartbeat timeout from 10 to 30 ?

Thanks
-Selim

1 reply

michaelklishin Feb 3, 2026
Maintainer

@selim1965 my earlier recommendation quite explicitly referred to Heartbeats and TCP Proxies.

Per your own words, the hypothesis was correct. If so, what exactly would lead you to believe that going back to higher heartbeat values is a good idea? I won't take "I just want to clarify" for an answer, we are not your "free RabbitMQ DevOps on the Internet".

A heartbeat frame takes less than 20 bytes and with a 10 second timeout, happens every 5 seconds (with correctly implemented clients, or every 10 seconds with others). Both sound acceptable to me.

I cannot rule out other changes or factors but the doc section describes a very specific scenario.

Please take it from here.

Answer selected by michaelklishin

(Exchange federation) Intermittent shutdown of federation links #15352

Uh oh!

Uh oh!

selim1965 Jan 27, 2026

Describe the bug

Replies: 5 comments · 3 replies

Uh oh!

lukebakken Jan 27, 2026 Maintainer

Uh oh!

michaelklishin Jan 27, 2026 Maintainer

Uh oh!

selim1965 Jan 27, 2026 Author

Uh oh!

michaelklishin Jan 27, 2026 Maintainer

Uh oh!

michaelklishin Jan 27, 2026 Maintainer

Uh oh!

selim1965 Jan 28, 2026 Author

Uh oh!

selim1965 Feb 3, 2026 Author

Uh oh!

Uh oh!

michaelklishin Feb 3, 2026 Maintainer

selim1965
Jan 27, 2026

Replies: 5 comments 3 replies

lukebakken
Jan 27, 2026
Maintainer

michaelklishin
Jan 27, 2026
Maintainer

selim1965
Jan 27, 2026
Author

michaelklishin Jan 27, 2026
Maintainer

michaelklishin Jan 27, 2026
Maintainer

selim1965
Jan 28, 2026
Author

selim1965
Feb 3, 2026
Author

michaelklishin Feb 3, 2026
Maintainer