Connections terminate after running into a file_handle_cache exception (which terminates them) #8776
-
Describe the bug: Several times, when trying to open a WebSocket connection over the Web MQTT plugin, I get this error (the authentication backend used is rabbitmq_auth_backend_http, with rabbitmq_auth_backend_cache active). After some of these errors I need to restart RabbitMQ because of 500 errors.
Reproduction steps:
Expected behavior: The connection should not crash.
Additional context: The JS library used is Paho.
-
What runs into an exception is a function that maintains file handle metrics. It tries to update a counter that does not exist. So this is not something MQTT-specific: any connection to this node will fail. What version of RabbitMQ is used? Can you share full logs? There may be an error logged earlier that would provide a clue as to what the root cause is.
-
Also, I'm not sure how steps 1 and 3 are different. Can you please share an executable way to reproduce (a public repository on GitHub, or at least an archive with some code)? There is an example WebSockets-over-MQTT page shipped with the examples plugin. Does said example connect successfully?
-
Also, can this be RabbitMQ 3.11.x running on Erlang 26? That is not a supported combination and the symptoms are very similar: all client connections run into exceptions that do not seem related.
-
I confirmed that http://localhost:15670/web-mqtt-examples/bunny.html works as expected on
-
I am sending the log of the other cluster node. That node is not accessible via the web management UI.
-
According to the logs on
and then a warning (could be entirely unrelated) when the node is asked to shut down:
This churn happens for a couple of days and then all connections start failing with
My best guess from just these logs is that the file handle cache somehow does not deal well with constant high connection churn, in particular the kind of churn where clients do not cleanly close connections. Note the average connection lifespan:
-
On node
-
@fernandomacho I do not understand what may be happening with the metric update. That code path can be made more defensive so that it does not throw (the metric is non-essential), but my hypothesis is that it is caused by excessive connection churn from your clients. Please eliminate the churn and use long-lived connections or this proxy. When/if I find a way to inspect the relevant metrics store, I will share a
-
@fernandomacho can you please run
and share the output every hour or so when the cluster accepts connections, and once or twice (say, every few minutes) when it does not?
-
I have had to reboot two cluster nodes. The output of the command you asked for, on the node that I have not had to restart (innova2), is: and on the two restarted nodes, ovh-innova: and innova3:
-
seems to be the most relevant. I am not sure if this is due to node termination; likely not.
-
OK, I will set the log level to info and try to reproduce.
-
So yeah, I have enough evidence that this scenario can only be triggered by high connection churn. Here is what happens:
So far so good. Now, concurrently with that, another connection is opened and
In other words, for this scenario to happen you need one very short-lived connection and another very short-lived connection to get "assigned" the same Erlang process ("green thread") ID; then two independent metric table updates can step over one another. Getting rid of high connection churn should help. Thank you for providing the logs, @fernandomacho.
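To make the failure mode concrete, here is a minimal Erlang sketch, assuming an ETS-backed per-connection counter keyed by pid; the module and table names are made up for illustration and this is not the actual file_handle_cache code:

```erlang
-module(fhc_counter_sketch).
-export([demo/0]).

%% Per-connection metrics live in an ETS row keyed by the connection's pid.
%% If a second, short-lived connection ends up with a reused pid, its cleanup
%% can delete the row while the first connection still expects it to exist.
demo() ->
    T = ets:new(conn_metrics, [set, public]),
    Key = self(),
    true = ets:insert(T, {Key, 0}),
    %% Normal case: the row exists, so the counter update succeeds.
    1 = ets:update_counter(T, Key, {2, 1}),
    %% Simulated race: the "other" connection with the same pid removes the row.
    true = ets:delete(T, Key),
    %% The next unguarded update throws badarg and crashes the calling process,
    %% which is what terminates the connection in the reported scenario.
    ets:update_counter(T, Key, {2, 1}).
```

In the real system the delete and the update come from two different connections rather than one function, but the effect on the unguarded ets:update_counter/3 call is the same.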
-
Do you think it could be a problem related to migrating queues to quorum queues, rather than to using WebSockets?
-
@lhoguin another thing to investigate would be this: assuming that the FHC process fails and is restarted, what would its peak restart rate be? With connection churn above a certain level its
Alternatively we can consider dropping
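For context, and purely as a generic OTP illustration (not RabbitMQ's actual supervision tree; all names here are made up), restart intensity is what bounds that peak restart rate: a supervisor tolerates only a limited number of child restarts within a time window before it gives up itself:

```erlang
-module(restart_rate_sketch).
-behaviour(supervisor).
-export([start_link/0, init/1, start_worker/0]).

%% Hypothetical stand-in for the FHC process: a worker that just waits.
start_worker() ->
    {ok, spawn_link(fun() -> receive stop -> ok end end)}.

start_link() ->
    supervisor:start_link({local, ?MODULE}, ?MODULE, []).

%% An OTP supervisor tolerates at most `intensity` child restarts within
%% `period` seconds; if a child crashed faster than that under heavy churn,
%% the supervisor itself would exit and the failure would propagate upward.
init([]) ->
    SupFlags = #{strategy => one_for_one, intensity => 5, period => 10},
    Child = #{id => fhc_like_worker,
              start => {?MODULE, start_worker, []},
              restart => permanent},
    {ok, {SupFlags, [Child]}}.
```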
-
In the meantime, here is what I have: #8790.
-
I will publish a GitHub release shortly; it will include a .deb package that can be downloaded. In it, you can disable FHC for the Web MQTT plugin:
web_mqtt.enable_file_handle_cache = false
and all relevant FHC operations that update metric counters are now exception-safe.
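As a rough illustration of what "exception-safe" could look like here (a hypothetical helper, not the actual change shipped in #8790), the counter update can be wrapped so that a missing row is skipped instead of crashing the connection process:

```erlang
-module(fhc_safe_counter_sketch).
-export([safe_update_counter/3]).

%% Hypothetical defensive wrapper: if the metrics row is already gone (for
%% example, deleted by a concurrent connection with a reused pid), swallow the
%% badarg instead of letting it take the caller down.
safe_update_counter(Table, Key, UpdateOp) ->
    try
        ets:update_counter(Table, Key, UpdateOp)
    catch
        error:badarg -> ok
    end.
```

Swallowing badarg is acceptable only because the counter is purely informational; anything load-bearing would need to repair or re-create the row instead.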
-
We are not getting any closer to having a way to reproduce, so I am out of ideas as to what the root cause may be and what else would help. Without a way to describe (e.g. with a traffic capture) the workload and relevant information collected from all nodes, I conclude that this is a bag of different aspects:
-
3.12.2-beta.1 is up on GitHub. Here is a direct link to the .deb file in case you'd prefer to install it via dpkg -i (temporarily), in addition to the Cloudsmith repo for preview releases.
-
In addition to #8790
Node configuration and state
Much of this is provided by this script that VMware support and the RabbitMQ core team use,
Workload description
More importantly now, we need to understand what your clients do, so any code or a reasonably
Ideally, if it is enough to use only AMQP 0-9-1 clients, a set of PerfTest flags that would roughly simulate your
For MQTT or Web MQTT connections, there are tools from Mosquitto and EMQX that help simulate workloads.
Alternatively you can take a traffic capture with
How to send collected data privately
This information can be sensitive, e.g. virtual host names, queue and stream names, logs, etc. Feel free to send this as a single archive to the address our team uses for security disclosures and private communication.
-
Hello, I will install the version you have released first thing tomorrow morning. I will try to explain the use:
2.- I am migrating part of the API functionality to WebSockets, which is when the problem appeared. The clients (browsers), using the Paho library for JavaScript, open a WebSocket connection to RabbitMQ (beforehand there is a process that creates a username and password which is then used to connect). RabbitMQ validates the user through the rabbitmq_auth_backend_http plugin, using the authentication cache provided by the rabbitmq_auth_backend_cache plugin.
3.- Consumers of messages: these processes open the connection only once and in principle it remains open, since the message "wait" process itself prevents the socket from being closed.
Right now each server (there are three) can have between 3000 and 4000 open connections. I understand that is a lot, but it should not be a problem by far. Right now everything, both the API and the consumers, is configured to use amqproxy. I am sending you some pictures with stats. Note that despite using amqproxy the number of opened and closed connections (churn) is very similar.
The amqproxy configuration is:
[listen]
and the RabbitMQ configuration is:
mqtt.vhost = websocks
auth_backends.1 = internal
log.default.level = critical
-
@fernandomacho we have put together a workload simulation that has this connection churn rate with QQs. No issues so far. One thing that stands out to us is that on your chart, the closed connection rate is 0. Can you tell us (or share some code) about how the connections are closed? If they are never closed,
-
Hi, first of all I would like to thank you for how you have handled this problem and how you have helped me. The truth is that I was getting a bit desperate about this issue. Right now it has been running smoothly over WebSockets since this morning (in a user-controlled environment); tomorrow I'll add more users over WebSockets and give you feedback. I will now generate the files for the three servers and send them by mail to the address you gave me, along with the definitions.
-
Without any extra feedback, I will assume that #8790 did address the issue as first reported. We also have some data to dig into, although our initial attempts to reproduce using comparable churn rates failed to make the exception manifest itself.
-
Thanks!
On 13 Jul 2023, at 22:41, Michael Klishin wrote:
We will ship 3.12.2 next Monday. Thanks for confirming!