Shovels: fix shovel status and deletion of failed shovels #14637

dcorbacho · 2025-09-29T10:17:08Z

These shovels are stuck in a restart loop and need to be listed on shovel status, which also allows for its deletion

mkuratczyk · 2025-09-29T11:16:31Z

One of two scenarios seems to be resolved.

Scenario 1 (seems solved):
I declared many shovels with an invalid src-uri:

for i in (seq 100); rabbitmqctl set_parameter shovel myshovel-$i '{"src-protocol": "amqp091", "src-uri": "amqp://foo", "src-queue": "q1", "dest-protocol": "amqp10", "dest-uri": "amqp://localhost", "dest-address": "/queues/q2"}'; end;

and delete them shortly after

for i in (seq 100); rabbitmqctl delete_shovel myshovel-$i; end

This was not deterministic but usually afterwards I'd have 1 shovel listed on Shovel status page, even though it was "deleted" (the delete_shovel command succeeded). I can no longer reproduce this.

Scenario 2 (still failing):
Declare a shovel with a non-existent dest-address:

rabbitmqctl set_parameter shovel myshovel '{"src-protocol": "amqp091", "src-uri": "amqp://localhost", "src-queue": "q1", "dest-protocol": "amqp10", "dest-uri": "amqp://localhost", "dest-address": "/queues/q2"}'

Such a shovel is failing with:

{outbound_link_detached,{'v1_0.error',{symbol,<<"amqp:not-found">>},
                                      {utf8,<<"no queue 'q2' in vhost '/'">>},
                                      undefined}}

Now try to delete it:

$ rabbitmqctl delete_shovel myshovel
Deleting shovel myshovel in vhost /
Stack trace: 

** (FunctionClauseError) no function clause matching in :proplists.get_value/3
    (stdlib 7.1) proplists.erl:222: :proplists.get_value(:node, {:outbound_link_detached, {:"v1_0.error", {:symbol, "amqp:not-found"}, {:utf8, "no queue 'q2' in vhost '/'"}, :undefined}}, :rabbit@K6L59PF0JR)
    (rabbitmq_shovel 4.2.0+beta.4.10.g9f39f60.dirty) Elixir.RabbitMQ.CLI.Ctl.Commands.DeleteShovelCommand.erl:84: RabbitMQ.CLI.Ctl.Commands.DeleteShovelCommand.run/2
    (rabbitmqctl 4.2.0+beta.4.9.ga09383d.dirty) lib/rabbitmqctl.ex:185: RabbitMQCtl.maybe_run_command/3
    (rabbitmqctl 4.2.0+beta.4.9.ga09383d.dirty) lib/rabbitmqctl.ex:153: anonymous fn/5 in RabbitMQCtl.do_exec_parsed_command/5
    (rabbitmqctl 4.2.0+beta.4.9.ga09383d.dirty) lib/rabbitmqctl.ex:653: RabbitMQCtl.maybe_with_distribution/3
    (rabbitmqctl 4.2.0+beta.4.9.ga09383d.dirty) lib/rabbitmqctl.ex:118: RabbitMQCtl.exec_command/2
    (rabbitmqctl 4.2.0+beta.4.9.ga09383d.dirty) lib/rabbitmqctl.ex:52: RabbitMQCtl.main1/1
    (elixir 1.18.4) lib/kernel/cli.ex:137: anonymous fn/3 in Kernel.CLI.exec_fun/2

Error:
:function_clause

mkuratczyk · 2025-09-29T11:46:25Z

one more scenario I now tried: a happily running shovel can be deleted from any node (rabbitmqctl -n value). However, a shovel with an invalid URL, even with this branch, requires targeting a specific node:

make start-cluster

# declare a shovel running on rabbit-1, the URL is not reachable
rabbitmqctl -n rabbit-1 set_parameter shovel myshovel-1 '{"src-protocol": "amqp091", "src-uri": "amqp://localhost", "src-queue": "q1", "dest-protocol": "amqp10", "dest-uri": "amqp://foo", "dest-address": "/queues/q2"}'

# declare a shovel running on rabbit-2, the URL is not reachable
rabbitmqctl -n rabbit-2 set_parameter shovel myshovel-2 '{"src-protocol": "amqp091", "src-uri": "amqp://localhost", "src-queue": "q1", "dest-protocol": "amqp10", "dest-uri": "amqp://foo", "dest-address": "/queues/q2"}'

# delete shovel on rabbit-1 from rabbit-1 (works)
rabbitmqctl -n rabbit-1 delete_shovel myshovel-1

# delete shovel on rabbit-2 from rabbit-1 (doesn't work)
rabbitmqctl -n rabbit-1 delete_shovel myshovel-2
Deleting shovel myshovel-2 in vhost /
Error:
Shovel with the given name was not found on the target node 'rabbit-1@K6L59PF0JR' and/or virtual host '/'. It may be failing to connect and report its state, will delete its runtime parameter...

michaelklishin · 2025-09-29T15:13:39Z

Just FTR, distributed shovels in Tanzu RabbitMQ should deal with that scenario well @mkuratczyk :)

dcorbacho · 2025-09-29T19:44:37Z

deps/rabbitmq_shovel/src/rabbit_shovel_status.erl

           %% terminated
           ({Name, Type, {terminated, Reason}, Metrics, Timestamp}) ->
-             {Name, Type, {terminated, Reason}, Metrics, Timestamp};
+             {Name, Type, {terminated, [{node, Node}], Reason}, Metrics, Timestamp};


@michaelklishin Is this a breaking change? It seems just delete/restart CLI commands use cluster_status_with_nodes

Fixes deletion and restart

Shovels: fix shovel status and deletion of failed shovels (backport #14637)

Shovels: fix shovel status and deletion of failed shovels

9f39f60

These shovels are stuck in a restart loop and need to be listed on shovel status, which also allows for its deletion

dcorbacho requested a review from mkuratczyk September 29, 2025 10:17

Shovels: more detailed error message

99fea41

Shovel: fix deletion of terminated shovels

7f1ceb2

dcorbacho force-pushed the issue-14623 branch from f8d7977 to 52b46c8 Compare September 29, 2025 19:43

dcorbacho commented Sep 29, 2025

View reviewed changes

Shovels: return hosting node in terminated shovel status

13201b2

Fixes deletion and restart

dcorbacho force-pushed the issue-14623 branch from 52b46c8 to 13201b2 Compare September 30, 2025 06:26

Shovels: tests for deletion of failed shovels

33a6a20

dcorbacho marked this pull request as ready for review September 30, 2025 08:22

Shovels: make changes to shovel status backward compatible

c9697a6

dcorbacho force-pushed the issue-14623 branch from e71db0b to c9697a6 Compare September 30, 2025 15:54

michaelklishin added the backport-v4.2.x label Sep 30, 2025

michaelklishin added this to the 4.3.0 milestone Sep 30, 2025

michaelklishin merged commit 270c43f into main Sep 30, 2025
285 checks passed

michaelklishin deleted the issue-14623 branch September 30, 2025 16:25

mergify bot mentioned this pull request Sep 30, 2025

Shovels: fix shovel status and deletion of failed shovels (backport #14637) #14649

Merged

michaelklishin added a commit that referenced this pull request Sep 30, 2025

Merge pull request #14649 from rabbitmq/mergify/bp/v4.2.x/pr-14637

22b216c

Shovels: fix shovel status and deletion of failed shovels (backport #14637)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Shovels: fix shovel status and deletion of failed shovels #14637

Shovels: fix shovel status and deletion of failed shovels #14637

Uh oh!

dcorbacho commented Sep 29, 2025

Uh oh!

mkuratczyk commented Sep 29, 2025

Uh oh!

mkuratczyk commented Sep 29, 2025

Uh oh!

michaelklishin commented Sep 29, 2025

Uh oh!

dcorbacho Sep 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Shovels: fix shovel status and deletion of failed shovels #14637

Shovels: fix shovel status and deletion of failed shovels #14637

Uh oh!

Conversation

dcorbacho commented Sep 29, 2025

Uh oh!

mkuratczyk commented Sep 29, 2025

Uh oh!

mkuratczyk commented Sep 29, 2025

Uh oh!

michaelklishin commented Sep 29, 2025

Uh oh!

dcorbacho Sep 29, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants