[Question] Issues with Multi-Armed RL agent #1852
Unanswered
pmfaustino
asked this question in Q&A
Replies: 2 comments
-
Thanks for posting this. Great work! I'll move this post into our Discussions section for the team to follow up.
-
Hi, did you try penalizing the reward of the biased arm? I mean that both agents receive no reward, or are penalized, whenever the other agent fails to finish its subtasks.
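A minimal sketch of that idea, i.e. a shared "team" reward that only pays out when both arms complete their subtasks (all names and the penalty value below are illustrative, not from the original post):

```python
def shared_reward(left_done_subtask: bool, right_done_subtask: bool,
                  left_shaping: float, right_shaping: float,
                  failure_penalty: float = 1.0) -> float:
    """Hypothetical shared reward: both arms receive the same scalar.

    The per-arm shaping terms are only paid out when both arms finish
    their subtasks; if either arm fails, both are penalized, so the
    policy cannot profit from relying on its favored arm alone.
    """
    if left_done_subtask and right_done_subtask:
        return left_shaping + right_shaping
    return -failure_penalty
```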
-
Hi everyone,
I have successfully trained a direct RL agent for a single-arm robot (Franka Panda) to pick up a cube and lift it to a desired position. However, I’m encountering difficulties when attempting to apply the same approach to a multi-arm robot (ABB Yumi) for the same task.
At first I tried to use the closest arm to pick up the cube, but the agent developed a bias towards one arm. This led to situations where the cube could not be lifted because the closer arm simply ignored it while it was out of reach of the farther arm. I then explored conventional multi-armed bandit strategies such as epsilon-greedy and Thompson sampling for choosing which arm to use, but, despite adjusting various parameters, the agent still consistently favors one arm.
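For context, the arm-selection logic I experimented with was roughly along these lines (the bookkeeping below is a simplified epsilon-greedy sketch with illustrative names, not my exact code):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical running statistics for the two arms (0 = left, 1 = right).
successes = np.zeros(2)
attempts = np.ones(2)  # start at 1 to avoid division by zero

def pick_arm(epsilon: float = 0.1) -> int:
    """Epsilon-greedy arm choice: explore uniformly with probability epsilon,
    otherwise exploit the arm with the higher empirical success rate."""
    if rng.random() < epsilon:
        return int(rng.integers(2))
    return int(np.argmax(successes / attempts))

def update(arm: int, succeeded: bool) -> None:
    """Update the chosen arm's statistics after an episode."""
    attempts[arm] += 1
    successes[arm] += float(succeeded)
```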
Here are the observations I’m using for training (see the sketch after this list):
Joint positions and velocities
Object position
Distance from the left and right grippers to the object
Goal position
Distance from the object to the goal
Actions taken
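A rough sketch of how these terms could be stacked into a single observation vector (function and argument names are illustrative, assuming NumPy arrays as inputs):

```python
import numpy as np

def build_observation(joint_pos, joint_vel, object_pos, left_grip_pos,
                      right_grip_pos, goal_pos, prev_actions):
    """Hypothetical flattening of the listed observation terms into one vector."""
    dist_left = np.linalg.norm(left_grip_pos - object_pos)
    dist_right = np.linalg.norm(right_grip_pos - object_pos)
    dist_goal = np.linalg.norm(object_pos - goal_pos)
    return np.concatenate([
        joint_pos, joint_vel,       # joint positions and velocities
        object_pos,                 # object position
        [dist_left, dist_right],    # gripper-to-object distances
        goal_pos,                   # goal position
        [dist_goal],                # object-to-goal distance
        prev_actions,               # previous actions
    ])
```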
The reward system I’ve designed includes the following components (a rough sketch of how they combine follows the list):
A distance reward, which is inversely proportional to the distance between the gripper and the object (the closer the gripper, the higher the reward).
A lift reward, which is granted when the object is lifted above a minimum threshold.
A goal reward, which is inversely proportional to the distance from the object to the goal (the closer the object is to the goal, the higher the reward).
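Roughly, the combination looks like the following sketch (the weights, lift threshold, and 1/(1+d) shaping used here are placeholders, not my exact formulation):

```python
def compute_reward(grip_to_obj: float, obj_height: float, obj_to_goal: float,
                   lift_threshold: float = 0.04,
                   w_dist: float = 1.0, w_lift: float = 1.0,
                   w_goal: float = 2.0) -> float:
    """Hypothetical composition of the three reward terms described above."""
    # Distance reward: larger when the gripper is closer to the object.
    r_dist = w_dist / (1.0 + grip_to_obj)
    # Lift reward: fixed bonus once the object clears a minimum height.
    r_lift = w_lift if obj_height > lift_threshold else 0.0
    # Goal reward: larger when the object is closer to the goal.
    r_goal = w_goal / (1.0 + obj_to_goal)
    return r_dist + r_lift + r_goal
```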
I’m reaching out to see if anyone has encountered a similar problem or can suggest a different approach that might help solve this issue. Any advice or insights on training multi-arm RL agents effectively would be greatly appreciated!
Thank you in advance for your help!