Replies: 7 comments
-
This is the output of the last iteration
|
Beta Was this translation helpful? Give feedback.
-
Using CoM may require longer training. Let's disable CoM reward for now, until it is fixed. |
Beta Was this translation helpful? Give feedback.
-
Yes, that's the behavior I'm getting, table values that are lower even though "total_reward" is around 600 and the sim character seems to follow the reference COM for longer (as in, the character doesn't veer leftwards for some episodes) |
Beta Was this translation helpful? Give feedback.
-
Using
seems to walk just fine though, without jitter. Isn't the walk args policy using the CoM reward? |
Beta Was this translation helpful? Give feedback.
-
That args file is loading the checkpoint of the policy trained without the CoM reward. If you override the
|
Beta Was this translation helpful? Give feedback.
-
Thanks. How can you disable the CoM reward? Do we have to manually edit the file humanoid_stable_pd.py? |
Beta Was this translation helpful? Give feedback.
-
The argument |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I tried to run the motion file walk using the checkpoint
humanoid3d_walk_COMenabled.ckpt
, but the character walks in an odd way and the movements are jittery. You can see the result in this video.I've also tried to re-download the checkpoint and/or retrain the policy, but the results are the same. The total reward is around 600, but the reward shown in the table is around 60/70.
Do you know which might be the issue? @ManifoldFR @erwincoumans
Beta Was this translation helpful? Give feedback.
All reactions