Replies: 1 comment 1 reply
-
Hello, to guarantee determinism, the simulation should be stopped and restarted for each trajectory. In addition, we are aware that spawning environments away from the origin, which is often the case when running with parallel environments, can cause floating point precision errors in simulation that eventually propagate to larger errors. In this case, it is recommended to spawn all environments at the world origin. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Describe the bug
I'm using IsaacLab as the dynamics function in an MPC framework but encountered issues with trajectory replay.$s_1, a_1, …, s_H, a_H$ . When I reset the environment to state $s_{10}$ and execute the actions $a_{10}, …, a_H$ , the resulting trajectory deviates from the expected states $s_{11}, …, s_H$ . However, when starting from $s_1$ and replaying the entire trajectory, the results are as expected. This issue doesn't occur in the Cartpole environment.
In the Ant environment, I rolled out the trajectory for states and actions:
Additional experiments:
Steps to reproduce
Copy and paste code in
.txt
to.py
: test_ant.txtSystem Info
Describe the characteristic of your environment:
Checklist
Acceptance Criteria
Add the criteria for which this task is considered done. If not known at issue creation time, you can add this once the issue is assigned.
Beta Was this translation helpful? Give feedback.
All reactions