feat(rl): add crash_penalty for simulation instability
Add configurable crash_penalty (default -100.0) to AndesEnv that
penalizes the agent when TDS terminates due to stability violations.
This gives the RL agent a strong signal to avoid destabilizing actions.
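
A minimal sketch of how such a penalty might be wired into an environment's step function. It does not use the ANDES API or the actual AndesEnv code: `ToySim`, `ToyEnv`, `advance`, and the reward formula are hypothetical stand-ins. It also assumes the penalty replaces the step reward and ends the episode, which the commit message does not specify.

```python
from dataclasses import dataclass


@dataclass
class ToySim:
    """Hypothetical stand-in for a time-domain simulation (not the ANDES API)."""
    limit: float = 1.0
    state: float = 0.0

    def advance(self, action: float) -> bool:
        """Apply the action; return False if the run aborted on instability."""
        self.state += action
        return abs(self.state) <= self.limit


class ToyEnv:
    """Sketch of an environment with a configurable crash penalty."""

    def __init__(self, sim: ToySim, crash_penalty: float = -100.0):
        self.sim = sim
        self.crash_penalty = crash_penalty

    def step(self, action: float):
        ok = self.sim.advance(action)
        if not ok:
            # Simulation terminated early: end the episode with the penalty.
            # Assumption: the penalty replaces the normal step reward.
            return self.sim.state, self.crash_penalty, True, {"crashed": True}
        # Normal step: illustrative reward that favors staying near zero.
        reward = -abs(self.sim.state)
        return self.sim.state, reward, False, {"crashed": False}
```

Keeping the penalty as a constructor argument lets it be scaled relative to typical per-step rewards, so a crash clearly dominates any reward the agent could collect on the way to it.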