Accompanying code for the paper An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients, submitted to IJCNN 2021.
Install MuJoCo as in https://github.com/openai/mujoco-py?tab=readme-ov-file#install-mujoco
Add to .bashrc
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$HOME/.mujoco/mujoco210/binInstall everything with
bash setup.shpython scripts/episodic/launch_episodic_test_functions.py python scripts/lqr_pg/launch_exp_oracle_pg_lqr.py
python scripts/lqr_pg/launch_exp_oracle_pg_lqr_error.py
python scripts/lqr_pg/launch_exp_oracle_pg_lqr_error_training.py python scripts/off_policy/launch_exp_ddpg.py
python scripts/off_policy/launch_exp_sac.py
python scripts/off_policy/launch_exp_sac_extra_samples.py
python scripts/off_policy/launch_exp_sac_mvd.py
python scripts/off_policy/launch_exp_sac_sf.py
python scripts/off_policy/launch_exp_sac_sf_extra_samples.py
python scripts/off_policy/launch_exp_td3.pypython scripts/on_policy/launch_exp_tree_mvd_lunarlander.py
python scripts/on_policy/launch_exp_tree_mvd_pendulum.py
python scripts/on_policy/launch_exp_tree_mvd_room.py
python scripts/on_policy/launch_exp_trustregion_lunarlander.py
python scripts/on_policy/launch_exp_trustregion_pendulum.py
python scripts/on_policy/launch_exp_trustregion_room.pymkdir out
python plots/plot_test_functions.py
python plots/plot_lqr.py
python plots/plot_lqr_error.py
python plots/plot_lqr_error_training.py
python plots/plot_off_policy.py
python plots/plot_on_policy.pyCheck the plots in the out directory.