I applied the code to train MountainCar-v0 but failed after 10 million timesteps. The command is as follows.
!python ppo.py --gym-id MountainCar-v0 --total-timesteps 10000000

Are these arguments correct? Or is it very hard to train MountainCar-v0 by PPO?