Are these arguments correct? Or is it very hard to train MountainCar-v0 by PPO?

I applied the code to train MountainCar-v0 but failed after 10 million timesteps. The command is as follows.

!python ppo.py --gym-id MountainCar-v0 --total-timesteps 10000000

![ret](https://github.com/vwxyzjn/ppo-implementation-details/assets/56531349/8c9156cc-977a-43f7-bb32-66005612f98c)

Are these arguments correct? Or is it very hard to train MountainCar-v0 by PPO?