Skip to content

Are these arguments correct? Or is it very hard to train MountainCar-v0 by PPO? #7

@alanyuwenche

Description

@alanyuwenche

I applied the code to train MountainCar-v0 but failed after 10 million timesteps. The command is as follows.

!python ppo.py --gym-id MountainCar-v0 --total-timesteps 10000000

ret

Are these arguments correct? Or is it very hard to train MountainCar-v0 by PPO?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions