Skip to content

[Question] atari/montezumarevenge/expert-v0 only contains 0 returns trajectories #316

@Seraphli

Description

@Seraphli

Question

I notice the dataset about Montezuma's revenge only contains 0 episodic reward trajectories. The document says the collecting script is using PPO Impala. Should this game use CleanRL PPO + RND to collect dataset?

https://wandb.ai/openrlbenchmark/openrlbenchmark/reports/-MontezumaRevenge-CleanRL-s-PPO-RND--VmlldzoyNTIyNjc5

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions