Skip to content

RelEnt uses trajectories of varying length #40

@maxmdaniel

Description

@maxmdaniel

Our implementation of RelEnt currently works with trajectories of varying length. (This is because we rely on our collect_trajs util, which returns when an episode ends.)

By contrast, the RelEnt paper does all calculations under the assumption of a fixed trajectory length.

I'm not sure if this is problematic, but open this issue lest we forget to look into this.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions