Skip to content

Training data for BEST_WALK_ONNX_2.onnx #43

@Juliaj

Description

@Juliaj

Hello, I am experimenting with BEST_WALK_ONNX_2.onnx, see example. I'm running Mini Duck in Mujoco simulation.

I've observed some instability in the inference scenario, in which Duck falls down immediately after receiving a command to move forward. I'm investigating whether this is due to observation distribution shift in my env. Would you be able to provide some training data (observations plus actions) for BEST_WALK_ONNX_2.onnx ?

It appears that the model expects 3 previous actions (last_action, last_last_action, last_last_last_action). How should these be initialized with the first command, meaning when Duck stands still? Tips and suggestions are appreciated.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions