Skip to content

Quality of VMΒ #24

@RewindL

Description

@RewindL

Thanks for your impressive work.

I have trained mistral VM following PRM/train_VM_mistral.py and use it to guide ToT evaluation in evaluate.py. But after training 2 epochs following recommended setting, the test accuracy is only 0.1582. And it outputs unreliable scores to tree nodes.

Since the default depth and branch is limited and exploring nodes are ranked by values, an relatively accurate score seems necessary. So I wonder if this is a normal situation, and how do you handle this problem?

Looking forward for your help, thanks.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions