Skip to content

Regarding training a new model from scratch without using a trained pre-trained model #23

@wanshishuns

Description

@wanshishuns

Hello, first of all, it is a great honor for me to see the article published by your team, and I benefited a lot after reading it. When I was learning from your code, I hope that the pre-trained model can not be applied to the training. In addition to modifying the parameters in the config, as for the code part of the frozen ViT model proposed by you, do you need to unfreeze the model before training? Because I have encountered this kind of problem during training, and the hidden_size problem appears in the training terminal, and there is no hidden_size parameter in your config, could you please help me? I will be very grateful.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions