Why is load_from_checkpoint not in-place by default? #6888

MagicFrogSJTU · 2021-04-08T10:15:25Z

Why is the usage

new_model = MyModel.load_from_checkpoint(checkpoint_path="example.ckpt")

instead of

MyModel.load_from_checkpoint(checkpoint_path="example.ckpt")

The former is unfriendly to native PyTorch users, and it cost me an afternoon to track down the bug.

Answered by FlorianMF

FlorianMF · 2021-04-30T09:21:31Z

load_from_checkpoint is a class method: it instantiates the object with the hyper_parameters stored in the checkpoint and then loads the state_dict.
If it worked as in your proposal, my_model.load_from_checkpoint(checkpoint_path="example.ckpt"), similar to PyTorch's load_state_dict, you would first have to create the my_model object yourself with the hyper_parameters parsed from the checkpoint, and only then load the weights. load_from_checkpoint combines both steps.
It might make sense to add a way to load the state_dict from a checkpoint into an already instantiated object without going through a classmethod, something like load_state_dict_from_checkpoint.
The alternative is to load the checkpoint and then call model.load_state_dict(checkpoint['state_dict'], strict=strict).
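
To make the difference concrete, here is a minimal sketch using only the standard library. ToyModel, its save_checkpoint helper, and the pickle-based file format are made up for illustration and are not Lightning's actual implementation; only the checkpoint keys hyper_parameters and state_dict mirror what is described above. It shows that the class method returns a new object, that calling it on an instance and discarding the result changes nothing, and that load_state_dict is the in-place route.

```python
import os
import pickle
import tempfile


class ToyModel:
    """Toy stand-in for a LightningModule (hypothetical, for illustration only)."""

    def __init__(self, hidden_size=1):
        self.hidden_size = hidden_size
        self.weights = [0.0] * hidden_size  # "untrained" weights

    def state_dict(self):
        return {"weights": list(self.weights)}

    def load_state_dict(self, state_dict):
        # Instance method: mutates the existing object in place.
        self.weights = list(state_dict["weights"])

    def save_checkpoint(self, path):
        # Hypothetical helper: store hyperparameters next to the weights.
        checkpoint = {
            "hyper_parameters": {"hidden_size": self.hidden_size},
            "state_dict": self.state_dict(),
        }
        with open(path, "wb") as f:
            pickle.dump(checkpoint, f)

    @classmethod
    def load_from_checkpoint(cls, checkpoint_path):
        # Class method: builds a NEW object from the saved hyperparameters,
        # then loads the weights into it. No existing instance is touched.
        with open(checkpoint_path, "rb") as f:
            checkpoint = pickle.load(f)
        model = cls(**checkpoint["hyper_parameters"])
        model.load_state_dict(checkpoint["state_dict"])
        return model


# Save a "trained" model.
trained = ToyModel(hidden_size=3)
trained.weights = [0.1, 0.2, 0.3]
ckpt_path = os.path.join(tempfile.mkdtemp(), "example.ckpt")
trained.save_checkpoint(ckpt_path)

# Pitfall: calling the class method on an instance and discarding the result.
fresh = ToyModel(hidden_size=3)
fresh.load_from_checkpoint(ckpt_path)  # return value is thrown away
pitfall_weights = list(fresh.weights)  # still the untrained zeros

# Correct use of the class method: keep the returned object.
restored = ToyModel.load_from_checkpoint(ckpt_path)

# In-place alternative: read the checkpoint, then load its state_dict.
with open(ckpt_path, "rb") as f:
    checkpoint = pickle.load(f)
fresh.load_state_dict(checkpoint["state_dict"])
```

With a real LightningModule the same pattern applies, except that the checkpoint would be read with torch.load and the call would be model.load_state_dict(checkpoint['state_dict'], strict=strict) as mentioned above.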

1 reply

MagicFrogSJTU Apr 30, 2021
Author

Yeah, I later found out that it is a class method. Thanks for the details!
