How to make a model independent of the Trainer's strategy during inference? #11243
Unanswered
jakub-h asked this question in Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Replies: 0
I have a model trained using `ddp` and saved using `pytorch_lightning.callbacks.model_checkpoint.ModelCheckpoint`. When I load it afterwards and try to make predictions for a batch (correctly preprocessed by the same procedure used during training), I get almost the same outputs regardless of the input (there is a difference, but it is negligible). However, when I use a Trainer (with parameters identical to those used during training, plus the DataModule used in training) for the inference, I obtain correct predictions. By correct I mean meaningful and dependent on the input (i.e. not almost constant for any input). Trainer used for training:
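Roughly like this (a sketch; only `strategy="ddp"` and `sync_batchnorm=True` are the settings that matter here, while the GPU count, epoch count, and checkpoint monitor are placeholders):

```python
import pytorch_lightning as pl
from pytorch_lightning.callbacks import ModelCheckpoint

# Placeholder monitor key; the real callback configuration is not shown here.
checkpoint_callback = ModelCheckpoint(monitor="val_loss")

trainer = pl.Trainer(
    gpus=2,                # placeholder GPU count
    strategy="ddp",        # distributed data parallel, as used in training
    sync_batchnorm=True,   # turning this off at inference reproduces the problem
    max_epochs=100,        # placeholder
    callbacks=[checkpoint_callback],
)
# trainer.fit(model, datamodule=dm)
```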
Then I experimented with the Trainer's params and figured out that when I use `sync_batchnorm=False`, `dp` instead of `ddp`, or `cpu` instead of `gpu` for the inference Trainer, I get the wrong results again.
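For comparison, the inference path that does work looks roughly like this (the argument values mirror the training sketch above and are equally placeholders; it also assumes the DataModule provides a `predict_dataloader`):

```python
# Same strategy and sync_batchnorm as in training; any deviation
# (sync_batchnorm=False, dp, cpu) brings the near-constant outputs back.
infer_trainer = pl.Trainer(gpus=2, strategy="ddp", sync_batchnorm=True)
predictions = infer_trainer.predict(model, datamodule=dm)
```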
Am I missing something crucial? I would like to use the trained model independently of the Trainer, and I don't see any reason why a model trained with `ddp` could not be run on, e.g., the CPU for inference. Specifically, I want to use Captum, which works with plain PyTorch models without Trainers.
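This is the kind of Trainer-free usage I'm after (a sketch; `MyModel`, the checkpoint path, `batch`, and the attribution target are placeholders):

```python
import torch
from captum.attr import IntegratedGradients

from my_project.models import MyModel  # placeholder import

# Load the checkpoint onto the CPU, outside any Trainer.
model = MyModel.load_from_checkpoint("checkpoints/best.ckpt", map_location="cpu")
model.eval()  # put BatchNorm/Dropout into inference mode

with torch.no_grad():
    preds = model(batch)  # batch preprocessed exactly as during training

# Captum operates on the plain nn.Module; no Trainer involved.
ig = IntegratedGradients(model)
attributions = ig.attribute(batch, target=0)  # placeholder target index
```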
I work on a CentOS Stream 8 server with `torch==1.10.0+cu111` and `pytorch-lightning==1.5.0`. Thank you for any guidance, I'm clueless.