TPU training and WandB implementation causes "AssertionError: can only test a child process" #6639
Unanswered
adamDhalla
asked this question in
Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Replies: 1 comment
-
@adamDhalla is this still an issue for you? I think wandb support for TPUs should be better in the latest version |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi all,
I'm not raising this issue as a bug since it's already been raised here, and was seemingly solved.
I have pretty much the exact problem - I'm trying to implement weights and biases with a TPU and it keeps showing "AssertionError: can only test a child process". The person who raised the old error also replicated the issue in the boring model here.
The thing is he never explains how he solves the issue - he explains the issue itself, but doesn't say how he solves it. I'm wondering where I should be initializing the wandb object:
Right now, I'm initializing both of those lines above the trainer. The trainer requires wandb_logger be passed through, so I can't do the wandb_logger = WandbLogger... line any higher up.
The whole problem seems to come about as a problem with distributed processing and WandB.
Thanks,
Adam
Beta Was this translation helpful? Give feedback.
All reactions