NeptuneLogger raises "Error occurred during asynchronous operation processing: Cannot upload file" during checkpointing
#12547
Unanswered
dagap asked this question in Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Replies: 2 comments
-
Hi @dagap, Prince Canuma here, a data scientist at neptune.ai. I'm here to help you with this one. Can you provide a minimal reproducible code example?
-
Hi @dagap, I'm just checking in to see if you still need help with this one, or if you have any other questions I can help with.
-
I am using PyTorch Lightning together with the Neptune logger. My trainer setup has the following parameters:
My `ModelCheckpoint` callback has the following parameters:

Now, when I run the training, it runs fine, but when it tries to upload everything to Neptune, it shows the following errors:
I'm not sure what is going on. I tried setting `log_every_n_steps` to 0 to try to remove intermediate checkpoint logging, but that does not seem to work either.

The trainer does upload the final model to the checkpoint store. It seems that it is the intermediate checkpoints that cannot be uploaded.
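
For anyone trying to reproduce this, here is a minimal sketch of the kind of setup described above. It is not the original configuration, which is not shown in this post: the project name, checkpoint directory, monitored metric, and the helper functions `logger_kwargs`, `checkpoint_kwargs`, and `build_trainer` are all hypothetical. As far as I understand, `log_every_n_steps` only controls how often metrics are logged, while checkpoint uploads are governed by the `log_model_checkpoints` argument of `NeptuneLogger`, so turning that off may be one way to test whether the intermediate uploads are the source of the error.

```python
"""Sketch: keep Lightning checkpoints on disk without uploading them to Neptune."""


def logger_kwargs(upload_checkpoints: bool = False) -> dict:
    # Keyword arguments intended for pytorch_lightning.loggers.NeptuneLogger.
    # `log_model_checkpoints` decides whether checkpoints saved by
    # ModelCheckpoint are also uploaded to the Neptune run.
    return {
        "project": "my-workspace/my-project",  # hypothetical project name
        "log_model_checkpoints": upload_checkpoints,
    }


def checkpoint_kwargs() -> dict:
    # Hypothetical ModelCheckpoint settings; the values from the post are unknown.
    return {"dirpath": "checkpoints", "monitor": "val_loss", "save_top_k": 1}


def build_trainer():
    # Imported lazily so the helpers above can be inspected
    # even when Lightning and the Neptune client are not installed.
    from pytorch_lightning import Trainer
    from pytorch_lightning.callbacks import ModelCheckpoint
    from pytorch_lightning.loggers import NeptuneLogger

    # Assumes the API token is available in the NEPTUNE_API_TOKEN environment variable.
    logger = NeptuneLogger(**logger_kwargs(upload_checkpoints=False))
    checkpoint = ModelCheckpoint(**checkpoint_kwargs())
    return Trainer(logger=logger, callbacks=[checkpoint], log_every_n_steps=50)
```

Calling `build_trainer()` and then `trainer.fit(...)` with the flag off should still write checkpoints locally. If the error disappears, that would point at the checkpoint upload path rather than at metric logging.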