-
Notifications
You must be signed in to change notification settings - Fork 3.6k
Lightning-AI pytorch-lightning Discussions
Pinned Discussions
Sort by:
Latest activity
Categories, most helpful, and community links
Categories
Community links
Discussions
-
You must be logged in to vote ⚡ Callback Checkpointing never called
MaugrimEP askedJul 22, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ Is there a simple way to use old (PyTorch Lightning < 1.6)
progress tracking (internal)global_step
update behavior?Related to the progress tracking dataclasses lecacosa askedMay 9, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ Registering logger/trainer inside nested PL models
lightningmodulepl.LightningModule trainerCompRhys askedMay 24, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ use custom cloud_build_config with the root LightningFlow
app (removed)Generic label for Lightning App package aniketmaurya askedJul 22, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ when and how the trainer or module move the data to gpu?
data handlingGeneric data-related topic accelerator: cudaCompute Unified Device Architecture GPU plGeneric label for PyTorch Lightning package FutureWithoutEnding askedJul 18, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote 🤖 Restarting parts of cluster
distributedGeneric distributed-related topic -
You must be logged in to vote 💬 -
You must be logged in to vote ⚡ How to log with accumulate_grad_batches
FrancescoSaverioZuppichini askedAug 2, 2021 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ How to delete all the gradients in between operations
malfonsoarquimea askedJul 19, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote 💭 -
You must be logged in to vote ⚡ How can I show my information on the progress bar?
xmy0916 askedJul 19, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ LightningCLI: Passing objects via link_arguments
lightningclipl.cli.LightningCLI lneukom askedJul 4, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ How to create a dataset and dataloader for whole images?
data handlingGeneric data-related topic performance plGeneric label for PyTorch Lightning package malfonsoarquimea askedJul 18, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ How to perform ungraceful shutdown
plGeneric label for PyTorch Lightning package smolPixel askedJul 18, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ Run on certain GPUs
lightningclipl.cli.LightningCLI trainer: argument plGeneric label for PyTorch Lightning package quancs askedAug 14, 2021 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ How to retrive training states (epoch, ‘loss_val’, etc..) from a checkpoint?
checkpointingRelated to checkpointing plGeneric label for PyTorch Lightning package acercyc askedJul 18, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ Restore optimization/scheduler states saved without pytorch-lightning
checkpointingRelated to checkpointing optimization plGeneric label for PyTorch Lightning package ewrfcas askedJul 17, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote 🤖 Sharding and training multiple models at once for a large scale reinforcement learning
strategy: deepspeed plGeneric label for PyTorch Lightning package -
You must be logged in to vote ⚡ RuntimeError: Expected all tensors to be on the same device
DataParallel strategy: ddpDistributedDataParallel accelerator: cudaCompute Unified Device Architecture GPU plGeneric label for PyTorch Lightning package ddicostanzo askedJul 15, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ how to run horovod strategy?
strategy: horovod (removed) plGeneric label for PyTorch Lightning package JiahaoYao askedJul 14, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ where the new callbaclks located?
callback hooksRelated to the hooks API JiahaoYao askedJul 10, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ monitoring metric and model saving
loggingRelated to the `LoggerConnector` and `log()` trainer: validateStruggle-Forever askedJul 13, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ How to immediately end training?
vedantroy askedJul 7, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ Are we able to construct the model after DDP is initialized?
strategy: ddpDistributedDataParallel MultiPath askedJul 13, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡