-
Notifications
You must be signed in to change notification settings - Fork 3.6k
Lightning-AI pytorch-lightning Lightning-trainer-api-trainer-lightningmodule-lightningdatamodule Discussions
Pinned Discussions
Sort by:
Latest activity
Categories, most helpful, and community links
Categories
Community links
⚡ Lightning Trainer API: Trainer, LightningModule, LightningDataModule Discussions
Questions about the Lightning Module, Trainer, or anything lighting related!
-
You must be logged in to vote ⚡ "resume from checkpoint" lead to CUDA out of memory
Defiler24 askedJan 21, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ Attribute Error when resuming training from checkpoint file
pamparana34 askedAug 8, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ What is updated *per-epoch* (and not *per-batch*)?
jpcbertoldo askedAug 5, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ Proper way of making predictions with multi-GPUs
marcmk6 askedAug 5, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ Change the scheduler interval in CLI
lightningclipl.cli.LightningCLI ForJadeForest askedAug 2, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ Why does pytorch lightning cause more GPU memory usage?
accelerator: cudaCompute Unified Device Architecture GPU performance plGeneric label for PyTorch Lightning package chuzheng88 askedJul 14, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ global_step while using two dataloaders
fugokidi askedAug 3, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ -
You must be logged in to vote ⚡ -
You must be logged in to vote ⚡ Weighting different losses based on random initialization of network
data handlingGeneric data-related topic lightningmodulepl.LightningModule Michael-Geuenich askedJul 23, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ Why does DDP mode continue the program in multiple process for longer than intended?
strategy: ddpDistributedDataParallel hfaghihi15 askedJun 2, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ Error: found at least two devices with DataParallel
strategy: dp (removed in pl)DataParallel avivko askedMay 30, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ Access model weights during training
vokcow askedAug 1, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ limit_train_batches
data handlingGeneric data-related topic trainer: argumentsirtris askedJul 28, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ Mixed precision on an Huggingface models
marcmk6 askedJul 29, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ train_time_interval checkpoint and metric value
DA-L3 askedJul 28, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ Handling batch normalization with gradient accumulation
brunomaga askedJul 27, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ Efficient way of loading data from an h5 dataset?
malfonsoarquimea askedJul 26, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ About the loss on the progress bar
Struggle-Forever askedJul 26, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ Cant see validation accuracy & loss in validation_step?
SamPusegaonkar askedJul 19, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ Callback Checkpointing never called
MaugrimEP askedJul 22, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ Is there a simple way to use old (PyTorch Lightning < 1.6)
progress tracking (internal)global_step
update behavior?Related to the progress tracking dataclasses lecacosa askedMay 9, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote ⚡ Registering logger/trainer inside nested PL models
lightningmodulepl.LightningModule trainerCompRhys askedMay 24, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ use custom cloud_build_config with the root LightningFlow
app (removed)Generic label for Lightning App package aniketmaurya askedJul 22, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ when and how the trainer or module move the data to gpu?
data handlingGeneric data-related topic accelerator: cudaCompute Unified Device Architecture GPU plGeneric label for PyTorch Lightning package FutureWithoutEnding askedJul 18, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered