Lightning-AI pytorch-lightning Discussions
DDP Hangs with TORCH_DISTRIBUTED_DEBUG = DETAIL
strategy: ddp · DistributedDataParallel
Imprecision in pytorch-lightning's Gradient Accumulation?
leehawk787 asked Oct 7, 2023 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered
Advanced profiling
mshooter asked Oct 3, 2023 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered
Why does each process get a different global seed in FSDP?
willtryagain asked Sep 30, 2023 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered
Logging for elements of compound/weighted loss
maciejzj asked Sep 29, 2023 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered
LightningCLI: suggested structure for callbacks/loggers
adamjstewart asked Sep 25, 2023 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered
How to keep lr fixed for the first N epochs, and then use a scheduler for the rest of training
Mo-Junyang asked Sep 28, 2023 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Closed · Unanswered
ModelCheckpoint seems not to work correctly with check_val_every_n_epoch>1
Mo-Junyang asked Sep 26, 2023 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Closed · Unanswered
PyTorch Profiler Stats Only Showing for "Records"
alexander-zhang asked Aug 25, 2023 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered
Custom Dataloader for Very Large Datasets
VRM1 asked Apr 10, 2023 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered
Where to transform and inverse-transform
aurany asked Jan 3, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered