Choose rank check based on filesystem #11136

@carmocca

This could be an issue in the future if the nodes' filesystem becomes a single shared volume.

Ideally, I think we should detect whether the filesystem is shared. The check would then use `self.global_rank if fs.shared_filesystem else self.local_rank`.

Originally posted by @tchaton in #11070 (comment)

Refers to this check:

https://github.com/PyTorchLightning/pytorch-lightning/blob/7e10f6d41fee5c1ac4dceab8abc0177955239094/pytorch_lightning/plugins/training_type/tpu_spawn.py#L335-L337
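
A minimal sketch of the proposed selection, not the plugin's actual code. The names `rank_for_filesystem`, `should_act`, and `marker_visible` are hypothetical, as is the assumption that the check compares the chosen rank against 0. The probe is only a heuristic: rank 0 writes a marker file and every rank reports whether it can see it. The caller would still need to combine the per-rank results (for example with a logical AND across processes) and supply a real `barrier`.

```python
from pathlib import Path
from typing import Callable


def rank_for_filesystem(global_rank: int, local_rank: int, shared_filesystem: bool) -> int:
    """Pick the rank to test: global on a shared filesystem, local otherwise."""
    return global_rank if shared_filesystem else local_rank


def should_act(global_rank: int, local_rank: int, shared_filesystem: bool) -> bool:
    """Assumed form of the check: only the process whose chosen rank is 0 acts."""
    return rank_for_filesystem(global_rank, local_rank, shared_filesystem) == 0


def marker_visible(
    directory: Path, global_rank: int, barrier: Callable[[], None], token: str
) -> bool:
    """Heuristic probe: can this process see a marker file written by global rank 0?

    `barrier` is a caller-supplied synchronization function (hypothetical here).
    `token` should be unique per run so stale markers are not mistaken for new ones.
    """
    marker = directory / f".fs_probe_{token}"
    if global_rank == 0:
        marker.touch()
    barrier()  # wait until rank 0 has written the marker
    visible = marker.exists()
    barrier()  # wait until every rank has looked
    if global_rank == 0:
        marker.unlink()
    return visible
```

With two nodes of eight processes each, the first process on the second node has `global_rank=8, local_rank=0`: `should_act(8, 0, shared_filesystem=False)` is `True`, while `should_act(8, 0, shared_filesystem=True)` is `False`, so only one process in the whole job acts when the volume is shared.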

cc @Borda @awaelchli @rohitgr7 @akihironitta @tchaton @kaushikb11

Labels: distributed (generic distributed-related topic), feature (is an improvement or enhancement), priority: 2 (low priority task)
