r4victor commented Jun 6, 2025

Addresses #2712

Tested by running torch distributed training on two Nebius 8xH100 nodes with InfiniBand.

r4victor merged commit 5d7c488 into master Jun 6, 2025
39 checks passed
r4victor deleted the issue_2712_infiniband_docker branch June 6, 2025 05:14
haydnli-shopify pushed a commit to haydnli-shopify/dstack that referenced this pull request Jun 10, 2025