Skip to content

Commit eafe380

Browse files
[Docs] Updated the AWS example under Clusters (#3387)
* [Docs] Updated the AWS example under `Clusters` * [Docs] Updated the AWS example under Clusters Added a redirect * [Docs] Updated the AWS example under Clusters Deleted outdated archive folder
1 parent 7f97339 commit eafe380

File tree

10 files changed

+16
-254
lines changed

10 files changed

+16
-254
lines changed

docs/blog/archive/ambassador-program.md

Lines changed: 0 additions & 65 deletions
This file was deleted.

docs/blog/archive/efa.md

Lines changed: 0 additions & 173 deletions
This file was deleted.

docs/blog/posts/changelog-07-25.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -144,7 +144,7 @@ resources:
144144

145145
#### AWS EFA
146146

147-
EFA is a network interface for EC2 that enables low-latency, high-bandwidth communication between nodes—crucial for scaling distributed deep learning. With `dstack`, EFA is automatically enabled when using supported instance types in fleets. Check out our [example](../../examples/clusters/efa/index.md)
147+
EFA is a network interface for EC2 that enables low-latency, high-bandwidth communication between nodes—crucial for scaling distributed deep learning. With `dstack`, EFA is automatically enabled when using supported instance types in fleets. Check out our [example](../../examples/clusters/aws/index.md)
148148

149149
#### Default Docker images
150150

docs/docs/concepts/fleets.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -107,7 +107,7 @@ This ensures all instances are provisioned with optimal inter-node connectivity.
107107
Note, EFA requires the `public_ips` to be set to `false` in the `aws` backend configuration.
108108
Otherwise, instances are only connected by the default VPC subnet.
109109

110-
Refer to the [EFA](../../examples/clusters/efa/index.md) example for more details.
110+
Refer to the [AWS](../../examples/clusters/aws/index.md) example for more details.
111111

112112
??? info "GCP"
113113
When you create a fleet with GCP, `dstack` automatically configures [GPUDirect-TCPXO and GPUDirect-TCPX](https://cloud.google.com/kubernetes-engine/docs/how-to/gpu-bandwidth-gpudirect-tcpx-autopilot) networking for the A3 Mega and A3 High instance types, as well as RoCE networking for the A4 instance type.

docs/docs/guides/clusters.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@ For cloud fleets, fast interconnect is currently supported only on the `aws`, `g
2222

2323
!!! info "Backend configuration"
2424
Note, EFA requires the `public_ips` to be set to `false` in the `aws` backend configuration.
25-
Refer to the [EFA](../../examples/clusters/efa/index.md) example for more details.
25+
Refer to the [AWS](../../examples/clusters/aws/index.md) example for more details.
2626

2727
=== "GCP"
2828
When you create a cloud fleet with GCP, `dstack` automatically configures [GPUDirect-TCPXO and GPUDirect-TCPX](https://cloud.google.com/kubernetes-engine/docs/how-to/gpu-bandwidth-gpudirect-tcpx-autopilot) networking for the A3 Mega and A3 High instance types, as well as RoCE networking for the A4 instance type.

docs/examples.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -113,7 +113,7 @@ hide:
113113
<a href="/examples/clusters/efa"
114114
class="feature-cell sky">
115115
<h3>
116-
AWS EFA
116+
AWS
117117
</h3>
118118

119119
<p>
File renamed without changes.
Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
# AWS EFA
1+
# AWS
22

33
In this guide, we’ll walk through how to run high-performance distributed training on AWS using [Amazon Elastic Fabric Adapter (EFA)](https://aws.amazon.com/hpc/efa/) with `dstack`.
44

@@ -37,11 +37,11 @@ projects:
3737

3838
Once your backend is ready, define a fleet configuration.
3939

40-
<div editor-title="examples/clusters/efa/fleet.dstack.yml">
40+
<div editor-title="examples/clusters/aws/efa-fleet.dstack.yml">
4141

4242
```yaml
4343
type: fleet
44-
name: my-efa-fleet
44+
name: efa-fleet
4545
4646
nodes: 2
4747
placement: cluster
@@ -57,14 +57,14 @@ Provision the fleet with `dstack apply`:
5757
<div class="termy">
5858

5959
```shell
60-
$ dstack apply -f examples/clusters/efa/fleet.dstack.yml
60+
$ dstack apply -f examples/clusters/aws/efa-fleet.dstack.yml
6161
6262
Provisioning...
6363
---> 100%
6464
65-
FLEET INSTANCE BACKEND INSTANCE TYPE GPU PRICE STATUS CREATED
66-
my-efa-fleet 0 aws (us-west-2) p4d.24xlarge H100:8:80GB $98.32 idle 3 mins ago
67-
1 aws (us-west-2) p4d.24xlarge H100:8:80GB $98.32 idle 3 mins ago
65+
FLEET INSTANCE BACKEND INSTANCE TYPE GPU PRICE STATUS CREATED
66+
efa-fleet 0 aws (us-west-2) p4d.24xlarge H100:8:80GB $98.32 idle 3 mins ago
67+
1 aws (us-west-2) p4d.24xlarge H100:8:80GB $98.32 idle 3 mins ago
6868
```
6969

7070
</div>
@@ -76,7 +76,7 @@ Provisioning...
7676

7777
```yaml
7878
type: fleet
79-
name: my-efa-fleet
79+
name: efa-fleet
8080
8181
nodes: 2
8282
placement: cluster

mkdocs.yml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -145,11 +145,10 @@ plugins:
145145
'docs/examples/deployment/tgi/index.md': 'examples/inference/tgi/index.md'
146146
'providers.md': 'partners.md'
147147
'backends.md': 'partners.md'
148-
'blog/ambassador-program.md': 'blog/archive/ambassador-program.md'
149148
'blog/monitoring-gpu-usage.md': 'blog/posts/dstack-metrics.md'
150149
'blog/inactive-dev-environments-auto-shutdown.md': 'blog/posts/inactivity-duration.md'
151150
'blog/data-centers-and-private-clouds.md': 'blog/posts/gpu-blocks-and-proxy-jump.md'
152-
'blog/distributed-training-with-aws-efa.md': 'examples/clusters/efa/index.md'
151+
'blog/distributed-training-with-aws-efa.md': 'examples/clusters/aws/index.md'
153152
'blog/dstack-stats.md': 'blog/posts/dstack-metrics.md'
154153
'docs/concepts/metrics.md': 'docs/guides/metrics.md'
155154
'docs/guides/monitoring.md': 'docs/guides/metrics.md'
@@ -166,11 +165,12 @@ plugins:
166165
'examples/deployment/trtllm/index.md': 'examples/inference/trtllm/index.md'
167166
'examples/fine-tuning/trl/index.md': 'examples/single-node-training/trl/index.md'
168167
'examples/fine-tuning/axolotl/index.md': 'examples/single-node-training/axolotl/index.md'
169-
'blog/efa.md': 'examples/clusters/efa/index.md'
168+
'blog/efa.md': 'examples/clusters/aws/index.md'
170169
'docs/concepts/repos.md': 'docs/concepts/dev-environments.md#repos'
171170
'examples/clusters/a3high/index.md': 'examples/clusters/gcp/index.md'
172171
'examples/clusters/a3mega/index.md': 'examples/clusters/gcp/index.md'
173172
'examples/clusters/a4/index.md': 'examples/clusters/gcp/index.md'
173+
'examples/clusters/efa/index.md': 'examples/clusters/aws/index.md'
174174
- typeset
175175
- gen-files:
176176
scripts: # always relative to mkdocs.yml
@@ -326,7 +326,7 @@ nav:
326326
- NCCL tests: examples/clusters/nccl-tests/index.md
327327
- RCCL tests: examples/clusters/rccl-tests/index.md
328328
- GCP: examples/clusters/gcp/index.md
329-
- AWS EFA: examples/clusters/efa/index.md
329+
- AWS: examples/clusters/aws/index.md
330330
- Crusoe: examples/clusters/crusoe/index.md
331331
- Inference:
332332
- SGLang: examples/inference/sglang/index.md

0 commit comments

Comments
 (0)