You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
**Description**
Made updates to the docs to reflect the latest state and add improvement to the content to make them more useful.
**Testing Done**
The docs look correct, verified each action item.
|`DIRECTORY`| PATH | No | Target directory (default: current directory) |
41
41
|`--version`| TEXT | No | Schema version to use |
42
42
43
43
```{important}
44
44
The `resource_name_prefix` parameter in the generated `config.yaml` file serves as the primary identifier for all AWS resources created during deployment. Each deployment must use a unique resource name prefix to avoid conflicts. This prefix is automatically appended with a unique identifier during cluster creation to ensure resource uniqueness.
45
+
46
+
**Cluster stack names must be unique within each AWS region.** If you attempt to create a cluster stack with a name that already exists in the same region, the deployment will fail.
45
47
```
46
48
47
49
## hyp create
@@ -61,14 +63,18 @@ hyp create [OPTIONS]
61
63
|`--region`| TEXT | No | AWS region where the cluster stack will be created |
62
64
|`--debug`| FLAG | No | Enable debug logging |
63
65
64
-
## hyp update hyp-cluster
66
+
## hyp update cluster
65
67
66
68
Update an existing HyperPod cluster configuration.
67
69
70
+
```{important}
71
+
**Runtime vs Configuration Commands**: This command modifies an **existing, deployed cluster's** runtime settings (instance groups, node recovery). This is different from `hyp configure`, which only modifies local configuration files before cluster creation.
|`--node-recovery`| TEXT | No | Node recovery setting (Automatic or None) |
83
89
|`--debug`| FLAG | No | Enable debug logging |
84
90
85
-
## hyp list hyp-cluster
91
+
## hyp list cluster-stack
86
92
87
93
List all HyperPod cluster stacks (CloudFormation stacks).
88
94
89
95
#### Syntax
90
96
91
97
```bash
92
-
hyp list hyp-cluster [OPTIONS]
98
+
hyp list cluster-stack [OPTIONS]
93
99
```
94
100
95
101
#### Parameters
@@ -100,14 +106,18 @@ hyp list hyp-cluster [OPTIONS]
100
106
|`--status`| TEXT | No | Filter by stack status. Format: "['CREATE_COMPLETE', 'UPDATE_COMPLETE']" |
101
107
|`--debug`| FLAG | No | Enable debug logging |
102
108
103
-
## hyp describe hyp-cluster
109
+
## hyp describe cluster-stack
104
110
105
111
Describe a specific HyperPod cluster stack.
106
112
113
+
```{note}
114
+
**Region-Specific Stack Names**: Cluster stack names are unique within each AWS region. When describing a stack, ensure you specify the correct region where the stack was created, or the command will fail to find the stack.
115
+
```
116
+
107
117
#### Syntax
108
118
109
119
```bash
110
-
hyp describe hyp-cluster STACK-NAME [OPTIONS]
120
+
hyp describe cluster-stack STACK-NAME [OPTIONS]
111
121
```
112
122
113
123
#### Parameters
@@ -195,6 +205,10 @@ hyp get-monitoring [OPTIONS]
195
205
196
206
Configure cluster parameters interactively or via command line.
197
207
208
+
```{important}
209
+
**Pre-Deployment Configuration**: This command modifies local `config.yaml` files **before** cluster creation. For updating **existing, deployed clusters**, use `hyp update cluster` instead.
210
+
```
211
+
198
212
#### Syntax
199
213
200
214
```bash
@@ -208,13 +222,23 @@ This command dynamically supports all configuration parameters available in the
208
222
| Parameter | Type | Required | Description |
209
223
|-----------|------|----------|-------------|
210
224
|`--resource-name-prefix`| TEXT | No | Prefix for all AWS resources |
211
-
|`--stage`| TEXT | No | Deployment stage ("gamma" or "prod") |
212
-
|`--vpc-cidr`| TEXT | No | VPC CIDR block |
213
-
|`--kubernetes-version`| TEXT | No | Kubernetes version for EKS cluster |
225
+
|`--create-hyperpod-cluster-stack`| BOOLEAN | No | Create HyperPod Cluster Stack |
226
+
|`--hyperpod-cluster-name`| TEXT | No | Name of SageMaker HyperPod Cluster |
227
+
|`--create-eks-cluster-stack`| BOOLEAN | No | Create EKS Cluster Stack |
228
+
|`--kubernetes-version`| TEXT | No | Kubernetes version |
229
+
|`--eks-cluster-name`| TEXT | No | Name of the EKS cluster |
230
+
|`--create-helm-chart-stack`| BOOLEAN | No | Create Helm Chart Stack |
231
+
|`--namespace`| TEXT | No | Namespace to deploy HyperPod Helm chart |
232
+
|`--node-provisioning-mode`| TEXT | No | Continuous provisioning mode |
214
233
|`--node-recovery`| TEXT | No | Node recovery setting ("Automatic" or "None") |
215
-
|`--env`| JSON | No | Environment variables as JSON object |
216
-
|`--args`| JSON | No | Command arguments as JSON array |
217
-
|`--command`| JSON | No | Command to run as JSON array |
234
+
|`--create-vpc-stack`| BOOLEAN | No | Create VPC Stack |
235
+
|`--vpc-id`| TEXT | No | Existing VPC ID |
236
+
|`--vpc-cidr`| TEXT | No | VPC CIDR block |
237
+
|`--create-security-group-stack`| BOOLEAN | No | Create Security Group Stack |
238
+
|`--enable-hp-inference-feature`| BOOLEAN | No | Enable inference operator |
239
+
|`--stage`| TEXT | No | Deployment stage ("gamma" or "prod") |
240
+
|`--create-fsx-stack`| BOOLEAN | No | Create FSx Stack |
241
+
|`--storage-capacity`| INTEGER | No | FSx storage capacity in GiB |
218
242
|`--tags`| JSON | No | Resource tags as JSON object |
219
243
220
244
**Note:** The exact parameters available depend on your current template type and version. Run `hyp configure --help` to see all available options for your specific configuration.
@@ -302,18 +326,56 @@ The `config.yaml` file supports the following parameters:
302
326
303
327
| Parameter | Type | Description | Default |
304
328
|-----------|------|-------------|---------|
305
-
|`template`| TEXT | Template name | "hyp-cluster" |
306
-
|`namespace`| TEXT | Kubernetes namespace | "kube-system" |
307
-
|`stage`| TEXT | Deployment stage | "gamma" |
308
-
|`resource_name_prefix`| TEXT | Resource name prefix | "sagemaker-hyperpod-eks" |
309
-
|`vpc_cidr`| TEXT | VPC CIDR block | "10.192.0.0/16" |
329
+
|`resource_name_prefix`| TEXT | Prefix for all AWS resources (4-digit UUID added during submission) | "hyp-eks-stack" |
**Note:** The actual available configuration parameters depend on the specific template schema version. Use `hyp init hyp-cluster` to see all available parameters for your version.
334
+
|`eks_cluster_name`| TEXT | Name of the EKS cluster | "eks-cluster" |
|`fsx_subnet_id`| TEXT | Subnet ID for FSx creation | "" |
371
+
|`fsx_availability_zone_id`| TEXT | Availability zone for FSx subnet | "" |
372
+
|`per_unit_storage_throughput`| INTEGER | Per unit storage throughput | 250 |
373
+
|`data_compression_type`| TEXT | Data compression type ("NONE" or "LZ4") | "NONE" |
374
+
|`file_system_type_version`| FLOAT | File system type version | 2.15 |
375
+
|`storage_capacity`| INTEGER | Storage capacity in GiB | 1200 |
376
+
|`fsx_file_system_id`| TEXT | Existing FSx file system ID | "" |
377
+
378
+
**Note:** The actual available configuration parameters depend on the specific template schema version. Use `hyp init cluster-stack` to see all available parameters for your version.
Copy file name to clipboardExpand all lines: doc/getting_started/cluster_management.rst
+26-7Lines changed: 26 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,6 +15,8 @@ Before you begin, ensure you have:
15
15
.. note::
16
16
**Region Configuration**: For commands that accept the ``--region`` option, if no region is explicitly provided, the command will use the default region from your AWS credentials configuration.
17
17
18
+
**Cluster stack names must be unique within each AWS region.** If you attempt to create a cluster stack with a name that already exists in the same region, the deployment will fail.
19
+
18
20
Creating Your First Cluster
19
21
----------------------------
20
22
@@ -37,7 +39,7 @@ It's recommended to start with a new and clean directory for each cluster config
The ``hyp configure`` command only modifies local configuration files. It does not affect existing deployed clusters.
76
81
77
82
4. Create the Cluster
78
83
~~~~~~~~~~~~~~~~~~~~~
79
84
85
+
.. warning::
86
+
**Cluster Stack Name Uniqueness**: Cluster stack names must be unique within each AWS region. Ensure your ``resource_name_prefix`` in ``config.yaml`` generates a unique stack name for the target region to avoid deployment conflicts.
87
+
80
88
.. tab-set::
81
89
82
90
.. tab-item:: CLI
@@ -102,7 +110,7 @@ Check the status of your cluster:
**Region-Specific Stack Names**: Cluster stack names are unique within each AWS region. When describing a stack, ensure you specify the correct region where the stack was created, or the command will fail to find the stack.
0 commit comments