@@ -54,44 +54,6 @@ The resize.py is deployed on the bastion node as part of the HPC cluster Stack d
5454
5555```
5656playbooks/resize.py
57-
58- python3 playbooks/resize.py -h
59- usage: resize.py [-h] [--compartment_ocid COMPARTMENT_OCID]
60- [--cluster_name CLUSTER_NAME] [--nodes NODES [NODES ...]]
61- [--slurm_only_update [{true,false}]]
62- [{add,remove,list,reconfigure}] [number]
63-
64- Script to resize the CN
65-
66- positional arguments:
67- {add,remove,list,reconfigure}
68- Mode type. add/remove node options, implicitly
69- configures newly added nodes. Also implicitly
70- reconfigure/restart services like Slurm to recognize
71- new nodes. Similarly for remove option, terminates
72- nodes and implicitly reconfigure/restart services like
73- Slurm on rest of the cluster nodes to remove reference
74- to deleted nodes.
75- number Number of nodes to add or delete if a list of
76- hostnames is not defined
77-
78- optional arguments:
79- -h, --help show this help message and exit
80- --compartment_ocid COMPARTMENT_OCID
81- OCID of the compartment, defaults to the Compartment
82- OCID of the localhost
83- --cluster_name CLUSTER_NAME
84- Name of the cluster to resize. Defaults to the name
85- included in the bastion
86- --nodes NODES [NODES ...]
87- Number of nodes to add or delete if a list of
88- hostnames is not defined
89- --slurm_only_update [{true,false}]
90- To update /etc/hosts, slurm config and restart slurm
91- services.
92- [opc@assuring-woodcock-bastion ~]$
93-
94-
9557```
9658
9759** Add nodes**
@@ -102,19 +64,12 @@ Consist of the following sub-steps:
10264 - Configures newly added nodes to be ready to run the jobs
10365 - Reconfigure services like Slurm to recognize new nodes on all nodes
10466
105- Add one node
106- ```
107- python3 playbooks/resize.py add 1
108-
109- ```
110-
111- Add three node
67+
11268```
113- python3 playbooks/resize.py add 3
69+ python3 playbooks/resize.py.aug15 add 1
11470
11571```
11672
117-
11873** Remove nodes**
11974
12075Consist of the following sub-steps:
@@ -125,47 +80,47 @@ Consist of the following sub-steps:
12580
12681Remove specific node:
12782```
128- python3 playbooks/resize.py remove --nodes inst-dpi8e-assuring-woodcock
83+ python3 playbooks/resize.py.aug15 remove --nodes inst-dpi8e-assuring-woodcock
12984```
13085or
13186
13287Remove a list of nodes (space seperated):
13388```
134- python3 playbooks/resize.py remove --nodes inst-dpi8e-assuring-woodcock inst-ed5yh-assuring-woodcock
89+ python3 playbooks/resize.py.aug15 remove --nodes inst-dpi8e-assuring-woodcock inst-ed5yh-assuring-woodcock
13590```
13691or
13792Remove one node randomly:
13893```
139- python3 playbooks/resize.py remove 1
94+ python3 playbooks/resize.py.aug15 remove 1
14095```
14196or
14297Remove 3 nodes randomly:
14398```
144- python3 playbooks/resize.py remove 3
99+ python3 playbooks/resize.py.aug15 remove 3
145100
146101```
147102
148103** Reconfigure nodes**
149104
150105This allows users to reconfigure nodes (Ansible tasks) of the cluster.
151106
152- If you would like to do a ** slurm config update ONLY ** on all nodes of the cluster.
107+ If you would like to do a slurm config update on all nodes of the cluster.
153108
154109```
155- python3 playbooks/resize.py reconfigure --slurm_only_update true
110+ python3 playbooks/resize.py.aug15 reconfigure --slurm_only_update true
156111```
157112
158- Full reconfiguration of all nodes of the cluster. This will run the same steps, which are ran when a new cluster is created. If you manually updated configs which are created/updated as part of cluster configuration, then this command will overwrite your manual changes.
113+ Full reconfiguration of all nodes of the cluster. This runs the same steps, which are ran when a new cluster is created. If you manually updated configs which are created/updated as part of cluster configuration, then this command will overwrite your manual changes.
159114
160115```
161- python3 playbooks/resize.py reconfigure
116+ python3 playbooks/resize.py.aug15 reconfigure
162117```
163118
164- If you would like to fully reconfigure ONLY a specific node/nodes (space seperated).
119+ If you would like to fully reconfigure ONLY a specific node/nodes.
165120
166121```
167- python3 playbooks/resize.py reconfigure [--nodes NODES [NODES ...]]
168- Example: python3 resize.py reconfigure --nodes inst-gsezk-topical-goblin inst-jvpps-topical-goblin
122+ python3 playbooks/resize.py.aug15 reconfigure [--nodes NODES [NODES ...]]
123+ Example: python3 resize.py.aug15 reconfigure --nodes inst-gsezk-topical-goblin inst-jvpps-topical-goblin inst-ytuqj-topical-goblin
169124```
170125
171126
0 commit comments