Commit 0644502

removed topology from default groups + added docs
1 parent ca6d858 commit 0644502

2 files changed: +8 -2 lines changed

docs/production.md

Lines changed: 7 additions & 0 deletions
@@ -118,6 +118,13 @@ and referenced from the `site` and `production` environments, e.g.:
 - If floating IPs are required for login nodes, create these in OpenStack and add the IPs into
 the OpenTofu `login` definition.
 
+- Consider enabling topology aware scheduling. This is currently only supported if your cluster does not include any baremetal nodes. This can be enabled by:
+1. Creating Availability Zones in your OpenStack project for each physical rack
+2. Setting the `availability_zone` fields of compute groups in your OpenTofu configuration
+3. Adding the `compute` group as a child of `topology` in `environments/$ENV/inventory/groups`
+4. (Optional) If you are aware of the physical topology of switches above the rack-level, override `topology_above_rack_topology` in your groups vars
+(see [topology docs](../ansible/roles/topology/README.md) for more detail)
+
 - Consider whether mapping of baremetal nodes to ironic nodes is required. See
 [PR 485](https://github.com/stackhpc/ansible-slurm-appliance/pull/485).
 
environments/site/inventory/groups

Lines changed: 1 addition & 2 deletions
@@ -140,9 +140,8 @@ builder
 # Hosts to configure for node health checks
 compute
 
-[topology:children]
+[topology]
 # Compute nodes to be included in the Slurm topology plugin's topology tree. See ansible/roles/topology
 # Should be set to `compute` if enabled
 # Note that this feature currently assumes all compute nodes are VMs, enabling
 # when the cluster contains baremetal compute nodes may lead to unexpected scheduling behaviour
-compute
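
For reference, a minimal sketch of what step 3 of the new docs could look like in an environment's own groups file now that `topology` is empty by default. It assumes standard Ansible INI inventory semantics, where child groups are listed under a `:children` section; the file path is the one named in the docs change, and nothing here is taken from the topology role itself.

```ini
# environments/$ENV/inventory/groups
# Assumption: standard Ansible INI inventory syntax, where child groups
# are listed under a `:children` header.
[topology:children]
# Only enable this when all compute nodes are VMs (see the note in the
# site groups file about baremetal nodes).
compute
```

Under a plain `[topology]` header, Ansible would read `compute` as a host name rather than a group, which is why the sketch uses the `:children` form.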
