Skip to content

Commit 61668fe

Browse files
authored
Merge branch 'main' into feat/extend-ood
2 parents c00d818 + bd878f0 commit 61668fe

File tree

8 files changed

+26
-22
lines changed

8 files changed

+26
-22
lines changed

.github/workflows/extra.yml

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -43,6 +43,7 @@ jobs:
4343
OS_CLOUD: openstack
4444
CI_CLOUD: ${{ vars.CI_CLOUD }} # default from repo settings
4545
ARK_PASSWORD: ${{ secrets.ARK_PASSWORD }}
46+
PACKER_GITHUB_API_TOKEN: ${{ secrets.GITHUB_TOKEN }}
4647

4748
steps:
4849
- uses: actions/checkout@v2
@@ -95,7 +96,7 @@ jobs:
9596
. environments/.stackhpc/activate
9697
cd packer/
9798
packer init .
98-
99+
99100
PACKER_LOG=1 packer build \
100101
-on-error=${{ vars.PACKER_ON_ERROR }} \
101102
-var-file=$PKR_VAR_environment_root/${{ env.CI_CLOUD }}.pkrvars.hcl \
@@ -104,7 +105,7 @@ jobs:
104105
-var "inventory_groups=${{ matrix.build.inventory_groups }}" \
105106
-var "volume_size=${{ matrix.build.volume_size }}" \
106107
openstack.pkr.hcl
107-
108+
108109
- name: Get created image names from manifest
109110
id: manifest
110111
run: |

.github/workflows/fatimage.yml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -39,6 +39,7 @@ jobs:
3939
CI_CLOUD: ${{ github.event.inputs.ci_cloud }}
4040
ARK_PASSWORD: ${{ secrets.ARK_PASSWORD }}
4141
LEAFCLOUD_PULP_PASSWORD: ${{ secrets.LEAFCLOUD_PULP_PASSWORD }}
42+
PACKER_GITHUB_API_TOKEN: ${{ secrets.GITHUB_TOKEN }}
4243

4344
steps:
4445
- uses: actions/checkout@v2

.github/workflows/nightlybuild.yml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -36,6 +36,7 @@ jobs:
3636
CI_CLOUD: ${{ github.event.inputs.ci_cloud || vars.CI_CLOUD }}
3737
ARK_PASSWORD: ${{ secrets.ARK_PASSWORD }}
3838
LEAFCLOUD_PULP_PASSWORD: ${{ secrets.LEAFCLOUD_PULP_PASSWORD }}
39+
PACKER_GITHUB_API_TOKEN: ${{ secrets.GITHUB_TOKEN }}
3940

4041
steps:
4142
- uses: actions/checkout@v2

ansible/cleanup.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -71,12 +71,12 @@
7171
vars:
7272
image_info:
7373
branch: "{{ lookup('pipe', 'git rev-parse --abbrev-ref HEAD') }}"
74-
build: "{{ ansible_nodename | split('.') | first }}" # hostname is image name, which contains build info
74+
build: "{{ ansible_nodename | split('.') | first }}" # hostname is image name, which contains build info
7575
os: "{{ ansible_distribution }} {{ ansible_distribution_version }}"
7676
kernel: "{{ ansible_kernel }}"
7777
ofed: "{{ ansible_facts.packages['mlnx-ofa_kernel'].0.version | default('-') }}"
7878
doca: "{{ ansible_facts.packages[doca_profile | default('doca-ofed') ].0.version | default('-') }}"
79-
cuda: "{{ ansible_facts.packages['cuda'].0.version | default('-') }}"
79+
cuda: "{{ ansible_facts.packages['cuda-toolkit'].0.version | default('-') }}"
8080
slurm-ohpc: "{{ ansible_facts.packages['slurm-ohpc'].0.version | default('-') }}"
8181

8282
- name: Show image summary

docs/operations.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -74,6 +74,7 @@ By default, the following utility packages are installed during the StackHPC ima
7474
- postfix
7575
- git
7676
- latest python version for system (3.6 for for Rocky 8.9 and 3.12 for Rocky 9.4)
77+
- s-nail
7778

7879
Additional packages can be added during image builds by:
7980
- adding the `extra_packages` group to the build `inventory_groups` (see

docs/production.md

Lines changed: 15 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -48,21 +48,28 @@ and referenced from the `site` and `production` environments, e.g.:
4848

4949
```
5050
...
51+
variable "environment_root" {
52+
type = string
53+
description = "Path to environment root, automatically set by activate script"
54+
}
55+
5156
module "cluster" {
5257
source = "../../site/tofu/"
58+
environment_root = var.environment_root
5359

5460
cluster_name = "foo"
5561
...
5662
}
5763
```
5864

5965
Note that:
60-
- Environment-specific variables (`cluster_name`) should be hardcoded
61-
into the cluster module block.
62-
- Environment-independent variables (e.g. maybe `cluster_net` if the
63-
same is used for staging and production) should be set as *defaults*
64-
in `environments/site/tofu/variables.tf`, and then don't need to
65-
be passed in to the module.
66+
67+
- Environment-specific variables (`cluster_name`) should be hardcoded
68+
into the cluster module block.
69+
- Environment-independent variables (e.g. maybe `cluster_net` if the
70+
same is used for staging and production) should be set as *defaults*
71+
in `environments/site/tofu/variables.tf`, and then don't need to
72+
be passed in to the module.
6673
6774
- Vault-encrypt secrets. Running the `generate-passwords.yml` playbook creates
6875
a secrets file at `environments/$ENV/inventory/group_vars/all/secrets.yml`.
@@ -120,22 +127,14 @@ and referenced from the `site` and `production` environments, e.g.:
120127
set the "attach" options and run `tofu apply` again - this should show there
121128
are no changes planned.
122129
123-
- Enable `etc_hosts` templating:
124-
125-
```yaml
126-
# environments/site/inventory/groups:
127-
[etc_hosts:children]
128-
cluster
129-
```
130-
131130
- Configure Open OnDemand - see [specific documentation](openondemand.md).
132131
133132
- Remove the `demo_user` user from `environments/$ENV/inventory/group_vars/all/basic_users.yml`
134133
135134
- Consider whether having (read-only) access to Grafana without login is OK. If not, remove `grafana_auth_anonymous` in `environments/$ENV/inventory/group_vars/all/grafana.yml`
136135
137-
- If floating IPs are required for login nodes, modify the OpenTofu configurations
138-
appropriately.
136+
- If floating IPs are required for login nodes, create these in OpenStack and add the IPs into
137+
the OpenTofu `login` definition.
139138
140139
- Consider whether mapping of baremetal nodes to ironic nodes is required. See
141140
[PR 485](https://github.com/stackhpc/ansible-slurm-appliance/pull/485).
Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
{
22
"cluster_image": {
3-
"RL8": "openhpc-RL8-250704-1445-ff88ca4e",
4-
"RL9": "openhpc-RL9-250704-1445-ff88ca4e"
3+
"RL8": "openhpc-RL8-250708-1502-1494192e",
4+
"RL9": "openhpc-RL9-250708-1547-1494192e"
55
}
66
}

environments/common/inventory/group_vars/all/defaults.yml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -94,6 +94,7 @@ appliances_extra_packages_default:
9494
- postfix
9595
- git
9696
- "{{ 'python36' if ansible_distribution_version == '8.9' else 'python312' }}"
97+
- s-nail
9798

9899
appliances_extra_packages_other: []
99100

0 commit comments

Comments
 (0)