Commit 1ba943a

Merge branch 'main' into fix/additional-nodes-inventory
2 parents 1cc6a25 + b7bd7a9 commit 1ba943a

70 files changed: +179 -139 lines changed

.github/bin/get-s3-image.sh

Lines changed: 1 addition & 1 deletion
@@ -17,7 +17,7 @@ if [ -n "$image_exists" ]; then
 else
 echo "Image $image_name not found in OpenStack. Getting it from S3."

-wget https://object.arcus.openstack.hpc.cam.ac.uk/swift/v1/AUTH_3a06571936a0424bb40bc5c672c4ccb1/$bucket_name/$image_name --progress=dot:giga
+wget https://leafcloud.store/swift/v1/AUTH_f39848421b2747148400ad8eeae8d536/$bucket_name/$image_name --progress=dot:giga

 echo "Uploading image $image_name to OpenStack..."
 openstack image create --file $image_name --disk-format qcow2 $image_name --progress

.github/workflows/fatimage.yml

Lines changed: 1 addition & 1 deletion
@@ -31,7 +31,7 @@ jobs:
 source_image_name: Rocky-8-GenericCloud-Base-8.10-20240528.0.x86_64.raw
 inventory_groups: control,compute,login,update
 - image_name: openhpc-RL9
-source_image_name: Rocky-9-GenericCloud-Base-9.5-20241118.0.x86_64.raw
+source_image_name: Rocky-9-GenericCloud-Base-9.6-20250531.0.x86_64.qcow2
 inventory_groups: control,compute,login,update
 env:
 ANSIBLE_FORCE_COLOR: True

.github/workflows/release-image.yml

Lines changed: 1 addition & 1 deletion
@@ -21,7 +21,7 @@ jobs:
 - uses: actions/checkout@v2

 - name: Write s3cmd configuration
-run: echo "${{ secrets.ARCUS_S3_CFG }}" > ~/.s3cfg
+run: echo "${{ secrets.LEAFCLOUD_S3_CFG }}" > ~/.s3cfg

 - name: Install s3cmd
 run: |

.github/workflows/s3-image-sync.yml

Lines changed: 3 additions & 3 deletions
@@ -1,4 +1,4 @@
-name: Upload CI-tested images to Arcus S3 and sync clouds
+name: Upload CI-tested images to Leafcloud S3 and sync clouds
 on:
 workflow_dispatch:
 push:
@@ -21,7 +21,7 @@ jobs:

 - name: Write s3cmd configuration
 run: |
-echo "${{ secrets['ARCUS_S3_CFG'] }}" > ~/.s3cfg
+echo "${{ secrets['LEAFCLOUD_S3_CFG'] }}" > ~/.s3cfg
 shell: bash

 - name: Install s3cmd
@@ -73,7 +73,7 @@ jobs:

 - name: Write s3cmd configuration
 run: |
-echo "${{ secrets['ARCUS_S3_CFG'] }}" > ~/.s3cfg
+echo "${{ secrets['LEAFCLOUD_S3_CFG'] }}" > ~/.s3cfg
 shell: bash

 - name: Install s3cmd and qemu-utils

.github/workflows/upload-release-image.yml.sample

Lines changed: 1 addition & 1 deletion
@@ -19,7 +19,7 @@ on:
 inputs:
 image_name:
 type: string
-description: Image name from: (https://object.arcus.openstack.hpc.cam.ac.uk/swift/v1/AUTH_3a06571936a0424bb40bc5c672c4ccb1/{BUCKET_NAME})
+description: Image name from: (https://leafcloud.store/swift/v1/AUTH_f39848421b2747148400ad8eeae8d536/{BUCKET_NAME})
 required: true
 bucket_name:
 type: choice

README.md

Lines changed: 1 addition & 1 deletion
@@ -61,7 +61,7 @@ Run the following from the repository root to activate the venv:
 Use the `cookiecutter` template to create a new environment to hold your configuration:

 cd environments
-cookiecutter skeleton
+cookiecutter ../cookiecutter

 and follow the prompts to complete the environment name and description.

ansible/fatimage.yml

Lines changed: 5 additions & 0 deletions
@@ -190,6 +190,11 @@
 tasks_from: install.yml
 when: "'alertmanager' in group_names"

+- name: Download HPL source
+include_role:
+name: hpctests
+tasks_from: source-hpl.yml
+
 - hosts: prometheus
 become: yes
 gather_facts: yes

ansible/roles/alertmanager/README.md

Lines changed: 1 addition & 4 deletions
@@ -11,12 +11,9 @@ Note that:
 - No Grafana dashboard for alerts is currently provided.

 Alertmanager is enabled by default on the `control` node in the
-[everything](../../../environments/common/layouts/everything) template which
-`cookiecutter` uses for a new environment's `inventory/groups` file.
+`site` environment's `inventory/groups` file.

 In general usage may only require:
-- Adding the `control` node into the `alertmanager` group in `environments/site/groups`
-if upgrading an existing environment.
 - Enabling the Slack integration (see section below).
 - Possibly setting `alertmanager_web_external_url`.

ansible/roles/block_devices/README.md

Lines changed: 1 addition & 1 deletion
@@ -11,7 +11,7 @@ This is a convenience wrapper around the ansible modules:

 To avoid issues with device names changing after e.g. reboots, devices are identified by serial number and mounted by filesystem UUID.

-**NB:** This role is ignored[^1] during Packer builds as block devices will not be attached to the Packer build VMs. This role is therefore deprecated and it is suggested that `cloud-init` is used instead. See e.g. `environments/skeleton/{{cookiecutter.environment}}/tofu/control.userdata.tpl`.
+**NB:** This role is ignored[^1] during Packer builds as block devices will not be attached to the Packer build VMs. This role is therefore deprecated and it is suggested that `cloud-init` is used instead. See e.g. `environments/site/tofu/control.userdata.tpl`.

 [^1]: See `environments/common/inventory/group_vars/builder/defaults.yml`

ansible/roles/freeipa/README.md

Lines changed: 2 additions & 2 deletions
@@ -7,7 +7,7 @@ Support FreeIPA in the appliance. In production use it is expected the FreeIPA s

 ## Usage
 - Add hosts to the `freeipa_client` group and run (at a minimum) the `ansible/iam.yml` playbook.
-- Host names must match the domain name. By default (using the skeleton OpenTofu) hostnames are of the form `nodename.cluster_name.cluster_domain_suffix` where `cluster_name` and `cluster_domain_suffix` are OpenTofu variables.
+- Host names must match the domain name. By default (using the site OpenTofu) hostnames are of the form `nodename.cluster_name.cluster_domain_suffix` where `cluster_name` and `cluster_domain_suffix` are OpenTofu variables.
 - Hosts discover the FreeIPA server FQDN (and their own domain) from DNS records. If DNS servers are not set this is not set from DHCP, then use the `resolv_conf` role to configure this. For example when using the in-appliance FreeIPA development server:

 ```ini
@@ -28,7 +28,7 @@ Support FreeIPA in the appliance. In production use it is expected the FreeIPA s
 - For production use with an external FreeIPA server, a random one-time password (OTP) must be generated when adding hosts to FreeIPA (e.g. using `ipa host-add --random ...`). This password should be set as a hostvar `freeipa_host_password`. Initial host enrolment will use this OTP to enrol the host. After this it becomes irrelevant so it does not need to be committed to git. This approach means the appliance does not require the FreeIPA administrator password.
 - For development use with the in-appliance FreeIPA server, `freeipa_host_password` will be automatically generated in memory.
 - The `control` host must define `appliances_state_dir` (on persistent storage). This is used to back-up keytabs to allow FreeIPA clients to automatically re-enrol after e.g. reimaging. Note that:
-- This is implemented when using the skeleton OpenTofu; on the control node `appliances_state_dir` defaults to `/var/lib/state` which is mounted from a volume.
+- This is implemented when using the site OpenTofu; on the control node `appliances_state_dir` defaults to `/var/lib/state` which is mounted from a volume.
 - Nodes are not re-enroled by a [Slurm-driven reimage](../../collections/ansible_collections/stackhpc/slurm_openstack_tools/roles/rebuild/README.md) (as that does not run this role).
 - If both a backed-up keytab and `freeipa_host_password` exist, the former is used.
