Commit 179ed4b

Make the resource consumption more obvious

Author: Matt Pryor
1 parent e770e34 commit 179ed4b

4 files changed: 64 additions and 28 deletions

group_vars/openstack.yml

Lines changed: 0 additions & 3 deletions
@@ -17,9 +17,6 @@ terraform_project_path: "{{ playbook_dir }}/terraform"
 terraform_state: "{{ cluster_state | default('present') }}"
 cluster_ssh_user: rocky
 
-# Set the size of the state volume to metrics_db_maximum_size + 10
-state_volume_size: "{{ metrics_db_maximum_size + 10 }}"
-
 # Provision a single "standard" compute partition using the supplied
 # node count and flavor
 openhpc_slurm_partitions:

group_vars/prometheus.yml

Lines changed: 2 additions & 1 deletion
@@ -8,4 +8,5 @@ openondemand_address: "{{ hostvars[groups['openondemand'].0].api_address if 'ope
 prometheus_scrape_configs: "{{ prometheus_scrape_configs_default + (openondemand_scrape_configs if ( 'openondemand' in groups ) else [] ) }}"
 
 # Set Prometheus storage retention size
-prometheus_storage_retention_size: "{{ metrics_db_maximum_size }}GB"
+# We reserve 10GB of the state volume for cluster state, the rest is for metrics
+prometheus_storage_retention_size: "{{ state_volume_size - 10 }}GB"
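
Note: the retention size is now derived from the volume size rather than the other way round. A minimal sketch of that arithmetic follows, assuming `state_volume_size` reaches Ansible as an integer number of GB. The standalone playbook, the example value of 50 and the guard task are illustrative assumptions and are not part of this commit.

```yaml
# Hypothetical standalone playbook, not part of this commit.
- hosts: localhost
  gather_facts: false
  vars:
    state_volume_size: 50   # example value in GB; the UI minimum is 20
    # Same expression as in group_vars/prometheus.yml
    prometheus_storage_retention_size: "{{ state_volume_size - 10 }}GB"
  tasks:
    - name: Check that space remains for metrics after the 10GB reservation
      ansible.builtin.assert:
        that:
          - state_volume_size | int >= 20
        fail_msg: "state_volume_size must be at least 20 (GB)"

    - name: Show the derived retention size
      ansible.builtin.debug:
        var: prometheus_storage_retention_size
```

With the example value of 50, the expression evaluates to `40GB`: 10GB is held back for cluster state and the remaining 40GB bounds the Prometheus database.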

ui-meta/slurm-infra-fast-volume-type.yml

Lines changed: 31 additions & 12 deletions
@@ -12,6 +12,24 @@ parameters:
     kind: cloud.ip
     immutable: true
 
+  - name: login_flavor
+    label: Login node size
+    description: The size to use for the login node.
+    kind: cloud.size
+    immutable: true
+    options:
+      min_ram: 2048
+      min_disk: 20
+
+  - name: control_flavor
+    label: Control node size
+    description: The size to use for the control node.
+    kind: cloud.size
+    immutable: true
+    options:
+      min_ram: 2048
+      min_disk: 20
+
   - name: compute_count
     label: Compute node count
     description: The number of compute nodes in the cluster.
@@ -23,16 +41,16 @@ parameters:
   - name: compute_flavor
     label: Compute node size
     description: The size to use for the compute node.
-    kind: "cloud.size"
+    kind: cloud.size
     immutable: true
     options:
       min_ram: 2048
       min_disk: 20
 
   - name: home_volume_size
     label: Home volume size (GB)
-    description: The size of the cloud volume to use for home directories
-    kind: integer
+    description: The size of the cloud volume to use for home directories.
+    kind: cloud.volume_size
     immutable: true
     options:
       min: 10
@@ -51,19 +69,20 @@ parameters:
     options:
       checkboxLabel: Put home directories on high-performance storage?
 
-  - name: metrics_db_maximum_size
-    label: Metrics database size (GB)
+  - name: state_volume_size
+    label: State volume size (GB)
     description: |
+      The size of the state volume, used to hold and persist important files and data. Of
+      this volume, 10GB is set aside for cluster state and the remaining space is used
+      to store cluster metrics.
+
       The oldest metrics records in the [Prometheus](https://prometheus.io/) database will be
-      discarded to ensure that the database does not grow larger than this size.
-
-      **A cloud volume of this size +10GB will be created to hold and persist the metrics
-      database and important Slurm files.**
-    kind: integer
+      discarded to ensure that the database does not grow larger than this volume.
+    kind: cloud.volume_size
     immutable: true
     options:
-      min: 10
-      default: 10
+      min: 20
+      default: 20
 
   - name: cluster_run_validation
     label: Post-configuration validation

ui-meta/slurm-infra.yml

Lines changed: 31 additions & 12 deletions
@@ -12,6 +12,24 @@ parameters:
     kind: cloud.ip
     immutable: true
 
+  - name: login_flavor
+    label: Login node size
+    description: The size to use for the login node.
+    kind: cloud.size
+    immutable: true
+    options:
+      min_ram: 2048
+      min_disk: 20
+
+  - name: control_flavor
+    label: Control node size
+    description: The size to use for the control node.
+    kind: cloud.size
+    immutable: true
+    options:
+      min_ram: 2048
+      min_disk: 20
+
   - name: compute_count
     label: Compute node count
     description: The number of compute nodes in the cluster.
@@ -23,34 +41,35 @@ parameters:
   - name: compute_flavor
     label: Compute node size
     description: The size to use for the compute node.
-    kind: "cloud.size"
+    kind: cloud.size
     immutable: true
     options:
       min_ram: 2048
       min_disk: 20
 
   - name: home_volume_size
     label: Home volume size (GB)
-    description: The size of the cloud volume to use for home directories
-    kind: integer
+    description: The size of the cloud volume to use for home directories.
+    kind: cloud.volume_size
     immutable: true
     options:
       min: 10
       default: 100
 
-  - name: metrics_db_maximum_size
-    label: Metrics database size (GB)
+  - name: state_volume_size
+    label: State volume size (GB)
     description: |
+      The size of the state volume, used to hold and persist important files and data. Of
+      this volume, 10GB is set aside for cluster state and the remaining space is used
+      to store cluster metrics.
+
       The oldest metrics records in the [Prometheus](https://prometheus.io/) database will be
-      discarded to ensure that the database does not grow larger than this size.
-
-      **A cloud volume of this size +10GB will be created to hold and persist the metrics
-      database and important Slurm files.**
-    kind: integer
+      discarded to ensure that the database does not grow larger than this volume.
+    kind: cloud.volume_size
     immutable: true
     options:
-      min: 10
-      default: 10
+      min: 20
+      default: 20
 
   - name: cluster_run_validation
     label: Post-configuration validation
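
Note: for orientation, the values a user supplies for the parameters touched by this commit might look like the sketch below. It assumes that each UI parameter name is passed through as a variable of the same name, which the diff confirms only for `state_volume_size`. The flavor identifiers are placeholders, not real values.

```yaml
# Hypothetical parameter values; flavor identifiers are placeholders.
login_flavor: "<flavor-id-for-login-node>"       # new: cloud.size, min_ram 2048, min_disk 20
control_flavor: "<flavor-id-for-control-node>"   # new: cloud.size, min_ram 2048, min_disk 20
compute_flavor: "<flavor-id-for-compute-nodes>"
home_volume_size: 100   # GB; the default shown in ui-meta/slurm-infra.yml
state_volume_size: 20   # GB; new minimum and default (10GB state + 10GB metrics)
```

The old default of `metrics_db_maximum_size: 10` produced a 20GB state volume, so the new default of `state_volume_size: 20` keeps the same volume size and the same 10GB metrics limit.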
