Skip to content

Commit 4a6b229

Browse files
Improve ReadME for autoscaling
1 parent 7b82997 commit 4a6b229

File tree

1 file changed

+7
-1
lines changed

1 file changed

+7
-1
lines changed

README.md

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -44,12 +44,18 @@ TODO
4444

4545
# Autoscaling
4646

47-
The autoscaling will work in a “cluster per job” approach. This means that for job waiting in the queue, we will launch new cluster specifically for that job. Autoscaling will also take care of spinning down clusters. By default, a cluster is left Idle for 10 minutes before shutting down. Autoscaling is achieved with a cronjob to be able to quickly switch from one scheduler to the next.
47+
The autoscaling will work in a “cluster per job” approach. This means that for job waiting in the queue, we will launch new cluster specifically for that job. Autoscaling will also take care of spinning down clusters. By default, a cluster is left Idle for 10 minutes before shutting down. Autoscaling is achieved with a cronjob to be able to quickly switch from one scheduler to the next.
48+
49+
Initial cluster deployed through the stack will never be spun down.
4850

4951
There is a configuration file at `/opt/oci-hpc/autoscaling/queues.conf` with an example at `/opt/oci-hpc/autoscaling/queues.conf.example`to show how to add multiple queues and multiple instance types. Examples are included for HPC, GPU or Flex VMs.
5052

5153
You will be able to use the instance type name as a feature in the job definition to make sure it runs/create the right kind of node.
5254

55+
You can only have one default instance-type per queue and one default queue. To submit to a non default queue, either add this line to the SBATCH file: `#SBATCH --partition compute` or in the command line: `sbatch -p queuename job.sh`
56+
57+
The key word `permanent` allows will spin up clusters but not delete them untill it is set to false. It is not needed to reconfigure slurm after you change that value.
58+
5359
After a modification of the `/opt/oci-hpc/autoscaling/queues.conf`, you need to run
5460
`/opt/oci-hpc/autoscaling/slurm_config.sh`
5561

0 commit comments

Comments
 (0)