Skip to content

Commit bc8efb4

Browse files
authored
Update and rename articles/cyclecloud/how-to/slurm-cloud-bursting-setup.md to articles/cyclecloud/how-to/bursting/slurm-cloud-bursting-setup.md
1 parent 591d853 commit bc8efb4

File tree

2 files changed

+72
-213
lines changed

2 files changed

+72
-213
lines changed
Lines changed: 72 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,72 @@
1+
---
2+
title: Cloud Bursting Using Azure CycleCloud and Slurm
3+
description: Learn how to configure Cloud bursting using Azure CycleCloud and Slurm.
4+
author: vinil-v
5+
ms.date: 04/17/2025
6+
ms.author: padmalathas
7+
---
8+
9+
# What is Cloud Bursting?
10+
11+
Cloud bursting is a configuration in cloud computing that allows an organization to handle peaks in IT demand by using a combination of private and public clouds. When the resources in a private cloud reach their maximum capacity, the overflow traffic is directed to a public cloud to ensure there is no interruption in services. This setup provides flexibility and cost savings, as you only pay for the additional resources when there is a demand for them.
12+
13+
For example, an application can run on a private cloud and "burst" to a public cloud only when necessary to meet peak demands. This approach helps avoid the costs associated with maintaining extra capacity that is not always in use
14+
15+
Cloud bursting can be used in various scenarios, such as enabling on-premises workloads to be sent to the cloud for processing, known as hybrid HPC (High-Performance Computing). This allows users to optimize their resource utilization and cost efficiency while accessing the scalability and flexibility of the cloud.
16+
17+
## Overview
18+
19+
This document offers a step-by-step guide on installing and configuring a Slurm scheduler to burst computing resources into the cloud using Azure CycleCloud. It explains how to create a Hybrid HPC (High-Performance Computing) environment by extending on-premises Slurm clusters into Azure, allowing for seamless access to scalable and flexible cloud computing resources. The guide provides a practical example of optimizing compute capacity by integrating local infrastructure with cloud-based solutions.
20+
21+
22+
## Requirements to Setup Slurm Cloud Bursting Using CycleCloud on Azure
23+
24+
## Azure subscription account
25+
You must obtain an Azure subscription or be assigned as an Owner role of the subscription.
26+
27+
* To create an Azure subscription, go to the [Create a Subscription](/azure/cost-management-billing/manage/create-subscription#create-a-subscription) documentation.
28+
* To access an existing subscription, go to the [Azure portal](https://portal.azure.com/).
29+
30+
## Network infrastructure
31+
If you intend to create a Slurm cluster entirely within Azure, you must deploy both the head node(s) and the CycleCloud compute nodes within a single Azure Virtual Network (VNET).
32+
33+
![Slurm cluster](../images/slurm-cloud-burst/diagram.png)
34+
35+
However, if your goal is to establish a hybrid HPC cluster with the head node(s) located on your on-premises corporate network and the compute nodes in Azure, you will need to set up a [Site-to-Site](/azure/vpn-gateway/tutorial-site-to-site-portal) VPN or an [ExpressRoute](/azure/expressroute/) connection between your on-premises network and the Azure VNET. The head node(s) must have the capability to connect to Azure services over the Internet. You may need to coordinate with your network administrator to configure this connectivity.
36+
37+
## Network Ports and Security
38+
The following NSG rules must be configured for successful communication between Master node, CycleCloud server and Compute nodes.
39+
40+
41+
| **Service** | **Port** | **Protocol** | **Direction** | **Purpose** | **Requirement** |
42+
|------------------------------------|-----------------|--------------|------------------|------------------------------------------------------------------------|---------------------------------------------------------------------------------|
43+
| **SSH (Secure Shell)** | 22 | TCP | Inbound/Outbound | Secure command-line access to the Slurm Master node | Open on both on-premises firewall and Azure NSGs |
44+
| **Slurm Control (slurmctld, slurmd)** | 6817, 6818 | TCP | Inbound/Outbound | Communication between Slurm Master and compute nodes | Open in on-premises firewall and Azure NSGs |
45+
| **Munge Authentication Service** | 4065 | TCP | Inbound/Outbound | Authentication between Slurm Master and compute nodes | Open on both on-premises network and Azure NSGs |
46+
| **CycleCloud Service** | 443 | TCP | Outbound | Communication between Slurm Master node and Azure CycleCloud | Allow outbound connections to Azure CycleCloud services from the Slurm Master node |
47+
| **NFS ports** | 2049 | TCP | Inbound/Outbound | Shared filesystem access between Master node and Azure CycleCloud | Open on both on-premises network and Azure NSGs |
48+
| **LDAP port** (Optional) | 389 | TCP | Inbound/Outbound | Centralized authentication mechanism for user management | Open on both on-premises network and Azure NSGs
49+
50+
Please refer [Slurm Network Configuration Guide](https://slurm.schedmd.com/network.html)
51+
52+
## Software Requirement
53+
54+
- **OS Version**: AlmaLinux release 8.x or Ubuntu 22.04
55+
- **CycleCloud Version**: 8.x or later
56+
- **CycleCloud-Slurm Project Version**: 3.0.x
57+
58+
## NFS File server
59+
A shared file system between the external Slurm Scheduler node and the CycleCloud cluster. You can use Azure NetApp Files, Azure Files, NFS, or other methods to mount the same file system on both sides. In this example, we are using a Scheduler VM as an NFS server.
60+
61+
## Centralized User management system (LDAP or AD)
62+
In HPC environments, maintaining consistent user IDs (UIDs) and group IDs (GIDs) across the cluster is critical for seamless user access and resource management. A centralized user management system, such as LDAP or Active Directory (AD), ensures that UIDs and GIDs are synchronized across all compute nodes and storage systems.
63+
64+
> !Important
65+
>
66+
> For more information, see the blog post about how to setup Slurm Cloud Bursting Using CycleCloud on Azure [https://techcommunity.microsoft.com/blog/azurehighperformancecomputingblog/setting-up-slurm-cloud-bursting-using-cyclecloud-on-azure/4140922].
67+
68+
### Next Steps
69+
70+
* [GitHub repo - cyclecloud-slurm](https://github.com/Azure/cyclecloud-slurm/tree/master)
71+
* [Azure CycleCloud Documentation](../overview.md)
72+
* [Slurm documentation](https://slurm.schedmd.com/documentation.html)

articles/cyclecloud/how-to/slurm-cloud-bursting-setup.md

Lines changed: 0 additions & 213 deletions
This file was deleted.

0 commit comments

Comments
 (0)