Commit 2aad36f

committed
Line edits9
1 parent 203d669 commit 2aad36f

2 files changed

+17
-19
lines changed

learn-pr/azure/intro-to-hpc/includes/3-how-azure-hpc-works.md

Lines changed: 11 additions & 13 deletions
@@ -36,21 +36,19 @@ Run visualization workloads with HPC and Azure Virtual Machines that boost produ
3636

3737
## Mapping Azure VM and Storage products to components in an HPC system
3838

39-
### Azure Compute Virtual Machine Solutions
40-
41-
#### VMs with low latency (HPC SKUs)
39+
### VMs with low latency (HPC SKUs)
4240

4341
The following H-series and N-series VMs are RDMA capable and can communicate over the low latency and high bandwidth InfiniBand network. The RDMA capability over such an interconnect is critical to boost the scalability and performance of distributed-node HPC and AI workloads.
4442

45-
#### VMs with accelerators
43+
### VMs with accelerators
4644

4745
| **CPU** | **GP-GPU** | **VISUAL** |
4846
| :------------------- | :------------------- | :------------------- |
4947
| **HB-series** VMs are optimized for applications that are memory intensive, such as fluid dynamics, explicit finite element analysis, and weather modeling. <br><br>**HC-series** VMs are optimized for applications that are compute intensive, such as molecular dynamics, implicit finite element analysis, and computational chemistry. <br><br><br>| **NC-series** VMs are powered by the NVIDIA Tesla K80 card and the Intel Xeon E5-2690 v3 (Haswell) processor. Users can crunch through data faster by using CUDA for energy exploration applications, crash simulations, ray traced rendering, deep learning, and more. <br><br> **ND-series** VMs are a new addition to the GPU family designed for AI and deep learning workloads. It offers configuration with a secondary low-latency, high-throughput network through RDMA, and InfiniBand connectivity enables running of large-scale training jobs spanning many GPUs.| **NV-series** VMs are made for desktop accelerated applications and virtual desktops where customers are able to visualize their data or simulations. Enables users to visualize their graphics intensive workflows on the NV instances to get a superior graphics capability and additionally run single precision workloads such as encoding and rendering.<br><br><br><br><br><br><br>|
5048

51-
### Azure Storage Solutions
49+
## Azure Storage Solutions
5250

53-
#### Azure Blob Storage
51+
### Azure Blob Storage
5452

5553
Allows massively scalable and secure object storage for cloud-native workloads, archives, data lakes, high-performance computing, and machine learning. It's scalable and optimized for data lakes with comprehensive data management.
5654

@@ -63,7 +61,7 @@ Key design features include:
6361
- Storing data for backup and restore, disaster recovery, and archiving.
6462
- Storing data for analysis by an on-premises or Azure-hosted service.
6563

66-
#### Azure NetApp Files
64+
### Azure NetApp Files
6765

6866
Makes it easy for enterprise line-of-business and storage professionals to migrate and run complex, file-based applications with no code change. It's used as the underlying shared file-storage service in various scenarios, such as lift-and-shift migration of POSIX-compliant Linux and Windows applications, SAP HANA, databases, and enterprise web applications.
6967

@@ -75,7 +73,7 @@ Key benefits include:
7573
- Data protection using Cross-Region replication.
7674
- Advanced Enterprise Data Management features.
7775

78-
#### Azure Files
76+
### Azure Files
7977

8078
Azure Files offers fully managed file shares in the cloud that are accessible via the industry standard Server Message Block (SMB) protocol or Network File System (NFS) protocol.
8179

@@ -101,7 +99,7 @@ Key benefits include:
10199
- Resiliency
102100
- Familiar Programmability
103101

104-
#### Azure Managed Lustre
102+
### Azure Managed Lustre
105103

106104
Azure Managed Lustre service gives you the capability to quickly create an Azure-based Lustre file system for cloud-based high-performance computing jobs. It's a fully managed parallel file system best suited for medium to large HPC workloads. It enables HPC applications in the cloud without breaking application compatibility by providing familiar Lustre parallel file system functionality, behaviors, and performance, securing long-term application investments.
107105

@@ -116,9 +114,9 @@ Key benefits include:
116114
- Supports containerized workloads with AKS.
117115
- Integrates with Azure Blob Storage as a source for importing and exporting data for long-term storage.
118116

119-
#### VM-based file systems
117+
### VM-based file systems
120118

121-
##### Single VM NAS
119+
#### Single VM NAS
122120

123121
Cloud-based Network Attached Storage (NAS) helps you address storage needs in the cloud using the same constructs as an on-premises NAS system. It gives organizations storage that's as performant as their on-premises NAS with the added ability to scale in the cloud, all without having to make major changes to their existing application interfaces and processes.
124122

@@ -129,7 +127,7 @@ Key benefits include:
129127
- Network devices accessing Virtual NAS storage can continue to do so using the same protocols without any reconfiguration.
130128
- Capacity management is also easier since any required storage can be allocated from the underlying virtualization layer.
131129

132-
##### Multi-node Parallel file systems
130+
#### Multi-node Parallel file systems
133131

134132
Parallel file systems distribute block-level storage across multiple networked storage nodes. File data is spread among these nodes, and therefore among multiple storage devices. Individual storage I/O requests are pooled across the storage nodes, which are accessible through a common namespace.
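
The idea of spreading file data across storage nodes can be illustrated with a minimal sketch. This is not how Lustre or any specific parallel file system is implemented; the function names `stripe` and `reassemble`, the round-robin placement, and the fixed stripe size are all assumptions made for illustration.

```python
def stripe(data: bytes, num_nodes: int, stripe_size: int) -> list[list[bytes]]:
    """Split data into fixed-size stripes and place them round-robin on nodes.

    Hypothetical helper: real parallel file systems use their own layout
    policies and metadata services.
    """
    if num_nodes < 1 or stripe_size < 1:
        raise ValueError("num_nodes and stripe_size must be positive")
    nodes: list[list[bytes]] = [[] for _ in range(num_nodes)]
    for offset in range(0, len(data), stripe_size):
        stripe_index = offset // stripe_size
        nodes[stripe_index % num_nodes].append(data[offset:offset + stripe_size])
    return nodes


def reassemble(nodes: list[list[bytes]]) -> bytes:
    """Read the stripes back in their original order (row by row across nodes)."""
    rows = max((len(node) for node in nodes), default=0)
    parts = []
    for row in range(rows):
        for node in nodes:
            if row < len(node):
                parts.append(node[row])
    return b"".join(parts)


layout = stripe(b"abcdefghij", num_nodes=3, stripe_size=2)
restored = reassemble(layout)
```

Because consecutive stripes land on different nodes, a large sequential read can in principle be served by several nodes at once, which is the property the paragraph above describes.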
135133

@@ -142,7 +140,7 @@ The advantages of distributed storage and superior I/O performance make parallel
142140

143141
![Diagram of Azure object storage.](../media/10-azure-object-storage.png)
144142

145-
##### Cray ClusterStor
143+
#### Cray ClusterStor
146144

147145
The Cray ClusterStor in Azure storage system is a high capacity and high throughput storage solution to accelerate your HPC simulations. It's a bare metal appliance that is fully integrated in the Azure fabric and accessible by a large selection of other Azure services. Cray ClusterStor in Azure offers a Lustre-based, single-tenant, bare metal, and fully managed HPC environment in Microsoft Azure.
148146

learn-pr/azure/intro-to-hpc/includes/4-microsoft-hpc-pack.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
One of the key concepts of cloud computing is *Orchestration*. It refers to overseeing the deployment, running and monitoring of all the components of an application in the cluster.
1+
One of the key concepts of cloud computing is *Orchestration*. It refers to overseeing the deployment, running, and monitoring of all the components of an application in the cluster.
22

33
Additionally, an orchestrator can perform other tasks like healing (managing errors), scaling, and logging. Orchestrators like the well-known Kubernetes or Mesos can access cloud cluster resources directly by virtualization.
44

@@ -75,20 +75,20 @@ There are two basic strategies that schedulers can use to determine which job to
7575

7676
- **Shortest Job First:** Based on the execution time declared in the job script, the scheduler estimates the job execution time. The jobs are ranked in the ascending order of the execution time. While short jobs will start after a short waiting time, long running jobs (or at least jobs declared as such) might never actually start.
7777

78-
- **Backfilling:** The scheduler maintains the concept of *First Come, First Serve* without preventing long running jobs from executing. The scheduler runs the job only when the first job in the queue can be executed. If otherwise, the scheduler goes through the rest of the queue to check whether another job can be executed without extending the waiting time of the first job in queue. If it finds such a job, the scheduler runs that job. Small jobs usually encounter short queue times.
78+
In addition, there is the practice of **Backfilling.** The scheduler maintains the concept of *First Come, First Serve* without preventing long running jobs from executing. The scheduler runs the job only when the first job in the queue can be executed. If otherwise, the scheduler goes through the rest of the queue to check whether another job can be executed without extending the waiting time of the first job in queue. If it finds such a job, the scheduler runs that job. Small jobs usually encounter short queue times.
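
The two strategies described above can be sketched in a small simulation. This is a simplified illustration under stated assumptions, not the algorithm of any particular scheduler: the `Job` class, the `backfill_schedule` and `shortest_job_first` functions, and the rule that a backfilled job must finish before the first queued job's reserved start time are all hypothetical choices, and declared runtimes are treated as exact.

```python
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    nodes: int    # nodes requested
    runtime: int  # declared execution time (assumed exact and >= 1)


def shortest_job_first(jobs: list[Job]) -> list[Job]:
    """Rank jobs in ascending order of declared execution time."""
    return sorted(jobs, key=lambda job: job.runtime)


def backfill_schedule(jobs: list[Job], total_nodes: int) -> dict[str, int]:
    """Return each job's start time under first-come-first-served with backfilling."""
    if any(job.nodes > total_nodes for job in jobs):
        raise ValueError("a job requests more nodes than the cluster has")
    queue = list(jobs)
    running: list[tuple[int, int]] = []  # (end_time, nodes)
    starts: dict[str, int] = {}
    now = 0
    while queue:
        # Drop finished jobs and count free nodes.
        running = [(end, n) for end, n in running if end > now]
        free = total_nodes - sum(n for _, n in running)
        # First come, first served: start queue heads while they fit.
        while queue and queue[0].nodes <= free:
            job = queue.pop(0)
            starts[job.name] = now
            running.append((now + job.runtime, job.nodes))
            free -= job.nodes
        if not queue:
            break
        # The head is blocked: find the earliest time it could start.
        head = queue[0]
        available, reservation = free, now
        for end, n in sorted(running):
            available += n
            reservation = end
            if available >= head.nodes:
                break
        # Backfill: run later jobs that fit now and end before the reservation.
        for job in list(queue[1:]):
            if job.nodes <= free and now + job.runtime <= reservation:
                queue.remove(job)
                starts[job.name] = now
                running.append((now + job.runtime, job.nodes))
                free -= job.nodes
        now = min(end for end, _ in running)
    return starts


jobs = [Job("A", 3, 10), Job("B", 4, 5), Job("C", 1, 8), Job("D", 1, 20)]
starts = backfill_schedule(jobs, total_nodes=4)
```

In this example, job C is small and short enough to run alongside A without delaying B, so it is backfilled at time 0. Job D would still be running when B is due to start, so it waits its turn.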
7979

8080
### Workflow management
8181

82-
- **Task pipelining:** Repeated operations such as tool usage and software process task sequence executions can be organized into a pipeline. Automating it can make the overall software and tool usage more efficient. It creates efficiencies by making the task itself faster and reducing the burden upon the knowledge worker for its management.
82+
**Task pipelining:** Repeated operations such as tool usage and software process task sequence executions can be organized into a pipeline. Automating it can make the overall software and tool usage more efficient. It creates efficiencies by making the task itself faster and reducing the burden upon the knowledge worker for its management.
8383

84-
- **Task automation:** Automation can reduce the error rate of a process by eliminating variance in how it's performed. Pipelining and automation of a task can open the door for further process innovations like parallelization and cloud deployment.
84+
**Task automation:** Automation can reduce the error rate of a process by eliminating variance in how it's performed. Pipelining and automation of a task can open the door for further process innovations like parallelization and cloud deployment.
8585

8686
### Tools for workflow management
8787

88-
- **Azure Batch:** Use Azure Batch to run large-scale, parallel, and high-performance computing (HPC) batch jobs efficiently in Azure. Azure Batch creates and manages a pool of compute nodes (virtual machines), installs the applications you want to run, and schedules jobs to run on the nodes. There's no cluster or job scheduler software to install, manage, or scale. Instead, you use Batch APIs and tools, command-line scripts, or the Azure portal to configure, manage, and monitor your jobs.
88+
**Azure Batch:** Use Azure Batch to run large-scale, parallel, and high-performance computing (HPC) batch jobs efficiently in Azure. Azure Batch creates and manages a pool of compute nodes (virtual machines), installs the applications you want to run, and schedules jobs to run on the nodes. There's no cluster or job scheduler software to install, manage, or scale. Instead, you use Batch APIs and tools, command-line scripts, or the Azure portal to configure, manage, and monitor your jobs.
8989

9090
For full details on Azure Batch, including more capabilities and how it works, see [Azure Batch](/azure/batch).
9191

92-
- **Azure CycleCloud:** Azure CycleCloud is an enterprise-friendly tool for orchestrating and managing High Performance Computing (HPC) environments on Azure. With CycleCloud, users can plan infrastructure for HPC systems, deploy familiar HPC schedulers, and automatically scale the infrastructure to run jobs efficiently at any scale. Through CycleCloud, users can create different types of file systems and mount them to the compute cluster nodes to support HPC workloads.
92+
**Azure CycleCloud:** Azure CycleCloud is an enterprise-friendly tool for orchestrating and managing High Performance Computing (HPC) environments on Azure. With CycleCloud, users can plan infrastructure for HPC systems, deploy familiar HPC schedulers, and automatically scale the infrastructure to run jobs efficiently at any scale. Through CycleCloud, users can create different types of file systems and mount them to the compute cluster nodes to support HPC workloads.
9393

9494
For more information on Azure CycleCloud, see [Azure CycleCloud](/azure/cyclecloud).
