Skip to content

Commit aa369b1

Browse files
authored
Merge pull request #17071 from ManikaDhiman/md-mertics-dashboard
New content for Performance Metrics dashboard
2 parents 6fdf4f2 + 695db8a commit aa369b1

9 files changed

+119
-3
lines changed
255 KB
Loading
261 KB
Loading
282 KB
Loading
287 KB
Loading
527 KB
Loading
569 KB
Loading
578 KB
Loading
476 KB
Loading

azure-local/manage/monitor-cluster-with-metrics.md

Lines changed: 119 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -6,14 +6,14 @@ ms.author: alkohli
66
ms.reviewer: saniyaislam
77
ms.topic: how-to
88
ms.service: azure-local
9-
ms.date: 10/15/2024
9+
ms.date: 03/20/2025
1010
---
1111

1212
# Monitor Azure Local with Azure Monitor Metrics
1313

1414
[!INCLUDE [applies-to](../includes/hci-applies-to-23h2.md)]
1515

16-
This article describes how to monitor your Azure Local system with [Azure Monitor Metrics](/azure/azure-monitor/essentials/data-platform-metrics). It also provides a comprehensive list of metrics collected for compute, storage, and network resources in Azure Local.
16+
This article describes how to monitor your Azure Local system with [Azure Monitor Metrics](/azure/azure-monitor/essentials/data-platform-metrics). It also describes the Performance Metrics dashboard and lists metrics collected for compute, storage, and network resources in Azure Local.
1717

1818
When you have critical applications and business processes that rely on Azure resources, it's important to monitor those resources for their availability, performance, and operation. The integration of Azure Monitor Metrics with Azure Local enables you to store numeric data from your clusters in a dedicated time-series database. This database is automatically created for each Azure subscription. Use [metrics explorer](/azure/azure-monitor/essentials/tutorial-metrics) to analyze data from your Azure Local system and assess its health and utilization.
1919

@@ -25,7 +25,7 @@ Take a few moments to watch the video walkthrough on creating metric charts in m
2525

2626
Here are the benefits of using Metrics for Azure Local:
2727

28-
- **No additional cost**. These metrics are standard, out-of-the-box features that are automatically collected and provided to you at no extra cost.
28+
- **No extra cost**. These metrics are standard, out-of-the-box features that are automatically collected and provided to you at no extra cost.
2929

3030
- **Near real-time insights**. You have the capability to observe out-of-the-box metrics and correlate trends using near real-time data.  
3131

@@ -87,6 +87,122 @@ Follow these steps to analyze metrics for a specific Azure Local cluster in the
8787

8888
To create alerts, select the **Alerts** option and set up alerts as described in [Create metric alerts](./setup-metric-alerts.md#create-metrics-alerts).
8989

90+
## Monitor performance metrics
91+
92+
The performance metrics dashboard provides a comprehensive view of performance metrics across all Azure Local systems within a subscription or for a specific system. It collects over 60 metrics at no additional cost via the `AzureEdgeTelemetryAndDiagnostics` extension. These metrics form the basis of the charts displayed in the dashboard, offering insights into infrastructure performance and health.
93+
94+
There are two types of performance metrics dashboards:
95+
96+
- **Single Cluster Performance Metrics**, which offers drilled-down views for a specific system, split by unique logical unit number (LUN).
97+
98+
- **Multi Cluster Performance Metrics**, which monitors multiple systems at scale and provides detailed view of performance metrics across all systems within a subscription.
99+
100+
### Benefits
101+
102+
- Requires no extra setup to view your data, provided the [`AzureEdgeTelemetryAndDiagnostics`](../concepts/telemetry-and-diagnostics-overview.md) extension is installed.
103+
104+
- Consolidates all available metrics into a single view, eliminating the need to select individual metrics.
105+
106+
- Built using Azure Workbooks, highly customizable and user-friendly.
107+
108+
- Includes multiple filters, such as a time filter for viewing data up to the past 30 days.
109+
110+
- Allows viewing metrics for multiple clusters across various subscriptions, with filters for subscription, resource groups, or clusters. For a specific cluster, a drilled-down view of metrics at the node, volume, and netadapter levels is available.
111+
112+
### Access the performance metrics dashboard
113+
114+
You can access the performance metrics dashboard through Azure Monitor or the Azure Local system.
115+
116+
#### Access the dashboard via Azure Monitor
117+
118+
To access the dashboard via Azure Monitor, follow these steps:
119+
120+
1. Navigate to Azure Monitor and select **Workbooks**.
121+
122+
1. Under the **Azure Local** section, select the **Multi Cluster Performance Metrics** workbook.
123+
124+
:::image type="content" source="media/monitor-cluster-with-metrics/access-via-azure-monitor.png" alt-text="Screenshot of the Workbooks gallery when accessed via Azure Monitor." lightbox="media/monitor-cluster-with-metrics/access-via-azure-monitor.png":::
125+
126+
#### Access the dashboard via the Azure Local system
127+
128+
To access the dashboard via the Azure Local system, follow these steps:
129+
130+
1. In the Azure portal, go to your Azure Local system.
131+
132+
1. Under **Monitoring**, select **Workbooks**.
133+
134+
1. Select one of the following workbooks based on whether you want to view performance metrics for a single cluster or multiple clusters:
135+
136+
- **Single Cluster Performance Metrics**
137+
138+
- **Multi Cluster Performance Metrics**
139+
140+
:::image type="content" source="media/monitor-cluster-with-metrics/access-via-system.png" alt-text="Screenshot of the Workbooks gallery when accessed via Azure Local system." lightbox="media/monitor-cluster-with-metrics/access-via-system.png":::
141+
142+
### View the dashboard charts
143+
144+
The performance metrics dashboard is organized into three tabs, each focusing on different aspects of system performance. Select the relevant tab to view the metrics related to the selected system performance category.
145+
146+
### [Storage Performance](#tab/storage-performance)
147+
148+
Monitoring storage performance helps optimize storage utilization, allocation, and configuration according to resources and business needs.
149+
150+
The **Storage Performance** tab presents three types of metrics:
151+
152+
- **Volume Usage Metrics.** This section displays metrics related to volume usage, such as disk read/write operations per second, disk read/write bytes per second, and volume latency.
153+
154+
Here's a sample screenshot of Volume Usage Metrics:
155+
156+
:::image type="content" source="media/monitor-cluster-with-metrics/storage-performance-volume-usage.png" alt-text="Screenshot of the Storage Performance dashboard showing the Volume Usage metrics." lightbox="media/monitor-cluster-with-metrics/storage-performance-volume-usage.png":::
157+
158+
- **VHD Metrics.** This section displays metrics related to VHD, such as VHD read/write operations per second, VHD read/write bytes per second, VHD latency, and VHD current and maximum size.
159+
160+
Here's a sample screenshot of VHD Metrics:
161+
162+
:::image type="content" source="media/monitor-cluster-with-metrics/storage-performance-vhd.png" alt-text="Screenshot of the Storage Performance dashboard showing the VHD metrics." lightbox="media/monitor-cluster-with-metrics/storage-performance-vhd.png":::
163+
164+
- **Physical Disk Metrics.** This section displays metrics related to physical disk read/write operations per second, physical disk read/write bytes per second, latency read and write, total capacity size, and capacity size used.
165+
166+
Here's a sample screenshot of Physical Disk Metrics:
167+
168+
:::image type="content" source="media/monitor-cluster-with-metrics/storage-performance-physical-disk.png" alt-text="Screenshot of the Storage Performance dashboard showing the Physical Disk metrics." lightbox="media/monitor-cluster-with-metrics/storage-performance-physical-disk.png":::
169+
170+
In a **Single Cluster Performance Metrics** dashboard, you can drill down further to view metrics split by LUN, which is a unique identifier for storage resources.
171+
172+
### [Network Performance](#tab/network-performance)
173+
174+
Monitoring network performance metrics ensure network availability for users, help identify and troubleshoot problems, and improve network performance.
175+
176+
This section provides network performance metrics, including netadapter bytes sent/received per second, RDMA inbound/outbound bytes per second, and VM netadapter bytes sent/received per second.
177+
178+
Here's a sample screenshot of Network Metrics:
179+
180+
:::image type="content" source="media/monitor-cluster-with-metrics/network-performance-network.png" alt-text="Screenshot of the Network Performance dashboard showing the Network metrics." lightbox="media/monitor-cluster-with-metrics/network-performance-network.png":::
181+
182+
In a **Single Cluster Performance Metrics** dashboard, you can drill down Network metrics to see performance for each netadapter available on different servers within a cluster by its unique LUN.
183+
184+
### [Compute](#tab/compute)
185+
186+
Monitoring compute metrics, including memory and CPU, ensures proper resource allocation and utilization. It identifies usage patterns for appropriate actions, helps detect issues, optimizes system performance, and ensures smooth operation of resources.
187+
188+
The **Compute** tab presents two types of metrics:
189+
190+
- **Memory Metrics.** This section provides information on memory used, available, percentage usage for host and guest, VM memory available, used, memory assigned, pressure, maximum, minimum, startup, and more.
191+
192+
Here's a sample screenshot of Memory Metrics:
193+
194+
:::image type="content" source="media/monitor-cluster-with-metrics/compute-memory.png" alt-text="Screenshot of the Compute Performance dashboard showing the Memory metrics." lightbox="media/monitor-cluster-with-metrics/compute-memory.png":::
195+
196+
- **CPU Metrics.** This section offers metrics, such as Total CPU percentage, host vs guest CPU percentage, and VM CPU percentage.
197+
198+
Here's a sample screenshot of CPU Metrics:
199+
200+
:::image type="content" source="media/monitor-cluster-with-metrics/compute-cpu.png" alt-text="Screenshot of the Compute Performance dashboard showing the CPU metrics." lightbox="media/monitor-cluster-with-metrics/compute-cpu.png":::
201+
202+
In a **Single Cluster Performance Metrics** dashboard, you can view Memory and CPU metrics for each server within a cluster.
203+
204+
---
205+
90206
## What metrics are collected?
91207

92208
This section lists the platform metrics that are collected for the Azure Local cluster, the aggregation types, and the dimensions available for each metric. For more information about metric dimensions, see [Multi-dimensional metrics](/azure/azure-monitor/essentials/data-platform-metrics#multi-dimensional-metrics).

0 commit comments

Comments
 (0)