Skip to content

Commit 9e52cef

Browse files
Merge branch 'main' of https://github.com/MicrosoftDocs/azure-stack-docs-pr into amlfs-templates
2 parents 76e23a4 + 38b980e commit 9e52cef

File tree

6 files changed

+166
-9
lines changed

6 files changed

+166
-9
lines changed

.openpublishing.publish.config.json

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -92,6 +92,12 @@
9292
"branch": "main",
9393
"branch_mapping": {}
9494
},
95+
{
96+
"path_to_root": "azure-reference-other-repo",
97+
"url": "https://github.com/MicrosoftDocs/azure-reference-other-pr",
98+
"branch": "main",
99+
"branch_mapping": {}
100+
},
95101
{
96102
"path_to_root": "quickstart-templates",
97103
"url": "https://github.com/Azure/azure-quickstart-templates",

azure-managed-lustre/TOC.yml

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -44,6 +44,12 @@
4444
href: configure-network-security-group.md
4545
- name: Use customer-managed encryption keys
4646
href: customer-managed-encryption-keys.md
47+
- name: Monitoring metrics and logs
48+
items:
49+
- name: Monitor a file system
50+
href: monitor-file-system.md
51+
- name: Monitoring reference for metrics and logs
52+
href: monitor-file-system-reference.md
4753
- name: Availability and disaster recovery
4854
items:
4955
- name: Recover from a regional outage
Lines changed: 62 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,62 @@
1+
---
2+
title: Monitoring data reference for Azure Managed Lustre
3+
description: This article contains important reference material you need when you monitor Azure Managed Lustre.
4+
ms.date: 08/16/2024
5+
ms.custom: horz-monitor
6+
ms.topic: reference
7+
author: pauljewellmsft
8+
ms.author: pauljewell
9+
ms.service: azure-managed-lustre
10+
---
11+
12+
# Azure Managed Lustre monitoring data reference
13+
14+
[!INCLUDE [horz-monitor-ref-intro](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-ref-intro.md)]
15+
16+
See [Monitor Azure Managed Lustre](monitor-file-system.md) for details on the data you can collect for Azure Managed Lustre and how to use it.
17+
18+
[!INCLUDE [horz-monitor-ref-metrics-intro](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-ref-metrics-intro.md)]
19+
20+
### Supported metrics for Microsoft.StorageCache/amlFilesystems
21+
22+
The following table lists the metrics available for the Microsoft.StorageCache/amlFilesystems resource type.
23+
24+
[!INCLUDE [horz-monitor-ref-metrics-tableheader](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-ref-metrics-tableheader.md)]
25+
26+
[!INCLUDE [Microsoft.StorageCache/amlFilesystems](~/../azure-reference-other-repo/azure-monitor-ref/supported-metrics/includes/microsoft-storagecache-amlfilesystems-metrics-include.md)]
27+
28+
> [!NOTE]
29+
> The metric `OSTBytesUsed` represents the total capacity consumed on the file system, including all metadata and overhead associated with the files. The value for `OSTBytesUsed` might be greater than the result of running `lfs df` on the file system, as `df` output for **Used** only attempts to capture the data that the end user has placed on the file system.
30+
31+
[!INCLUDE [horz-monitor-ref-metrics-dimensions-intro](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-ref-metrics-dimensions-intro.md)]
32+
33+
[!INCLUDE [horz-monitor-ref-metrics-dimensions](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-ref-metrics-dimensions.md)]
34+
35+
### Dimensions specific to Azure Managed Lustre
36+
37+
| Dimension name | Description |
38+
| --- | --- |
39+
| `ostnum` | Object Storage Target (OST) index number |
40+
| `mdtnum` | Metadata Target (MDT) index number |
41+
| `operation` | Type of operation performed |
42+
43+
### Supported resource logs for Microsoft.StorageCache/amlFilesystems
44+
45+
[!INCLUDE [Microsoft.StorageCache/amlFilesystems](~/../azure-reference-other-repo/azure-monitor-ref/supported-logs/includes/microsoft-storagecache-amlfilesystems-logs-include.md)]
46+
47+
### Azure Monitor Logs tables
48+
49+
This section lists the Azure Monitor Logs tables relevant to this service, which are available for query by Log Analytics using Kusto queries.
50+
51+
- [AFSAuditLogs](/azure/azure-monitor/reference/tables/AFSAuditLogs)
52+
- [AzureActivity](/azure/azure-monitor/reference/tables/azureactivity)
53+
- [AzureMetrics](/azure/azure-monitor/reference/tables/azuremetrics)
54+
55+
[!INCLUDE [horz-monitor-ref-activity-log](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-ref-activity-log.md)]
56+
57+
- [Microsoft.StorageCache permissions](/azure/role-based-access-control/permissions/storage#microsoftstoragecache)
58+
59+
## Related content
60+
61+
- See [Monitor Azure Managed Lustre](monitor-file-system.md) for a description of monitoring Azure Managed Lustre.
62+
- See [Monitor Azure resources with Azure Monitor](/azure/azure-monitor/essentials/monitor-azure-resource) for details on monitoring Azure resources.
Lines changed: 80 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,80 @@
1+
---
2+
title: Monitor Azure Managed Lustre
3+
description: Start here to learn how to monitor Azure Managed Lustre.
4+
ms.date: 08/16/2024
5+
ms.custom: horz-monitor
6+
ms.topic: conceptual
7+
author: pauljewellmsft
8+
ms.author: pauljewell
9+
ms.service: azure-managed-lustre
10+
---
11+
12+
# Monitor Azure Managed Lustre
13+
14+
[!INCLUDE [horz-monitor-intro](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-intro.md)]
15+
16+
[!INCLUDE [horz-monitor-resource-types](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-resource-types.md)]
17+
18+
For more information about the resource types for Azure Managed Lustre, see [Azure Managed Lustre monitoring data reference](monitor-file-system-reference.md).
19+
20+
[!INCLUDE [horz-monitor-data-storage](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-data-storage.md)]
21+
22+
[!INCLUDE [horz-monitor-platform-metrics](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-platform-metrics.md)]
23+
24+
For a list of available metrics for Azure Managed Lustre, see [Azure Managed Lustre monitoring data reference](monitor-file-system-reference.md#metrics).
25+
26+
[!INCLUDE [horz-monitor-resource-logs](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-resource-logs.md)]
27+
28+
For the available resource log categories, their associated Log Analytics tables, and the log schemas for Azure Managed Lustre, see [Azure Managed Lustre monitoring data reference](monitor-file-system-reference.md#supported-resource-logs-for-microsoftstoragecacheamlfilesystems).
29+
30+
[!INCLUDE [horz-monitor-activity-log](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-activity-log.md)]
31+
32+
[!INCLUDE [horz-monitor-analyze-data](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-analyze-data.md)]
33+
34+
[!INCLUDE [horz-monitor-external-tools](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-external-tools.md)]
35+
36+
[!INCLUDE [horz-monitor-kusto-queries](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-kusto-queries.md)]
37+
38+
This section shows queries that you can enter in the **Log search** bar to help you monitor your Managed Lustre file system.
39+
40+
- **Aggregate operations query**: List all the UnsuspendAmlFilesystem requests for a given time duration.
41+
42+
```kusto
43+
AFSAuditLogs
44+
// The OperationName below can be replaced by obtain other operations such as "RebootAmlFilesystemNode" or "AmlFSRefreshHSMToken".
45+
| where OperationName has "UnsuspendAmlFilesystem"
46+
| project TimeGenerated, _ResourceId, ActivityId, ResultSignature, ResultDescription, Location
47+
| sort by TimeGenerated asc
48+
| limit 100
49+
```
50+
51+
- **Unauthorized requests query**: Count of failed AMLFilesystems requests due to unauthorized access.
52+
53+
```kusto
54+
AFSAuditLogs
55+
// 401 below could be replaced by other result signatures to obtain different operation results.
56+
// For example, 'ResultSignature == 202' to obtain accepted requests.
57+
| where ResultSignature == 401
58+
| summarize count() by _ResourceId, OperationName
59+
```
60+
61+
[!INCLUDE [horz-monitor-alerts](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-alerts.md)]
62+
63+
### Azure Managed Lustre alert rules
64+
65+
The following table lists some suggested alert rules for Azure Managed Lustre. The alerts in this table are just examples. You can set alerts for any metric, log entry, or activity log entry listed in the [Azure Managed Lustre monitoring data reference](monitor-file-system-reference.md).
66+
67+
| Alert type | Condition | Description |
68+
| --- | --- | --- |
69+
| Metric | (**OST Bytes Used** / **OST Bytes Total**) > 0.85 | Storage capacity usage for the file system has exceeded 85% of total|
70+
| Metric | (**OST Files Used** / **OST Files Total**) > 0.85 | Number of files in the file system has exceeded 85% of total |
71+
72+
> [!NOTE]
73+
> The threshold value of 85% is used as an example to show an alert before the file system reaches full capacity. You can adjust the threshold based on your requirements.
74+
75+
[!INCLUDE [horz-monitor-advisor-recommendations](~/../azure-stack/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-advisor-recommendations.md)]
76+
77+
## Related content
78+
79+
- See [Azure Managed Lustre monitoring data reference](monitor-file-system-reference.md) for a reference of the metrics, logs, and other important values created for Azure Managed Lustre.
80+
- See [Monitoring Azure resources with Azure Monitor](/azure/azure-monitor/essentials/monitor-azure-resource) for general details on monitoring Azure resources.

azure-stack/hci/upgrade/about-upgrades-23h2.md

Lines changed: 10 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -21,10 +21,10 @@ Throughout this article, we refer to Azure Stack HCI, version 23H2 as the *new*
2121

2222
Azure Stack HCI, version 23H2 is the latest version of the Azure Stack HCI solution. This version integrates the Azure Arc infrastructure that provisions and manages the workloads such as Arc VMs, Azure Kubernetes Services, and Azure Virtual Desktop. For more information, see [What's new in Azure Stack HCI, version 23H2](../whats-new.md#features-and-improvements-in-2311).
2323

24-
With version 23H2, Azure Stack HCI evolved from a cloud connected operating system (OS) to an Arc enabled solution. The OS forms the base layer of this solution, with the Arc and the Orchestrator (also known as the Lifecycle Manager) components layered on top. These components are packaged together into a solution that follows an [Infrastructure as code (IaC)](/devops/deliver/what-is-infrastructure-as-code) model.
24+
With version 23H2, Azure Stack HCI evolved from a cloud-connected operating system (OS) to an Arc-enabled solution. The OS forms the base layer of this solution, with the Arc and the Orchestrator (also known as the Lifecycle Manager) components layered on top. These components are packaged together into a solution that follows an [Infrastructure as code (IaC)](/devops/deliver/what-is-infrastructure-as-code) model.
2525

26-
- This IaC model takes a set of input parameters that are specific to each customer and the environment.
27-
- The lifecycle manager then orchestrates the desired state across all the layers to meet the desired state and the version.
26+
- This IaC model takes a set of input parameters that are specific to each customer and environment.
27+
- The lifecycle manager then orchestrates the desired state across all the layers to meet the desired state and version.
2828

2929
The following diagram illustrates the components of an Azure Stack HCI, version 23H2 system:
3030

@@ -34,15 +34,15 @@ The following diagram illustrates the components of an Azure Stack HCI, version
3434

3535
An upgrade is a whole new version of software that represents a significant change or major improvement. An update, on the other hand, is a process of applying a set of changes to the software to improve its performance, security, or stability.
3636

37-
The Azure Stack HCI, version 23H2, is a whole new version of the solution with a multitude of new capabilities. To move from Azure Stack HCI, version 22H2 to version 23H2, you need to upgrade your existing cluster. On the other hand, to ensure that you have the most recent features and security improvements for your current version of Azure Stack HCI, you would need to update your existing cluster.
37+
Azure Stack HCI, version 23H2, is a whole new version of the solution with a multitude of new capabilities. To move from Azure Stack HCI, version 22H2 to version 23H2, you need to upgrade your existing cluster. On the other hand, to ensure that you have the most recent features and security improvements for your current version of Azure Stack HCI, you would need to update your existing cluster.
3838

3939
## High-level steps for Azure Stack HCI upgrade
4040

4141
To upgrade your Azure Stack HCI from an old version, follow these high-level steps:
4242

4343
1. Upgrade the *old* OS to the *new* OS using one of the following methods:
44-
- [Via the PowerShell (recommended)](./upgrade-22h2-to-23h2-powershell.md).
45-
- [Via the Windows Admin Center](./upgrade-22h2-to-23h2-windows-admin-center.md).
44+
- [Via PowerShell (recommended)](./upgrade-22h2-to-23h2-powershell.md).
45+
- [Via Windows Admin Center](./upgrade-22h2-to-23h2-windows-admin-center.md).
4646
- [Via other manual methods](./upgrade-22h2-to-23h2-other-methods.md).
4747

4848
1. Perform post-OS upgrade tasks.
@@ -55,9 +55,12 @@ The following diagram illustrates the Azure Stack HCI upgrade process:
5555

5656
:::image type="content" source="./media/about-upgrades-23h2/update-os-to-23h2-and-apply-the-solution-update.png" alt-text="Diagram that illustrates the two steps to update the Azure Stack HCI OS and then apply the solution update." lightbox="./media/about-upgrades-23h2/update-os-to-23h2-and-apply-the-solution-update.png":::
5757

58+
> [!IMPORTANT]
59+
> If you were running AKS workloads on your Azure Stack HCI cluster, you must remove Azure Kubernetes Service and all the settings from AKS enabled by Azure Arc before you apply the solution upgrade. Kubernetes versions are incompatible between Azure Stack HCI, version 22H2, and version 23H2. Additionally, Preview versions of Arc VMs can't be updated.
60+
5861
## Next steps
5962

6063
Choose one of the following options to upgrade your Azure Stack HCI, version 22H2 to Azure Stack HCI, version 23H2:
6164
- [Use PowerShell](./upgrade-22h2-to-23h2-powershell.md).
6265
- [Use Windows Admin Center](./upgrade-22h2-to-23h2-windows-admin-center.md).
63-
- [Use other methods](./upgrade-22h2-to-23h2-other-methods.md).
66+
- [Use other methods](./upgrade-22h2-to-23h2-other-methods.md).

azure-stack/hci/upgrade/post-upgrade-steps.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -35,7 +35,7 @@ Before you begin, make sure that:
3535

3636
Follow these steps on your client to connect to one of the servers of your Azure Stack HCI cluster.
3737

38-
1. Run PowerShell as administrator on the client that you're using to connect to your cluster.
38+
1. Run PowerShell as Administrator on the client that you're using to connect to your cluster.
3939
2. Open a remote PowerShell session to a server on your Azure Stack HCI cluster. Run the following command and provide the credentials of your server when prompted:
4040

4141
```powershell
@@ -140,4 +140,4 @@ You're now ready to apply the solution upgrade.
140140
## Next steps
141141
142142
- [Optional: Install and enable Network ATC on your cluster.](./install-enable-network-atc.md)
143-
- [Learn how to assess the solution upgrade readiness.](./validate-solution-upgrade-readiness.md)
143+
- [Learn how to assess the solution upgrade readiness.](./validate-solution-upgrade-readiness.md)

0 commit comments

Comments
 (0)