Commit 99b2f8f

Merge pull request #303420 from jobingeorge-microsoft/main
[GA] Resource Health for AVS
2 parents 45c84fc + 9105fe7 commit 99b2f8f

19 files changed, with 190 additions and 45 deletions

articles/azure-vmware/architecture-identity.md

Lines changed: 1 addition & 1 deletion
@@ -230,7 +230,7 @@ Now that you've covered Azure VMware Solution access and identity concepts, you

 - [Details of each privilege](https://techdocs.broadcom.com/us/en/vmware-cis/vsphere/vsphere/8-0/vsphere-security-8-0/defined-privileges.html)

-- [How Azure VMware Solution monitors and repairs private clouds](azure-vmware-solution-host-remediation.md#host-monitoring-and-remediation)
+- [How Azure VMware Solution monitors and repairs private clouds](azure-vmware-solution-private-cloud-maintenance-best-practices.md#host-monitoring-and-remediation)

 <!-- LINKS - external-->
 [VMware product documentation]: https://techdocs.broadcom.com/us/en/vmware-cis/vsphere/vsphere/8-0/vsphere-security-8-0/defined-privileges.html

articles/azure-vmware/architecture-private-clouds.md

Lines changed: 1 addition & 1 deletion
@@ -146,7 +146,7 @@ Azure VMware Solution continuously monitors the health of both the physical unde
 Now that you learned about Azure VMware Solution private cloud concepts, you might want to read:

 - [Azure VMware Solution networking and interconnectivity concepts](architecture-networking.md)
-- [Azure VMware Solution private cloud maintenance best practices](azure-vmware-solution-host-remediation.md)
+- [Azure VMware Solution private cloud maintenance best practices](azure-vmware-solution-private-cloud-maintenance-best-practices.md)
 - [Azure VMware Solution storage concepts](architecture-storage.md)
 - [Enable an Azure VMware Solution resource](deploy-azure-vmware-solution.md#register-the-microsoftavs-resource-provider)

articles/azure-vmware/azure-vmware-solution-platform-updates.md

Lines changed: 1 addition & 1 deletion
@@ -10,7 +10,7 @@ ms.date: 7/29/2025

 # What's new in Azure VMware Solution

-Microsoft regularly applies important updates to the Azure VMware Solution for new features and software lifecycle management. You should receive a notification through Azure Service Health that includes the timeline of the maintenance. For more information, see [Host maintenance and lifecycle management](azure-vmware-solution-host-remediation.md#host-maintenance-and-lifecycle-management).
+Microsoft regularly applies important updates to the Azure VMware Solution for new features and software lifecycle management. You should receive a notification through Azure Service Health that includes the timeline of the maintenance. For more information, see [Host maintenance and lifecycle management](azure-vmware-solution-private-cloud-maintenance-best-practices.md#host-maintenance-and-lifecycle-management).

 ## July 2025

articles/azure-vmware/azure-vmware-solution-host-remediation.md renamed to articles/azure-vmware/azure-vmware-solution-private-cloud-maintenance-best-practices.md

Lines changed: 8 additions & 7 deletions
@@ -59,13 +59,13 @@ The following actions are always recommended for ensuring host maintenance opera
 ## Alert Codes and Remediation Table
 | Error Code | Error Details | Recommended Action |
 |--------------------|---------------------------------|---------------------|
-| EPC_CDROM_EMULATEMODE | This error is encountered when CD-ROM on the Virtual Machine uses emulate mode, whose ISO image isn't accessible | Follow [this KB article](https://knowledge.broadcom.com/external/article?legacyId=79306) for the removal of any CDROM mounted on customer's workload Virtual Machines in emulate mode or detach ISO. It's recommended to use "Passthrough mode" for mounting any CD-ROM. |
-| EPC_DRSOVERRIDERULE | This error is encountered when there's a Virtual Machine with DRS Override set to "Disabled" mode. | VM shouldn't block vMotion while putting host into maintenance. Set Partially Automated DRS rules for the VM. Refer to [this document](/azure/azure-vmware/create-placement-policy#enable-restrict-vm-movement-for-specific-vms) to know more about VM placement policies. |
-| EPC_SCSIDEVICE_SHARINGMODE | This error is encountered when a Virtual Machine is configured to use a device that prevents a maintenance operation: A device that is a SCSI controller which is engaged in bus-sharing | Follow [this KB article](https://knowledge.broadcom.com/external/article?legacyId=79910) for the removal of any SCSI controller engaged in bus-sharing attached to VMs |
-| EPC_DATASTORE_INACCESSIBLE | This error is encountered when any external Datastore attached to AVS Private Cloud becomes inaccessible | Follow [this article](/azure/azure-vmware/attach-azure-netapp-files-to-azure-vmware-solution-hosts?tabs=azure-portal#performance-best-practices) for the removal of any stale Datastore attached to cluster |
-| EPC_NWADAPTER_STALE | This error is encountered when connected Network interface on the Virtual Machine uses network adapter which becomes inaccessible | Follow [this KB article](https://knowledge.broadcom.com/external/article/318738/troubleshooting-the-migration-compatibil.html) for the removal of any stale N/W adapters attached to Virtual Machines |
-| EPC_SERIAL_PORT | This error is encountered when a Virtual Machine’s serial port is connected to a device that can't be accessed on the destination host. | If you're using an image file (ISO, FLP, and so on), ensure that it's accessible from all ESXi servers on the cluster. Store the files on a data store that is shared between all ESXi servers that participate in vMotion of the virtual machine. Refer to [this KB article](https://knowledge.broadcom.com/external/article/324829/vmotion-fails-with-the-compatibility-err.html) from Broadcom for more information. |
-| EPC_HARDWARE_DEVICE | This error is encountered when a Virtual Machine’s parallel Port/USB Device is connected to a device can't be accessed on the destination host. | If you're using an image file (ISO, FLP, and so on), ensure that it's accessible from all ESXi servers of the cluster. Store the files on a data store that is shared between all ESXi servers that participate in the vMotion of the virtual machine. Refer to [this KB article](https://knowledge.broadcom.com/external/article/324829/vmotion-fails-with-the-compatibility-err.html) from Broadcom for more information. |
+| EPC_CDROM_EMULATEMODE | This error is encountered when CD-ROM on the Virtual Machine uses emulate mode, whose ISO image isn't accessible | Follow [this KB article](https://knowledge.broadcom.com/external/article?legacyId=79306) for the removal of any CDROM mounted on customer's workload Virtual Machines in emulate mode or detach ISO. It's recommended to use "Passthrough mode" for mounting any CD-ROM. |
+| EPC_DRSOVERRIDERULE | This error is encountered when there's a Virtual Machine with DRS Override set to "Disabled" mode. | VM shouldn't block vMotion while putting host into maintenance. Set Partially Automated DRS rules for the VM. Refer to [this document](/azure/azure-vmware/create-placement-policy#enable-restrict-vm-movement-for-specific-vms) to know more about VM placement policies. |
+| EPC_SCSIDEVICE_SHARINGMODE | This error is encountered when a Virtual Machine is configured to use a device that prevents a maintenance operation: A device that is a SCSI controller, which is engaged in bus-sharing | Follow [this KB article](https://knowledge.broadcom.com/external/article?legacyId=79910) for the removal of any SCSI controller engaged in bus-sharing attached to VMs |
+| EPC_DATASTORE_INACCESSIBLE | This error is encountered when any external Datastore attached to AVS Private Cloud becomes inaccessible | Follow [this article](/azure/azure-vmware/attach-azure-netapp-files-to-azure-vmware-solution-hosts?tabs=azure-portal#performance-best-practices) for the removal of any stale Datastore attached to cluster |
+| EPC_NWADAPTER_STALE | This error is encountered when connected Network interface on the Virtual Machine uses network adapter, which becomes inaccessible | Follow [this KB article](https://knowledge.broadcom.com/external/article/318738/troubleshooting-the-migration-compatibil.html) for the removal of any stale N/W adapters attached to Virtual Machines |
+| EPC_SERIAL_PORT | This error is encountered when a Virtual Machine’s serial port is connected to a device that can't be accessed on the destination host. | If you're using an image file (ISO, FLP, and so on), ensure that it's accessible from all ESXi servers on the cluster. Store the files on a data store that is shared between all ESXi servers that participate in vMotion of the virtual machine. Refer to [this KB article](https://knowledge.broadcom.com/external/article/324829/vmotion-fails-with-the-compatibility-err.html) from Broadcom for more information. |
+| EPC_HARDWARE_DEVICE | This error is encountered when a Virtual Machine’s parallel Port/USB Device is connected to a device can't be accessed on the destination host. | If you're using an image file (ISO, FLP, and so on), ensure that it's accessible from all ESXi servers of the cluster. Store the files on a data store that is shared between all ESXi servers that participate in the vMotion of the virtual machine. Refer to [this KB article](https://knowledge.broadcom.com/external/article/324829/vmotion-fails-with-the-compatibility-err.html) from Broadcom for more information. |
 | EPC_INVALIDVM / EPC_ORPHANVM | This error is encountered when there's an orphaned or Invalid VM in the inventory | Ensure all your Virtual Machines are accessible to the vCenter. Refer to [this KB article](https://knowledge.broadcom.com/external/article/312831/virtual-machines-appear-as-invalid-or-or.html) for more information |

@@ -77,6 +77,7 @@ The following actions are always recommended for ensuring host maintenance opera

 Now that you've covered Azure VMware Solution private cloud maintenance best practices, you might want to learn about:

+- [How to configure Resource Health alerts for Azure VMware Solution](configure-azure-monitor-for-resource-health-for-azure-vmware-solution.md)
 - [Azure VMware Solution networking and interconnectivity concepts](architecture-networking.md)
 - [Azure VMware Solution storage concepts](architecture-storage.md)
 - [How to configure Azure Alerts in Azure VMware Solution](configure-alerts-for-azure-vmware-solution.md)
Lines changed: 90 additions & 0 deletions
@@ -0,0 +1,90 @@
+---
+title: Create an Azure Monitor resource health alert rule for Azure VMware Solution
+description: Learn about configuring Azure Monitor alerts for Resource Health for your Azure VMware Solution private cloud.
+ms.topic: how-to
+ms.service: azure-vmware
+ms.date: 08/05/2025
+---
+
+# Create an Azure Monitor resource health alert rule for Azure VMware Solution
+
+
+This article shows you how to create or edit a resource health alert rule in Azure Monitor for your Azure VMware Solution private cloud. To learn more about alerts, see the [alerts overview](/azure/azure-monitor/alerts/alerts-overview).
+
+Alerts triggered by these alert rules contain a payload that uses the [common alert schema](/azure/azure-monitor/alerts/alerts-common-schema).
+
+## Prerequisites
+
+To create or edit a Resource Health alert rule, you need:
+
+- An Azure subscription with an AVS private cloud deployed
+- Contributor (write) access to that subscription
+- Read permission on any action groups you intend to use
+
+## Create Resource Health alert rule
+
+### Access Resource Health blade
+
+1. In the Azure portal, navigate to your AVS private cloud resource.
+1. From the left menu, select **Help** > **Resource Health**.
+:::image type="content" source="media/resource-health/resource-health-left-nav.png" alt-text="Screenshot showing where to find Resource Health for the AVS private cloud.":::
+
+1. Click **Add resource health alert**.
+:::image type="content" source="media/resource-health/resource-health-create-alert.png" alt-text="Screenshot showing add resource health alert button.":::
+
+### Define the alert condition
+
+* On the **Conditions** pane, select values for each of these fields:
+
+| Field | Description |
+|------------------------------|--------------------------------------------------------------------------------------------------------------------------|
+| **Event status** | Select **Updated** as the event status |
+| **Current resource status** | Select the current resource status. Values are **Available**, **Degraded**, and **Unavailable**. |
+| **Previous resource status** | (Optional) Select the previous resource status. Values are **Available**, **Degraded**, **Unavailable**, and **Unknown**.|
+| **Reason type** | Select the causes of the resource health events. Values are **Platform Initiated**, **Unknown**, and **User Initiated**. |
+
+:::image type="content" source="media/resource-health/resource-health-condition.png" alt-text="Screenshot showing condition tabs while creating alerts using Resource Health for the AVS private cloud.":::
+
+### Configure actions
+
+1. Switch to the **Actions** tab and choose **Use action groups**.
+1. Pick an existing action group or click **Create action group** to:
+- Specify the subscription and resource group
+- For **Region**, choose **Global**. Resource Health alerts can only be located in the Global region (which is the default).
+
+- Provide an **Action group name** and **Display name**
+
+:::image type="content" source="media/resource-health/resource-health-new-action-group.png" alt-text="Screenshot showing new action group creation wizard with basics tab.":::
+
+- Under **Notification type**, select channels (Email, SMS, Push, Voice) and add stakeholder contacts
+:::image type="content" source="media/resource-health/resource-health-action-group-notifications.png" alt-text="Screenshot showing new action group creation wizard with notifications tab." lightbox="media/resource-health/resource-health-action-group-notifications.png":::
+
+- Under **Review + create**, review the details and click on **Create**
+
+### Review and create
+
+1. On the **Details** tab, fill in:
+- **Alert rule name**
+- Subscription and resource group for the rule
+- Ensure **Enable alert rule upon creation** is checked
+
+:::image type="content" source="media/resource-health/resource-health-action-group-details.png" alt-text="Screenshot showing new action group creation wizard with details tab.":::
+
+1. Click **Review + create**, then **Create**.
+
+:::image type="content" source="media/resource-health/resource-health-alert-review-create.png" alt-text="Screenshot showing new alert rule creation wizard with review-create tab.":::
+
+## Manage alert rules
+
+- **View triggered alerts**: search for your alert rule name and select the **History** tab. You can click on any of the triggered alerts to know about it.
+
+:::image type="content" source="media/resource-health/alert-rule-history.png" alt-text="Screenshot showing an alert rule history." lightbox="media/resource-health/alert-rule-history.png":::
+
+- **Edit an existing rule**: open its **Overview** page and click **Edit**.
+
+## Next steps
+
+- [Manage your alerts](/azure/azure-monitor/alerts/alerts-manage-alert-instances)
+- Learn more about the [ITSM Connector](/azure/azure-monitor/alerts/itsmc-overview)
+
+
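Reviewer note: the four fields in the **Conditions** table of the new article above (event status, current and previous resource status, reason type) can also be expressed as the condition of an activity log alert rule. The following is a minimal Python sketch of that mapping, not part of this commit. The field names (`category`, `status`, `properties.currentHealthStatus`, `properties.previousHealthStatus`, `properties.cause`) and the value spellings such as `PlatformInitiated` are assumptions based on the general Azure Monitor Resource Health alert schema; verify them against the current template reference before use. The helper names `any_of` and `build_condition` are hypothetical.

```python
import json


def any_of(field, values):
    """Build an anyOf clause that matches `field` against any of `values`."""
    return {"anyOf": [{"field": field, "equals": value} for value in values]}


def build_condition(current, previous=None, reasons=None, event_status="Updated"):
    """Assemble an alert condition from the portal fields.

    Assumption: field names follow the activity log alert schema for
    Resource Health events; they do not come from this commit.
    """
    clauses = [
        {"field": "category", "equals": "ResourceHealth"},
        {"field": "status", "equals": event_status},
        any_of("properties.currentHealthStatus", current),
    ]
    # Previous status is optional in the portal, so it is optional here too.
    if previous:
        clauses.append(any_of("properties.previousHealthStatus", previous))
    if reasons:
        clauses.append(any_of("properties.cause", reasons))
    return {"allOf": clauses}


condition = build_condition(
    current=["Degraded", "Unavailable"],
    previous=["Available"],
    reasons=["PlatformInitiated", "Unknown"],
)
print(json.dumps(condition, indent=2))
```

Leaving `previous` and `reasons` unset yields a broader rule that fires on any transition into the selected current statuses.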

articles/azure-vmware/ecosystem-app-monitoring-solutions.md

Lines changed: 0 additions & 32 deletions
@@ -74,38 +74,6 @@ Unknown means that Resource Health hasn't received information about the resou

 Degraded means that Resource Health detected a loss in performance in either one or more private cloud resources, although it's still available for use. Different resources have their own criteria for when they report that they are degraded.

-
-
-#### Pre-configured Alarms enabled in Azure Resource Health
-
-
-|Alert Name|Remediation Mode|
-| -------- | -------- |
-|Physical Disk Health Alarm |System Remediation|
-|System Board Health Alarm|System Remediation|
-| Memory Health Alarm|System Remediation|
-|Storage Health Alarm|System Remediation|
-|Temperature Health Alarm |System Remediation|
-|Host Connection State Alarm|System Remediation|
-|High Availability (HA) host Status |System Remediation|
-| Network Connectivity Lost Alarm|System Remediation|
-|Virtual Storage (vSAN) Host Disk Error Alarm|System Remediation|
-|Voltage Health Alarm |System Remediation|
-|Processor Health Alarm| System Remediation|
-|Fan Health Alarm|System Remediation|
-|High pNIC error rate detected|System Remediation|
-|iDRAC critical alerts if there are hardware faults (CPU/DIMM/PCI bus/Voltage issues)|System Remediation|
-|vSphere HA restarted a virtual machine|System Remediation|
-|Virtual Storage (vSAN) High Disk Utilization|Customer Intervention Required|
-|Replacement Start and Stop Notification|System Remediation|
-|Repair Service notification to customers (Host reboot and Restart of Management services) |System Remediation|
-|Notification to customer when a Virtual Machine is configured to use an external device that prevents a maintenance operation|Customer Intervention Required|
-| Customer notification when CD-ROM is mounted on the Virtual Machine and its ISO image isn't accessible and blocks maintenance operation|Customer Intervention Required|
-|Notification to customer when an external Datastore mounted becomes inaccessible and will block maintenance operations|Customer Intervention Required|
-|Notification to customer when connected network adapter becomes inaccessible and blocks any maintenance operations|Customer Intervention Required|
-|VMware Network (NSX –T) alarms (Customer notification about License expiration)|Customer Intervention Required|
-
-
 ## Next Steps

 Now that you have configured an alert rule for your Azure VMware Solution private cloud, you can learn more about:

articles/azure-vmware/introduction.md

Lines changed: 2 additions & 2 deletions
@@ -34,7 +34,7 @@ You can deploy new or scale existing private clouds through the Azure portal or

 ## Azure VMware Solution private cloud extension with AV64 node size

-The AV64 is an Azure VMware Solution host SKU, which is available to expand the Azure VMware Solution private cloud built with the existing AV36, AV36P, or AV52 SKU. If you want to deploy AV64 directly, refer to [Azure VMWare Solution in an Azure Virtual Network](native-introduction.md). Use the [Microsoft documentation](https://azure.microsoft.com/explore/global-infrastructure/products-by-region/?products=azure-vmware) to check for availability of the AV64 SKU in the region.
+The AV64 is an Azure VMware Solution host SKU, which is available to expand the Azure VMware Solution private cloud built with the existing AV36, AV36P, or AV52 SKU. If you want to deploy AV64 directly, refer to [Azure VMware Solution in an Azure Virtual Network](native-introduction.md). Use the [Microsoft documentation](https://azure.microsoft.com/explore/global-infrastructure/products-by-region/?products=azure-vmware) to check for availability of the AV64 SKU in the region.

 :::image type="content" source="media/introduction/av64-mixed-sku-topology.png" alt-text="Diagram showing Azure VMware Solution private cloud with AV64 SKU in mixed SKU configuration." border="false" lightbox="media/introduction/av64-mixed-sku-topology.png":::

@@ -138,7 +138,7 @@ Azure VMware Solution doesn't store customer data.

 ## Host and software lifecycle maintenance

-Regular upgrades of the Azure VMware Solution private cloud and VMware software ensure the latest security, stability, and feature sets are running in your private clouds. For more information, see [Host maintenance and lifecycle management](azure-vmware-solution-host-remediation.md#host-maintenance-and-lifecycle-management).
+Regular upgrades of the Azure VMware Solution private cloud and VMware software ensure the latest security, stability, and feature sets are running in your private clouds. For more information, see [Host maintenance and lifecycle management](azure-vmware-solution-private-cloud-maintenance-best-practices.md#host-maintenance-and-lifecycle-management).

 ## Monitoring your private cloud
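Reviewer note: this commit renames `azure-vmware-solution-host-remediation.md` and updates links to it in several articles. A small script can confirm that no Markdown link still points at the old file name. The following is a minimal Python sketch, not part of this commit; the function name `find_stale_links` and the default folder `articles/azure-vmware` are assumptions for illustration, and the regular expression only handles inline links of the form `[text](target)`.

```python
import re
import sys
from pathlib import Path

# Old file name taken from the rename shown in this commit.
OLD_NAME = "azure-vmware-solution-host-remediation.md"

# Captures the target of an inline Markdown link: [text](target)
LINK_TARGET = re.compile(r"\]\(([^)]*)\)")


def find_stale_links(root, old_name=OLD_NAME):
    """Return (path, line number, line) for each link that still names old_name."""
    hits = []
    root = Path(root)
    if not root.is_dir():
        return hits
    for path in sorted(root.rglob("*.md")):
        lines = path.read_text(encoding="utf-8").splitlines()
        for number, line in enumerate(lines, start=1):
            if any(old_name in target for target in LINK_TARGET.findall(line)):
                hits.append((path, number, line.strip()))
    return hits


if __name__ == "__main__":
    # Hypothetical default folder; pass another folder as the first argument.
    folder = sys.argv[1] if len(sys.argv) > 1 else "articles/azure-vmware"
    for path, number, line in find_stale_links(folder):
        print(f"{path}:{number}: {line}")
```

Running it from the repository root after the rename should print nothing if every reference was updated; reference-style links and redirects are out of scope for this sketch.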
