Skip to content

Commit 1bb4ae8

Browse files
revert changes
1 parent 4cf3716 commit 1bb4ae8

File tree

1 file changed

+56
-28
lines changed

1 file changed

+56
-28
lines changed

articles/azure-vmware/ecosystem-app-monitoring-solutions.md

Lines changed: 56 additions & 28 deletions
Original file line numberDiff line numberDiff line change
@@ -17,64 +17,92 @@ Microsoft recommends [Application Insights](/azure/azure-monitor/app/app-insight
1717

1818
Learn how modern monitoring with Azure Monitor can transform your business by reviewing the [product overview, features, getting started guide and more](https://azure.microsoft.com/services/monitor).
1919

20-
### Azure Resource Health for Azure VMware Solution Private Cloud
20+
### Azure Resource Health for Azure VMware Solution Private Cloud (Public preview)
2121

2222
In this article, you learn how Azure Resource Health helps you diagnose and get support for service problems that affect your Private Cloud resources. Azure Resource Health reports on the current and past health of your Private Cloud Infrastructure resources and provides you with a personalized dashboard of the health of the infrastructure resources. Azure Resource Health allows you to report on historical events and can identify every time a service is unavailable and if Service Level Agreement (SLA) is violated.
2323

24+
#### Preview Enablement
25+
26+
You are required to register yourself for the feature preview under _Preview Features_ of Azure VMware Solution in Azure portal. Customers should first register themselves to ***"Microsoft.AVS/ResourceHealth"*** preview flag from Azure portal and once registered, all the preconfigured alerts related to Host replacement, vCenter, and other critical alarms will start to surface in the Resource Health of Azure VMware Solution (AVS) User Interface (UI).
27+
2428
#### Benefits of enabling Resource Health
2529

2630
- Resource Health feature enablement adds significant value to your monitoring capabilities. You get notified about unplanned maintenance that took place in your private cloud infrastructure.
2731

2832
- Resource Health gives you a personalized dashboard of the health of your resources. Resource Health shows all the time that your resources have been unavailable which makes it easy for you to check if SLA was violated.
2933

30-
- A group of critical alerts are enabled which notifies you about host replacements, storage critical alarms and also about the network health of your private cloud.
34+
- For the Public Preview, a group of critical alerts are enabled which notifies you about Host replacements, storage critical alarms and also about the Network health of your private cloud.
3135

3236
- The alerts are updated to have all the necessary information for better reporting and triage purposes.
3337

34-
- Resource Health uses Azure Monitor action groups that allow you to configure Email/SMS/Webhook/ITSM and get notified via communication method of your choice.
38+
- Resource Health uses Azure Action groups that allow you to configure Email/SMS/Webhook/ITSM and get notified via communication method of your choice.
3539

36-
- The health of your private cloud infrastructure reflects following statuses
40+
- Once Enabled the health of your private cloud infrastructure reflects following statuses
3741

3842

39-
- **Available**: Available means that there are no events detected that affect the health of the resource. In cases where the resource recovered from unplanned downtime during the last 24 hours, you see a "Recently resolved" notification
43+
- Available
4044

41-
- **Unavailable**: Unavailable means that the service detected an ongoing platform or nonplatform event that affects the health of the resource.
45+
- Unavailable
46+
47+
- Unknown
48+
49+
- Degraded
50+
51+
52+
#### Available
53+
54+
Available means that there are no events detected that affect the health of the resource. In cases where the resource recovered from unplanned downtime during the last 24 hours, you see a "Recently resolved" notification
55+
56+
57+
58+
#### Unavailable
59+
60+
Unavailable means that the service detected an ongoing platform or nonplatform event that affects the health of the resource.
61+
62+
#### Unknown
63+
64+
Unknown means that Resource Health hasn't received information about the resource for more than 10 minutes. You may see this status under two different conditions:
65+
66+
- Your subscription is not enabled for Resource Health metrics, and you need to register yourself for the preview.
67+
68+
- If the resource is running as expected, the status of the resource will change to Available after a few minutes. If you experience problems with the resource, the Unknown health status might mean that an event in the private cloud is affecting the resource.
69+
70+
4271

43-
- **Unknown**: If the resource is running as expected, the status of the resource will change to Available after a few minutes. If you experience problems with the resource, the Unknown health status might mean that an event in the private cloud is affecting the resource.
72+
#### Degraded
4473

45-
- **Degraded**: Degraded means that Resource Health detected a loss in performance in either one or more private cloud resources, although it's still available for use. Different resources have their own criteria for when they report that they are degraded.
74+
Degraded means that Resource Health detected a loss in performance in either one or more private cloud resources, although it's still available for use. Different resources have their own criteria for when they report that they are degraded.
4675

4776

4877

4978
#### Pre-configured Alarms enabled in Azure Resource Health
5079

51-
The following table shows the list of all onboarded alerts. **Customer intervention Required** indicates that you need to take action to remediate the alert. **System Remediation** alarms will be actioned by Microsoft to be remediated.
5280

5381
|Alert Name|Remediation Mode|
5482
| -------- | -------- |
5583
|Physical Disk Health Alarm |System Remediation|
56-
|System Board Health Alarm |System Remediation|
57-
|Memory Health Alarm |System Remediation|
58-
|Storage Health Alarm |System Remediation|
84+
|System Board Health Alarm|System Remediation|
85+
| Memory Health Alarm|System Remediation|
86+
|Storage Health Alarm|System Remediation|
5987
|Temperature Health Alarm |System Remediation|
60-
|Host Connection State Alarm |System Remediation|
88+
|Host Connection State Alarm|System Remediation|
6189
|High Availability (HA) host Status |System Remediation|
62-
|Network Connectivity Lost Alarm |System Remediation|
63-
|Virtual Storage (vSAN) Host Disk Error Alarm |System Remediation|
90+
| Network Connectivity Lost Alarm|System Remediation|
91+
|Virtual Storage (vSAN) Host Disk Error Alarm|System Remediation|
6492
|Voltage Health Alarm |System Remediation|
65-
|Processor Health Alarm |System Remediation|
66-
|Fan Health Alarm |System Remediation|
67-
|High pNIC error rate detected |System Remediation|
68-
|iDRAC critical alerts if there are hardware faults (CPU/DIMM/PCI bus/Voltage issues) |System Remediation|
69-
|vSphere HA restarted a virtual machine |System Remediation|
70-
|Virtual Storage (vSAN) High Disk Utilization |Customer Intervention Required|
71-
|Replacement Start and Stop Notification |System Remediation|
93+
|Processor Health Alarm| System Remediation|
94+
|Fan Health Alarm|System Remediation|
95+
|High pNIC error rate detected|System Remediation|
96+
|iDRAC critical alerts if there are hardware faults (CPU/DIMM/PCI bus/Voltage issues)|System Remediation|
97+
|vSphere HA restarted a virtual machine|System Remediation|
98+
|Virtual Storage (vSAN) High Disk Utilization|Customer Intervention Required|
99+
|Replacement Start and Stop Notification|System Remediation|
72100
|Repair Service notification to customers (Host reboot and Restart of Management services) |System Remediation|
73-
|Notification to customer when a Virtual Machine is configured to use an external device that prevents a maintenance operation |Customer Intervention Required|
74-
|Customer notification when CD-ROM is mounted on the Virtual Machine and its ISO image isn't accessible and blocks maintenance operation |Customer Intervention Required|
75-
|Notification to customer when an external Datastore mounted becomes inaccessible and will block maintenance operations |Customer Intervention Required|
76-
|Notification to customer when connected network adapter becomes inaccessible and blocks any maintenance operations |Customer Intervention Required|
77-
|VMware Network (NSX–T) alarms (Customer notification about License expiration) |Customer Intervention Required|
101+
|Notification to customer when a Virtual Machine is configured to use an external device that prevents a maintenance operation|Customer Intervention Required|
102+
| Customer notification when CD-ROM is mounted on the Virtual Machine and its ISO image isn't accessible and blocks maintenance operation|Customer Intervention Required|
103+
|Notification to customer when an external Datastore mounted becomes inaccessible and will block maintenance operations|Customer Intervention Required|
104+
|Notification to customer when connected network adapter becomes inaccessible and blocks any maintenance operations|Customer Intervention Required|
105+
|VMware Network (NSX –T) alarms (Customer notification about License expiration)|Customer Intervention Required|
78106

79107

80108
## Next Steps
@@ -85,7 +113,7 @@ Now that you have configured an alert rule for your Azure VMware Solution privat
85113

86114
- [Azure Monitor](/azure/azure-monitor/overview)
87115

88-
- [Azure Monitor Action Groups](/azure/azure-monitor/alerts/action-groups)
116+
- [Azure Action Groups](/azure/azure-monitor/alerts/action-groups)
89117

90118
You can also continue with one of the other Azure VMware Solution how-to [guides](/azure/azure-vmware/deploy-azure-vmware-solution?tabs=azure-portal)
91119

0 commit comments

Comments
 (0)