@@ -41,7 +41,7 @@ nodes) and JournalNodes (zookeeper node), NameNodes (head nodes) uptime, as well
 metrics specific to certain cluster types, like YARN ResourceManager uptime for
 Spark and Hadoop clusters.
 
-![Ambari dashboard](media/hdinsight-cluster-availability/apache-ambari-dashboard.png)
+![Apache Ambari use dashboard display ](media/hdinsight-cluster-availability/apache-ambari-dashboard.png)
 
 ### Hosts – view individual node status
 
@@ -51,13 +51,13 @@ each node. The green check to the left of each node name indicates all
 components are up on the node. If a component is down on a node, you will see a
 red alert triangle instead of the green check.
 
-![Ambari hosts view](media/hdinsight-cluster-availability/apache-ambari-hosts1.png)
+![HDInsight Apache Ambari hosts view](media/hdinsight-cluster-availability/apache-ambari-hosts1.png)
 
 You can then click on the **name** of a node to view more detailed host metrics
 for that particular node. This view shows the status/availability of each
 individual component.
 
-![Ambari hosts single node view](media/hdinsight-cluster-availability/apache-ambari-hosts-node.png)
+![Apache Ambari hosts single node view](media/hdinsight-cluster-availability/apache-ambari-hosts-node.png)
 
 ### Ambari alerts
 
@@ -66,7 +66,7 @@ certain events. When alerts are triggered, they are shown in the upper-left
 corner of Ambari in a red badge containing the number of alerts. Clicking this
 badge shows a list of current alerts.
 
-![Ambari alerts count](media/hdinsight-cluster-availability/apache-ambari-alerts.png)
+![Apache Ambari current alerts count](media/hdinsight-cluster-availability/apache-ambari-alerts.png)
 
 To view a list of alert definitions and their statuses, click the **Alerts**
 tab, as shown below.
@@ -93,7 +93,7 @@ of the alert as well as the specific criteria that will trigger a ‘warning’
 configuration, click the **Edit** button in the upper-right corner of the
 Configuration box.
 
-![Ambari alert configuration](media/hdinsight-cluster-availability/ambari-alert-configuration.png)
+![Apache Ambari alert configuration](media/hdinsight-cluster-availability/ambari-alert-configuration.png)
 
 Here, you can edit the description and, more importantly, the check interval
 and thresholds for warning or critical alerts.
@@ -182,7 +182,7 @@ in 5 hours (i.e. is presumed to be unavailable).
 From the **Logs** blade, run the **Unavailable computers** sample query by
 clicking **Run** on that query, as shown below.
 
-![Log Analytics workspace logs blade 'unavailable computers' sample query ](media/hdinsight-cluster-availability/portal-unavailable-computers.png)
+![Log Analytics workspace logs blade 'unavailable computers' sample](media/hdinsight-cluster-availability/portal-unavailable-computers.png)
 
 If all nodes are available, this query should return 0 results for now. Click
 **New alert rule** to begin configuring your alert for this query.
@@ -197,7 +197,7 @@ triggered.
 Click the **condition title**, as shown below, to finish configuring the signal
 logic.
 
-![Alert rule condition](media/hdinsight-cluster-availability/portal-condition-title.png)
+![Portal alert create rule condition](media/hdinsight-cluster-availability/portal-condition-title.png)
 
 This will open the **Configure signal logic** blade.
 
@@ -223,7 +223,7 @@ Click **Done** when you are finished configuring the signal logic.
 If you do not already have an existing action group, click **Create New** under
 the **Action Groups** section.
 
-![Alert rule new action group](media/hdinsight-cluster-availability/portal-create-new-action-group.png)
+![Alert rule create new action group](media/hdinsight-cluster-availability/portal-create-new-action-group.png)
 
 This will open the **Add action group** blade. Choose an **Action group name**,
 **Short name**, **Subscription**, and **Resource group.** Under the **Actions**
@@ -240,14 +240,14 @@ recipient, **check** the **Email** box, and type an email address to which you
 want the alert sent. Click **OK** in the **Email/SMS/Push/Voice** blade, then in
 the **Add action group** blade to finish configuring your action group.
 
-![Alert rule add action group](media/hdinsight-cluster-availability/portal-add-action-group.png)
+![Alert rule create add action group](media/hdinsight-cluster-availability/portal-add-action-group.png)
 
 After these blades close, you should see your action group listed under the
 **Action Groups** section. Finally, complete the **Alert Details** section by
 typing an **Alert Rule Name** and **Description** and choosing a **Severity**.
 Click **Create Alert Rule** to finish.
 
-![Create alert rule finish](media/hdinsight-cluster-availability/portal-create-alert-rule-finish.png)
+![Portal create alert rule finish](media/hdinsight-cluster-availability/portal-create-alert-rule-finish.png)
 
 > [!TIP]
 > The ability to specify **Severity** is a powerful tool that can be used
@@ -258,7 +258,7 @@ Critical (Sev 0) in the unlikely event that both head nodes go down.
 When the condition for this alert is met, the alert will fire and you will
 receive an email with the alert details like this:
 
-![Azure Monitor alert email](media/hdinsight-cluster-availability/portal-oms-alert-email.png)
+![Azure Monitor alert email example ](media/hdinsight-cluster-availability/portal-oms-alert-email.png)
 
 You can also view all alerts that have fired, grouped by severity, by going to
 the **Alerts** blade in your **Log Analytics Workspace**.