Monitoring/monitor-ontap-services/cloudformation.yaml
85 additions & 49 deletions
Original file line number
Diff line number
Diff line change
@@ -18,16 +18,17 @@ Metadata:
18
18
- secretPasswordKey
19
19
- checkInterval
20
20
- createWatchdogAlarm
21
+
- implementWatchdogAsLambda
22
+
- watchdogRoleArn
23
+
- LambdaRoleArn
24
+
- SchedulerRoleArn
21
25
- createSecretsManagerEndpoint
22
26
- createSNSEndpoint
23
27
- createCloudWatchLogsEndpoint
24
28
- createS3Endpoint
25
29
- routeTableIds
26
30
- vpcId
27
31
- endpointSecurityGroupIds
28
-
- LambdaRoleArn
29
-
- SchedulerRoleArn
30
-
- watchdogRoleArn
31
32
- Label:
32
33
default: "Alert Parameters"
33
34
Parameters:
@@ -94,13 +95,19 @@ Parameters:
94
95
Default: "password"
95
96
96
97
createWatchdogAlarm:
97
-
Description: "Create a CloudWatch alarm to monitor the Lambda function. It will alert you if the function fails to run successfully."
98
+
Description: "Create a CloudWatch alarm to monitor the Lambda function. It will alert you if the monitoring Lambda function fails to run successfully."
98
99
Type: String
99
100
Default: "true"
100
101
AllowedValues: ["true", "false"]
101
102
103
+
implementWatchdogAsLambda:
104
+
Description: "Use a Lambda function to publish to the SNS topic so it can reside in a different region. Only needed if you are creating the CloudWatch alarm and the SNS topic is in a different region."
105
+
Type: String
106
+
Default: "false"
107
+
AllowedValues: ["true", "false"]
108
+
102
109
watchdogRoleArn:
103
-
Description: "The ARN of the role to use for the Lambda function that will publish messages to the SNS topic if the monitoring function doesn't run properly. This is only needed if you are having the CloudWatch alarm created and if you want to provide an existing role, otherwisean appropriate one will be created for you."
110
+
Description: "The ARN of the role to assign to the Lambda function that will publish messages to the SNS topic if the monitoring function doesn't run properly. This is only needed if you are having the CloudWatch alarm created, implemented as a Lambda function and you want to provide an existing role, otherwise, if needed, an appropriate role will be created for you."
104
111
Type: String
105
112
Default: ""
106
113
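Read together, the descriptions of `createWatchdogAlarm`, `implementWatchdogAsLambda`, and `watchdogRoleArn` imply a three-way condition for when the stack has to create a watchdog role itself. A minimal Python sketch of that logic follows, assuming the parameters arrive as the string values listed in `AllowedValues`. The function name `needs_generated_watchdog_role` is hypothetical and does not appear in the template; the template itself may express this differently.

```python
def needs_generated_watchdog_role(create_watchdog_alarm: str,
                                  implement_watchdog_as_lambda: str,
                                  watchdog_role_arn: str) -> bool:
    """Return True when a role would have to be created for the watchdog Lambda.

    Hypothetical helper: it mirrors the parameter descriptions, not the
    template's actual condition definitions.
    """
    alarm_requested = create_watchdog_alarm == "true"
    lambda_requested = implement_watchdog_as_lambda == "true"
    role_supplied = watchdog_role_arn != ""  # the parameter defaults to ""
    return alarm_requested and lambda_requested and not role_supplied


# Alarm as a Lambda with no role supplied: a role is needed.
print(needs_generated_watchdog_role("true", "true", ""))
# Alarm without the Lambda implementation: no watchdog role is needed.
print(needs_generated_watchdog_role("true", "false", ""))
```

Keeping the three inputs as strings matches how the parameters are declared (`Type: String` with `"true"`/`"false"` values), so no boolean conversion is assumed.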
@@ -192,7 +199,7 @@ Parameters:
192
199
Description: "Alert when a SnapMirror update hasn't transferred any new data in the specified seconds. Set to 0 to disable this alert."
193
200
Type: Number
194
201
Default: 600
195
-
MinValue: 60
202
+
MinValue: 0
196
203
197
204
snapMirrorHealthAlert:
198
205
Description: "Alert when the SnapMirror relationship is not healthy."
@@ -201,34 +208,40 @@ Parameters:
201
208
AllowedValues: ["true", "false"]
202
209
203
210
fileSystemUtilizationWarnAlert:
204
-
Description: "Alert when the file system utilization exceeds this threshold in percentage."
211
+
Description: "Alert when the file system utilization exceeds this threshold in percentage. Set to 0 to disable this alert."
205
212
Type: Number
206
213
Default: 80
214
+
MinValue: 0
207
215
208
216
fileSystemUtilizationCriticalAlert:
209
-
Description: "Alert when the file system utilization exceeds this threshold in percentage."
217
+
Description: "Alert when the file system utilization exceeds this threshold in percentage. Set to 0 to disable this alert."
210
218
Type: Number
211
219
Default: 90
220
+
MinValue: 0
212
221
213
222
volumeUtilizationWarnAlert:
214
-
Description: "Alert when a volume utilization exceeds this threshold in percentage."
223
+
Description: "Alert when a volume utilization exceeds this threshold in percentage. Set to 0 to disable this alert."
215
224
Type: Number
216
225
Default: 90
226
+
MinValue: 0
217
227
218
228
volumeUtilizationCriticalAlert:
219
-
Description: "Alert when a volume utilization exceeds this threshold in percentage."
229
+
Description: "Alert when a volume utilization exceeds this threshold in percentage. Set to 0 to disable this alert."
220
230
Type: Number
221
231
Default: 95
232
+
MinValue: 0
222
233
223
234
volumeFileUtilizationWarnAlert:
224
-
Description: "Alert when a volume inode utilization exceeds this threshold in percentage."
235
+
Description: "Alert when a volume inode utilization exceeds this threshold in percentage. Set to 0 to disable this alert."
225
236
Type: Number
226
237
Default: 90
238
+
MinValue: 0
227
239
228
240
volumeFileUtilizationCriticalAlert:
229
-
Description: "Alert when a volume inode utilization exceeds this threshold in percentage."
241
+
Description: "Alert when a volume inode utilization exceeds this threshold in percentage. Set to 0 to disable this alert."
230
242
Type: Number
231
243
Default: 95
244
+
MinValue: 0
232
245
233
246
volumeOfflineAlert:
234
247
Description: "Alert when a volume goes offline."
@@ -237,19 +250,22 @@ Parameters:
237
250
Default: "true"
238
251
239
252
softQuotaUtilizationAlert:
240
-
Description: "Alert when a soft quota exceeds this threshold in percentage."
253
+
Description: "Alert when a soft quota exceeds this threshold in percentage. Set to 0 to disable this alert."
241
254
Type: Number
242
255
Default: 100
256
+
MinValue: 0
243
257
244
258
hardQuotaUtilizationAlert:
245
-
Description: "Alert when a hard quota exceeds this threshold in percentage."
259
+
Description: "Alert when a hard quota exceeds this threshold in percentage. Set to 0 to disable this alert."
246
260
Type: Number
247
261
Default: 80
262
+
MinValue: 0
248
263
249
264
inodesQuotaUtilizationAlert:
250
-
Description: "Alert when an inode quota exceeds this threshold in percentage."
265
+
Description: "Alert when an inode quota exceeds this threshold in percentage. Set to 0 to disable this alert."
conditions["services"][getServiceIndex("snapmirror", conditions)]["rules"].append({"Healthy": False}) # This is what it matches on, so it is interesting when the health is false.
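The line above appends a rule to the `snapmirror` entry of `conditions["services"]`, and the parameter descriptions state that a threshold of 0 disables an alert. The sketch below shows one way those two ideas could fit together. It assumes each service is a dict with a `"rules"` list (as the line implies) and a `"name"` key (an assumption). `get_service_index` is a stand-in for the repository's `getServiceIndex`, whose real implementation is not shown here, and `add_threshold_rule` and the rule key `exampleThreshold` are hypothetical.

```python
def get_service_index(name, conditions):
    """Stand-in for getServiceIndex: locate a service by its assumed "name" key."""
    for index, service in enumerate(conditions["services"]):
        if service.get("name") == name:
            return index
    raise KeyError(f"service {name!r} not found")


def add_threshold_rule(conditions, service, rule_key, threshold):
    """Append {rule_key: threshold} unless the threshold is 0 (alert disabled).

    Returns True if a rule was added, False if the alert was skipped.
    """
    if threshold == 0:
        return False
    rules = conditions["services"][get_service_index(service, conditions)]["rules"]
    rules.append({rule_key: threshold})
    return True


conditions = {"services": [{"name": "snapmirror", "rules": []}]}
add_threshold_rule(conditions, "snapmirror", "exampleThreshold", 600)  # added
add_threshold_rule(conditions, "snapmirror", "exampleThreshold", 0)    # skipped
print(conditions["services"][0]["rules"])
```

Skipping the rule entirely when the value is 0 is what makes `MinValue: 0` a meaningful lower bound: 0 then means "no alert", not "alert at 0 percent".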