You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: Monitoring/monitor-ontap-services/README.md
+11-11Lines changed: 11 additions & 11 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -95,7 +95,7 @@ To install the program using the CloudFormation template, you will need to do th
95
95
|SubnetIds|The subnet IDs that the Lambda function will be attached to. They must have connectivity to the FSxN file system management endpoint that you wish to monitor. It is recommended to select at least two.|
96
96
|SecurityGroupIds|The security group IDs that the Lambda function will be attached to. The security group must allow outbound traffic over port 443 to the SNS, Secrets Manager, and CloudWatch and S3 AWS service endpoints, as well as the FSxN file system you want to monitor.|
97
97
|SnsTopicArn|The ARN of the SNS topic you want the program to publish alert messages to.|
98
-
|CloudWatchLogGroupName|The name of **an existing** CloudWatch Log Group that the Lambda function can send event messages to. It will create a new Log Stream within the Log Group every day that is unique to this file system so you can use the same Log Group for multiple instances of this program. If this field is left blank, alerts will not be sent to CloudWatch.|
98
+
|CloudWatchLogGroupARN|The ARN of **an existing** CloudWatch Log Group that the Lambda function can send event messages to. It will create a new Log Stream within the Log Group every day that is unique to this file system so you can use the same Log Group for multiple instances of this program. If this field is left blank, alerts will not be sent to CloudWatch.|
99
99
|SecretArn|The ARN of the secret within the AWS Secrets Manager that holds the FSxN file system credentials.|
100
100
|SecretUsernameKey|The name of the key within the secret that holds the username portion of the FSxN file system credentials. The default is 'username'.|
101
101
|SecretPasswordKey|The name of the key within the secret that holds the password portion of the FSxN file system credentials. The default is 'password'.|
@@ -124,16 +124,16 @@ set for the OntapAdminServer parameter.
124
124
After the stack has been created, check the status of the Lambda function to make sure it is
125
125
not in an error state. To find the Lambda function go to the Resources tab of the CloudFormation
126
126
stack and click on the "Physical ID" of the Lambda function. This should bring you to the Lambda service in the AWS
127
-
console. Once there, click on the "Monitor" tab to see if the function has been invoked. Locate the
127
+
console. Once there, click on the "Monitor" tab to see if the function has been invoked. Note that it will take
128
+
at least the configured iteration time before the function is invoked for the first time. Locate the
128
129
"Error count and success rate(%)" chart, which is usually found at the top right corner of the "Monitor" dashboard.
129
-
Within the "CheckInterval" number of minutes there should be at least one dot on that chart. Note that initially
130
-
the chart is slow to reflect any status so you might have to be patient. Continue to press the "refresh"
131
-
button (the icon with a circle on it) every minute or so to update the status. Once you see a dot on the chart, when you hover your mouse
132
-
over it, you should see the "success rate" and "number of errors." The success rate should be 100% and the number
133
-
of errors should be 0. If it is not, then scroll down to the CloudWatch Logs section and click on the most recent
134
-
log stream. This will show you the output of the Lambda function. If there are any errors, they will be displayed
135
-
there. If you can't figure out what is causing an error, then please create an issue in this repository and someone
136
-
will help you.
130
+
After the "CheckInterval" number of minutes there should be at least one dot on that chart.
131
+
Hover your mouse over the dot and you should see the "success rate" and "number of errors."
132
+
The success rate should be 100% and the number of errors should be 0. If it is not, then scroll up a little bit and
133
+
click on "View CloudWatch Logs" link. Once on this page, click on the first LogStream and review any output.
134
+
If there are any errors, they will be displayed there. If you can't figure out what is causing an error,
135
+
please create an issue on the [Issues](https://github.com/NetApp/FSx-ONTAP-samples-scripts/issues) section
136
+
in this repository and someone will help you.
137
137
138
138
---
139
139
@@ -324,7 +324,7 @@ Each rule should be an object with one, or more, of the following keys:
324
324
|failover|Boolean|If 'true' the program will send an alert if the FSxN cluster is running on its standby node. If it is set to `false`, it will not report on failover status.|
325
325
|networkInterfaces|Boolean|If 'true' the program will send an alert if any of the network interfaces are down. If it is set to `false`, it will not report on any network interfaces that are down.|
326
326
327
-
###### Matching condition schema for EMS Messages (ems)
327
+
###### Matching condition schema for EMS Events (ems)
328
328
Each rule should be an object with three keys, with an optional 4th key:
Description: "The DNS name, or IP address, of the management endpoint of the FSxN file system to be monitored."
@@ -62,7 +149,7 @@ Parameters:
62
149
Type: String
63
150
64
151
subNetIds:
65
-
Description: "The subnet IDs where you want to deploy the Lambda function. Must have connectivity to the FSxN file system to be monitored."
152
+
Description: "The subnet IDs where you want to deploy the Lambda function. Must have connectivity to the FSxN file system to be monitored. Recommended to have at least two. Also recommended to be in a private subnet."
66
153
Type: "List<AWS::EC2::Subnet::Id>"
67
154
68
155
securityGroupIds:
@@ -100,7 +187,7 @@ Parameters:
100
187
AllowedValues: ["true", "false"]
101
188
102
189
implementWatchdogAsLambda:
103
-
Description: "Use a Lambda function to publish to the SNS topic so it can reside in a different region. Only needed if you are creating the CloudWatch alarm and the SNS topic is in a different region."
190
+
Description: "Use a Lambda function to publish to the SNS topic so the topic can reside in a different region. Only needed if you are creating the CloudWatch alarm and the SNS topic is in a different region."
104
191
Type: String
105
192
Default: "false"
106
193
AllowedValues: ["true", "false"]
@@ -111,25 +198,25 @@ Parameters:
111
198
Default: ""
112
199
113
200
createSecretsManagerEndpoint:
114
-
Description: "Create a Secrets Manager endpoint."
201
+
Description: "Set to 'true' if you want to create a Secrets Manager endpoint."
115
202
Type: String
116
203
Default: "false"
117
204
AllowedValues: ["true", "false"]
118
205
119
206
createSNSEndpoint:
120
-
Description: "Create an SNS endpoint."
207
+
Description: "Set to 'true if you want to create an SNS endpoint."
121
208
Type: String
122
209
Default: "false"
123
210
AllowedValues: ["true", "false"]
124
211
125
212
createCloudWatchLogsEndpoint:
126
-
Description: "Create a CloudWatch logs endpoint."
213
+
Description: "Set to 'true if you want to create a CloudWatch logs endpoint."
127
214
Type: String
128
215
Default: "false"
129
216
AllowedValues: ["true", "false"]
130
217
131
218
createS3Endpoint:
132
-
Description: "Create an S3 endpoint."
219
+
Description: "Set to 'true if you want to create an S3 endpoint."
133
220
Type: String
134
221
Default: "false"
135
222
AllowedValues: ["true", "false"]
@@ -145,7 +232,7 @@ Parameters:
145
232
Default: ""
146
233
147
234
endpointSecurityGroupIds:
148
-
Description: "The security group IDs, comma separated list, to associate with the SNS, SecretsManager and/or CloudWatch Logs endpoints. Must allow traffic from from the Lambda function over TCP port 443. This parameter is only needed if you are creating the SNS, SecretsManager, or CloudWatch Logs endpoint."
235
+
Description: "The security group IDs, comma separated list, to associate with the SNS, SecretsManager and/or CloudWatch Logs endpoints. Must allow inbound traffic from from the Lambda function over TCP port 443. This parameter is only needed if you are creating the SNS, SecretsManager, or CloudWatch Logs endpoint."
149
236
Type: CommaDelimitedList
150
237
Default: ""
151
238
@@ -626,8 +713,8 @@ Resources:
626
713
# "matching conditions." It is intended to be run as a Lambda function, but
conditions["services"][getServiceIndex("snapmirror", conditions)]["rules"].append({"Healthy": False}) # This is what it matches on, so it is interesting when the health is false.
conditions["services"][getServiceIndex("snapmirror", conditions)]["rules"].append({"Healthy": False}) # This is what it matches on, so it is interesting when the health is false.
0 commit comments