Skip to content

Commit c896cf7

Browse files
authored
Update amazon-opensearch-service.md
1 parent 2b905cb commit c896cf7

File tree

1 file changed

+40
-35
lines changed

1 file changed

+40
-35
lines changed

docs/integrations/amazon-aws/amazon-opensearch-service.md

Lines changed: 40 additions & 35 deletions
Original file line numberDiff line numberDiff line change
@@ -16,11 +16,11 @@ The Sumo Logic app for Amazon OpenSearch collects CloudWatch logs, CloudWatch me
1616

1717
The Sumo Logic app for Amazon OpenSearch uses:
1818

19-
* OpenSearch CloudWatch Logs. For details, see [here](https://docs.aws.amazon.com/opensearch-service/latest/developerguide/createdomain-configure-slow-logs.html).
20-
* OpenSearch CloudWatch Metrics. For details, see [here](https://docs.aws.amazon.com/opensearch-service/latest/developerguide/managedomains-cloudwatchmetrics.html).
21-
* OpenSearch using AWS CloudTrail. For details, see [here](https://docs.aws.amazon.com/opensearch-service/latest/developerguide/managedomains-cloudtrailauditing.html).
19+
* [OpenSearch CloudWatch Logs](https://docs.aws.amazon.com/opensearch-service/latest/developerguide/createdomain-configure-slow-logs.html)
20+
* [OpenSearch CloudWatch Metrics](https://docs.aws.amazon.com/opensearch-service/latest/developerguide/managedomains-cloudwatchmetrics.html)
21+
* [OpenSearch using AWS CloudTrail](https://docs.aws.amazon.com/opensearch-service/latest/developerguide/managedomains-cloudtrailauditing.html)
2222

23-
### **Sample OpenSearch CloudWatch Logs**
23+
### Sample OpenSearch CloudWatch Logs
2424

2525
<details>
2626
<summary>Click to expand</summary>
@@ -72,7 +72,7 @@ The Sumo Logic app for Amazon OpenSearch uses:
7272

7373
</details>
7474

75-
### **Sample OpenSearch CloudTrail Logs**
75+
### Sample OpenSearch CloudTrail Logs
7676

7777
<details>
7878
<summary>Click to expand</summary>
@@ -145,7 +145,7 @@ The Sumo Logic app for Amazon OpenSearch uses:
145145
```
146146
</details>
147147

148-
### **Sample queries**
148+
### Sample queries
149149

150150
```sql title="Average GC Time (Cloud Watch Error Log)"
151151
account=* region=* namespace=aws/es domainname=* "[WARN ][o.o.m.j.JvmGcMonitorService]"
@@ -155,6 +155,7 @@ account=* region=* namespace=aws/es domainname=* "[WARN ][o.o.m.j.JvmGcMonitorSe
155155
| sum(duration) as Total_Time, avg(duration) as Avg_Time, max(duration) as Max_Time
156156
| fields Avg_Time
157157
```
158+
158159
```sql title="Top 5 Slow Queries by Index (Cloud Watch Slow Log)"
159160
account=* region=* namespace=aws/es domainname=* "[index.search.slowlog.query]"
160161
| parse "[*][*][*] [*] [*][*] took[*], took_millis[*], total_hits[*], stats[], search_type[*], total_shards[*], source[*], id[*]" as timestamp,log_level,log_type, node_id, index_name, shard_number, execution_time, execution_time_millis, total_hits, search_type, total_shards, source, id
@@ -163,6 +164,7 @@ account=* region=* namespace=aws/es domainname=* "[index.search.slowlog.query]"
163164
| count as frequency by domainname, index_name, node_id, execution_time_millis , source
164165
| topk(5, execution_time_millis) by index_name
165166
```
167+
166168
```sql title="Failed Login by User (Cloud Watch Audit Log)"
167169
account=* region=* namespace=aws/es domainname=* FAILED_LOGIN
168170
| json "audit_cluster_name", "audit_node_id","audit_category","audit_request_origin", "audit_request_remote_address", "audit_request_layer","audit_request_effective_user", "audit_rest_request_path"
@@ -172,6 +174,7 @@ account=* region=* namespace=aws/es domainname=* FAILED_LOGIN
172174
| count as freq by domainname, audit_request_effective_user
173175
| sort by freq, domainname asc, audit_request_effective_user asc
174176
```
177+
175178
```sql title="Successful Events by Event Name (Cloud Trail Logs)"
176179
account=* region=* namespace=aws/es "\"eventsource\":\"es.amazonaws.com\""
177180
| json "userIdentity", "eventSource", "eventName", "awsRegion", "sourceIPAddress", "userAgent", "eventType", "recipientAccountId", "requestParameters", "responseElements", "requestID", "errorCode", "errorMessage" as userIdentity, event_source, event_name, region, src_ip, user_agent, event_type, recipient_account_id, requestParameters, responseElements, request_id, error_code, error_message nodrop
@@ -186,13 +189,14 @@ account=* region=* namespace=aws/es "\"eventsource\":\"es.amazonaws.com\""
186189
| count as event_count by event_name
187190
| sort by event_count, event_name asc
188191
```
192+
189193
```sql title="Write Latency by Domain Name (Metrics-based)"
190194
account=* region=* namespace=aws/es domainname=* !nodeid=* metric=WriteLatency statistic = average | avg by domainname
191195
```
192196

193-
## **Collect logs and metrics for the Amazon OpenSearch app**
197+
## Collect logs and metrics for the Amazon OpenSearch app
194198

195-
### **Collect Amazon OpenSearch CloudWatch Logs**
199+
### Collect Amazon OpenSearch CloudWatch Logs
196200

197201
To enable Amazon OpenSearch CloudWatch Logs, follow the steps mentioned in [AWS Documentation](https://docs.aws.amazon.com/opensearch-service/latest/developerguide/createdomain-configure-slow-logs.html)
198202

@@ -212,28 +216,28 @@ Sumo Logic supports several methods for collecting logs from Amazon CloudWatch.
212216

213217
<img src={useBaseUrl('https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/AWS-Lambda/lamda-cw-logs-source-fields.png')} alt="Fields" />
214218

215-
### **Collect Amazon OpenSearch CloudTrail Logs**
219+
### Collect Amazon OpenSearch CloudTrail Logs
216220

217221
1. Add an [AWS CloudTrail Source](https://help.sumologic.com/docs/send-data/hosted-collectors/amazon-aws/aws-cloudtrail-source/) to your Hosted Collector.
218-
* Name. Enter a name to display for the new Source.
219-
* Description. Enter an optional description.
220-
* S3 Region. Select the Amazon Region for your cloudTrail S3 bucket.
221-
* Bucket Name. Enter the exact name of your cloudTrail S3 bucket.
222-
* Path Expression. Enter the string that matches the S3 objects you'd like to collect. You can use a wildcard (\*) in this string.
222+
* **Name**. Enter a name to display for the new Source.
223+
* **Description**. Enter an optional description.
224+
* **S3 Region**. Select the Amazon Region for your CloudTrail S3 bucket.
225+
* **Bucket Name**. Enter the exact name of your CloudTrail S3 bucket.
226+
* **Path Expression**. Enter the string that matches the S3 objects you'd like to collect. You can use a wildcard (\*) in this string.
223227
* DO NOT use a [leading forward slash](https://help.sumologic.com/docs/send-data/hosted-collectors/amazon-aws/amazon-path-expressions/).
224228
* The S3 bucket name is not part of the path. Don’t include the bucket name when you are setting the Path Expression.
225-
* Source Category. Enter a source category. For example, enter `aws/observability/CloudTrail/logs`.
226-
* Fields. Add an account field and assign it a value that is a friendly name/alias to your AWS account from which you are collecting logs. Logs can be queried using the account field.
229+
* **Source Category**. Enter a source category. For example, enter `aws/observability/CloudTrail/logs`.
230+
* **Fields**. Add an account field and assign it a value that is a friendly name/alias to your AWS account from which you are collecting logs. Logs can be queried using the account field.
227231
![Fields][image3]
228232
* Access Key ID and Secret Access Key. Enter your Amazon [Access Key ID and Secret Access Key](http://docs.aws.amazon.com/AWSSimpleQueueService/latest/SQSGettingStartedGuide/AWSCredentials.html). Learn how to use Role-based access to AWS [here](https://help.sumologic.com/docs/send-data/hosted-collectors/amazon-aws/aws-sources/).
229233
* Log File Discovery \-\> Scan Interval. Use the default of 5 minutes. Alternately, enter the frequency. Sumo Logic will scan your S3 bucket for new data. Learn how to configure Log File Discovery [here](https://help.sumologic.com/docs/send-data/hosted-collectors/amazon-aws/aws-sources/).
230234
* Enable Timestamp Parsing. Select the Extract timestamp information from log file entries check box.
231235
* Time Zone. Select Ignore time zone from the log file and instead use, and select UTC from the dropdown.
232236
* Timestamp Format. Select Automatically detect the format.
233237
* Enable Multiline Processing. Select the Detect messages spanning multiple lines check box, and select Infer Boundaries.
234-
2. Click Save.
238+
2. Click **Save**.
235239

236-
### **Collect Amazon OpenSearch CloudWatch Metrics**
240+
### Collect Amazon OpenSearch CloudWatch Metrics
237241

238242
1. Configure a [Hosted Collector](https://help.sumologic.com/docs/send-data/hosted-collectors/configure-hosted-collector/).
239243
2. Configure an [Amazon CloudWatch Source for Metrics](https://help.sumologic.com/docs/send-data/hosted-collectors/amazon-aws/amazon-cloudwatch-source-metrics/) or [AWS Kinesis Firehose for Metrics Source](https://help.sumologic.com/docs/send-data/hosted-collectors/amazon-aws/aws-kinesis-firehose-metrics-source/) (Recommended).
@@ -312,7 +316,7 @@ import AppInstall from '../../reuse/apps/app-install-v2.md';
312316

313317
## **Viewing Amazon OpenSearch dashboards**
314318

315-
### **01. Amazon OpenSearch \- Overview**
319+
### Overview
316320

317321
The Amazon OpenSearch \- Overview dashboard provides a comprehensive overview of Amazon OpenSearch performance and operational metrics. It displays key information about cluster utilization, user activity, query performance, error logs, and system events. The dashboard is designed to help administrators monitor and optimize their OpenSearch deployment across different domains and regions.
318322

@@ -323,7 +327,7 @@ Use this dashboard to:
323327

324328
<img src={useBaseUrl('https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Amazon-OpenSearch/01.-Amazon-OpenSearch-Overview.png')} alt="Fields" />
325329

326-
### **02. Amazon Opensearch \- Performance Overview**
330+
### Performance Overview
327331

328332
The Amazon OpenSearch \- Performance Overview dashboard provides a comprehensive view of the OpenSearch cluster's health, performance, and resource utilization. It offers real-time insights into cluster status, CPU and memory usage, storage metrics, document management, and read/write latencies across different domains.
329333

@@ -337,7 +341,7 @@ Use this dashboard to:
337341

338342
<img src={useBaseUrl('https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Amazon-OpenSearch/02.-Amazon-OpenSearch-Performance-Overview.png')} alt="Fields" />
339343

340-
### **03. Amazon OpenSearch \- CloudTrail Audit Events**
344+
### CloudTrail Audit Events
341345

342346
The Amazon Opensearch \- CloudTrail Audit Events dashboard provides insights across CloudTrail events across location, status, and topic names.
343347

@@ -350,7 +354,7 @@ Use this dashboard to:
350354

351355
<img src={useBaseUrl('https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Amazon-OpenSearch/03.-Amazon-OpenSearch-CloudTrail-Audit-Events.png')} alt="Fields" />
352356

353-
### **04. Amazon OpenSearch \- Audit Logs \- Failed Logins**
357+
### Audit Logs - Failed Logins
354358

355359
The Amazon OpenSearch \- Audit Logs \- Failed Logins dashboard provides a comprehensive view of login activities, focusing on failed login attempts and authentication errors. It offers insights into the geographical distribution of failed logins, user-specific login failures, cluster-based login issues, and detailed authentication error logs.
356360

@@ -363,7 +367,7 @@ Use this dashboard to:
363367

364368
<img src={useBaseUrl('https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Amazon-OpenSearch/04.-Amazon-OpenSearch-Audit-Logs-Failed-Logins.png')} alt="Fields" />
365369

366-
### **05. Amazon OpenSearch \- Error Logs \- Garbage Collection**
370+
### Error Logs - Garbage Collection
367371

368372
The Amazon OpenSearch \- Error Logs \- Garbage Collection dashboard provides a comprehensive view of garbage collection (GC) activities in AWS OpenSearch Service. It offers insights into GC performance, memory cleanup, and JVM memory usage across different domains. The dashboard helps monitor and optimize the garbage collection process, which is crucial for maintaining the performance and stability of OpenSearch clusters.
369373

@@ -376,7 +380,7 @@ Use this dashboard to:
376380

377381
<img src={useBaseUrl('https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Amazon-OpenSearch/05.-Amazon-OpenSearch-Error-Logs-Garbage-Collection.png')} alt="Fields" />
378382

379-
### **06. Amazon OpenSearch \- Slow Logs \- Queries**
383+
### Slow Logs - Queries
380384

381385
The Amazon Opensearch \- Slow Logs \- Queries dashboard provides a comprehensive view of query performance and behavior within an OpenSearch environment.
382386

@@ -388,7 +392,7 @@ Use this dashboard to:
388392

389393
<img src={useBaseUrl('https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Amazon-OpenSearch/06.-Amazon-OpenSearch-Slow-Logs-Queries.png')} alt="Fields" />
390394

391-
### **07. Amazon OpenSearch \- Domain Name (Cluster) Performance**
395+
### Domain Name (Cluster) Performance
392396

393397
The Amazon OpenSearch \- Domain Name (Cluster) Performance dashboard provides a comprehensive view of cluster performance and resource utilization across different domains. It offers insights into node count, CPU and memory usage, request patterns, and storage metrics for OpenSearch clusters.
394398

@@ -401,10 +405,10 @@ Use this dashboard to:
401405

402406
<img src={useBaseUrl('https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Amazon-OpenSearch/07.-Amazon-OpenSearch-Domain-Name-Cluster-Performance.png')} alt="Fields" />
403407

404-
### **08. Amazon OpenSearch \- Nodes Performance**
408+
### Nodes Performance
405409

406-
Summary:
407410
The Amazon OpenSearch \- Nodes Performance dashboard provides a detailed view of node-level performance metrics for OpenSearch clusters across different domains. It offers insights into search and indexing operations, threadpool activities, and overall cluster health, allowing for granular monitoring and troubleshooting of OpenSearch nodes.
411+
408412
Use this dashboard to:
409413
* Compare search and indexing performance across different nodes and domains, with visualizations for search/indexing rates and latencies, helping identify potential bottlenecks or underperforming nodes.
410414
* Monitor thread pool activities, including search queue times, rejected requests, and write queue metrics, which are crucial for understanding cluster load and capacity issues.
@@ -414,9 +418,10 @@ Use this dashboard to:
414418

415419
<img src={useBaseUrl('https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Amazon-OpenSearch/08.-Amazon-OpenSearch-Nodes-Performance.png')} alt="Fields" />
416420

417-
### **09. Amazon OpenSearch \- EBS Volume Performance**
421+
### EBS Volume Performance
422+
423+
The Amazon OpenSearch \- EBS Volume Performance dashboard provides a comprehensive view of the performance metrics for Amazon Elastic Block Store (EBS) volumes associated with OpenSearch clusters. It displays various key performance indicators such as read and write latency, I/O operations per second (IOPS), throughput, burst balance, and disk queue depth.
418424

419-
The Amazon OpenSearch \- EBS Volume Performance dashboard provides a comprehensive view of the performance metrics for Amazon Elastic Block Store (EBS) volumes associated with OpenSearch clusters. It displays various key performance indicators such as read and write latency, I/O operations per second (IOPS), throughput, burst balance, and disk queue depth.
420425
Use this dashboard to:
421426
* Monitor read and write latency of EBS volumes to ensure optimal response times for OpenSearch operations.
422427
* Track read and write IOPS to understand the I/O demand on your EBS volumes and identify any performance constraints.
@@ -426,14 +431,14 @@ Use this dashboard to:
426431

427432
<img src={useBaseUrl('https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Amazon-OpenSearch/09.-Amazon-OpenSearch-EBS-Volume-Performance.png')} alt="Fields" />
428433

429-
### **10. Amazon OpenSearch \- Cache Performance**
434+
### Cache Performance
430435

431436
The Amazon OpenSearch \- Cache Performance dashboard provides insights into cache performance, evictions, capacity, and memory usage, which are crucial for maintaining optimal performance of OpenSearch clusters.
432437

433438
Use this dashboard to:
434-
* Performance tuning of OpenSearch clusters
435-
* Capacity planning for cache and memory resources
436-
* Troubleshooting cache-related issues
437-
* Ability to correlate cache metrics with overall system performance
439+
* Performance tuning of OpenSearch clusters.
440+
* Capacity planning for cache and memory resources.
441+
* Troubleshooting cache-related issues.
442+
* Ability to correlate cache metrics with overall system performance.
438443

439-
<img src={useBaseUrl('https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Amazon-OpenSearch/10.-Amazon-OpenSearch-Cache-Performance.png')} alt="Fields" />
444+
<img src={useBaseUrl('https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Amazon-OpenSearch/10.-Amazon-OpenSearch-Cache-Performance.png')} alt="Fields" />

0 commit comments

Comments
 (0)