
Commit 876df63

James96315 and James Ma authored
[2.2.1] - 2024-07-22 (#300)
### Added
- Optimised instance list performance. #76
- Added support for using Unix epoch time format as the time key for single-line text log type.

### Fixed
- Fixed an issue where the time key was missing when editing JSON config. #296
- Fixed an issue where upgrading to v2.2.0 failed due to missing CMK permissions. #297

---------

Co-authored-by: James Ma
1 parent c6a6f75 commit 876df63

54 files changed, +1717 -2336 lines changed

CHANGELOG.md

Lines changed: 15 additions & 0 deletions
@@ -5,6 +5,18 @@ All notable changes to this project will be documented in this file.
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
 
+## [2.2.1] - 2024-07-22
+
+### Added
+
+- Optimised instance list performance. #76
+- Added support for using Unix epoch time format as the time key for single-line text log type.
+
+### Fixed
+
+- Fixed an issue where the time key was missing when editing JSON config. #296
+- Fixed an issue where upgrading to v2.2.0 failed due to missing CMK permissions. #297
+
 ## [2.2.0] - 2024-06-20
 
 ### Added
@@ -26,7 +38,10 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 - Fixed a bug where the system could not read properties of undefined ('accountId') when the Next button was clicked without selecting an Instance Group. #236
 - Fixed an issue where logs were not received when using the solution-provisioned staging bucket in Light Engine. #237
 - Fixed a permissions issue in the LogMerger State Machine within Light Engine: The S3ObjectMigration Lambda failed due to insufficient KMS permissions on the analytics S3 bucket. #272
+- Fixed a bug where at most 100 distributions could be displayed when creating a pipeline. #278
 - Fixed a bug that prevented instances from being listed when switching accounts on the Instance Group list page. #291
+- Fixed a bug where the index template could not be created for a Log Conf of JSON type when a field type was set to float. #293
+
 
 ## [2.1.2] - 2024-03-19
 
README.md

Lines changed: 0 additions & 1 deletion
@@ -10,7 +10,6 @@ The Centralized Logging with OpenSearch solution provides comprehensive log mana
 - [Architecture](#architecture)
 - [Deployment](#deployment)
 - [Customization](#customization)
-- [Collection of operational metrics](#collection-of-operational-metrics)
 
 
 ## Solution Overview

docs/en/implementation-guide/trouble-shooting.md

Lines changed: 92 additions & 0 deletions
@@ -186,3 +186,95 @@ echo /usr/local/openssl11/lib/ >> /etc/ld.so.conf
 ldconfig
 
 ```
+
+## The OpenSearch data node's disk was full, so I ran "delete index_prefix*" from Dev Tools in OpenSearch Dashboards. Since then, the index name suffix no longer contains the time format. How do I fix this?
+
+!!! Warning "Note"
+
+The following procedure deletes the index that is currently being written to, which results in data loss.
+
+1. Open the Centralized Logging with OpenSearch console, find the pipeline that has this issue, and choose **View details**.
+2. Go to **Monitoring** > **Lambda Processor**, and click the link (starting with `/aws/lambda/CL-xxx`) under **Lambda Processor**.
+
+![](../images/trouble-shooting/lambda-link.png)
+
+3. In the **Lambda** console, go to **Configuration** > **Concurrency**, choose **Edit**, select **Reserve concurrency**, and set it to 0.
+
+![](../images/trouble-shooting/lambda-configuration-concurrency.png)
+
+![](../images/trouble-shooting/lambda-edit-concurrency.png)
+
+4. Open OpenSearch Dashboards, go to **Dev Tools**, enter `DELETE your_index_name`, and send the request.
+
+![](../images/trouble-shooting/aos-dev-tools.png)
+
+![](../images/trouble-shooting/delete_index.png)
+
+5. Enter `GET _cat/indices/your_index_name` and send the request. If the response has **"status"** 404 and **"type"** `index_not_found_exception`, the deletion succeeded. Otherwise, repeat step 4.
+
+![](../images/trouble-shooting/cat_index.png)
+
+6. Enter `POST /your_index_name/_rollover` and send the request.
+
+7. In the **Lambda** console, go to **Configuration** > **Concurrency**, choose **Edit**, then either select **Reserve concurrency** and set it to the value you want, or select **Use unreserved account concurrency**, and choose **Save**.
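
The console steps above can also be scripted. The following is a minimal sketch rather than part of the solution itself. It assumes the AWS CLI and `curl` are installed, that the OpenSearch domain endpoint is reachable from where the script runs, and that the domain accepts basic authentication. `FUNCTION_NAME`, `OS_ENDPOINT`, `OS_AUTH`, and `INDEX` are placeholder names introduced here for illustration.

```shell
#!/usr/bin/env bash
# Sketch of the recovery procedure; every variable below is a hypothetical placeholder.
FUNCTION_NAME="${FUNCTION_NAME:-CL-xxx}"                   # Lambda Processor function found in step 2
OS_ENDPOINT="${OS_ENDPOINT:-https://your-domain-endpoint}"  # assumed OpenSearch endpoint URL
OS_AUTH="${OS_AUTH:-user:password}"                         # assumes basic auth is enabled
INDEX="${INDEX:-your_index_name}"

# Step 3: a reserved concurrency of 0 throttles every invocation of the function.
pause_processor() {
  aws lambda put-function-concurrency \
    --function-name "$FUNCTION_NAME" \
    --reserved-concurrent-executions 0
}

# Step 4: delete the affected index.
delete_index() {
  curl -sS -u "$OS_AUTH" -X DELETE "$OS_ENDPOINT/$INDEX"
}

# Step 5: succeed only when the index lookup returns HTTP 404.
index_is_gone() {
  local status
  status=$(curl -sS -u "$OS_AUTH" -o /dev/null -w '%{http_code}' \
    "$OS_ENDPOINT/_cat/indices/$INDEX")
  [ "$status" = "404" ]
}

# Step 6: send the rollover request from the procedure.
rollover_index() {
  curl -sS -u "$OS_AUTH" -X POST "$OS_ENDPOINT/$INDEX/_rollover"
}

# Step 7: remove the reserved limit, which corresponds to
# "Use unreserved account concurrency" in the console.
resume_processor() {
  aws lambda delete-function-concurrency --function-name "$FUNCTION_NAME"
}

# Usage, in order:
#   pause_processor && delete_index && index_is_gone && rollover_index && resume_processor
```

Keeping each step in its own function makes it easy to stop after `index_is_gone` fails and rerun `delete_index`, as step 5 asks. To restore a specific reserved value instead of removing the limit, call `put-function-concurrency` again with that value.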
+
+## Standard Operating Procedure for Proxy Stack Connection Problems
+
+### When I access OpenSearch Dashboards through the proxy, the browser shows a 504 Gateway Timeout error
+
+##### Possible root cause:
+a. The instances keep terminating and initializing
+
+i. Wrong security group
+
+b. The instances do not keep terminating
+
+i. VPC peering request not accepted
+
+ii. Peering with the wrong VPC
+
+iii. Route table has the wrong routes
+
+c. Check whether VPC peering is working.
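
The peering checks in the list above can be run from the AWS CLI. This is a minimal sketch that assumes the AWS CLI is configured for the account and Region that own the peering connection. `PEERING_ID` is a placeholder introduced here for illustration and must be replaced with your own peering connection ID.

```shell
#!/usr/bin/env bash
# Hypothetical placeholder: ID of the VPC peering connection to inspect.
PEERING_ID="${PEERING_ID:-pcx-xxxxxxxx}"

# Prints the peering status code. "active" means the request was accepted;
# "pending-acceptance" means it still has to be accepted.
peering_status() {
  aws ec2 describe-vpc-peering-connections \
    --vpc-peering-connection-ids "$PEERING_ID" \
    --query 'VpcPeeringConnections[0].Status.Code' \
    --output text
}

# Prints the requester and accepter VPC IDs, to confirm the right VPCs are peered.
peering_vpcs() {
  aws ec2 describe-vpc-peering-connections \
    --vpc-peering-connection-ids "$PEERING_ID" \
    --query 'VpcPeeringConnections[0].[RequesterVpcInfo.VpcId,AccepterVpcInfo.VpcId]' \
    --output text
}

# Lists the route tables that contain a route through the peering connection.
peering_routes() {
  aws ec2 describe-route-tables \
    --filters "Name=route.vpc-peering-connection-id,Values=$PEERING_ID" \
    --query 'RouteTables[].RouteTableId' \
    --output text
}

# Usage:
#   peering_status; peering_vpcs; peering_routes
```

If `peering_routes` prints nothing, no route table sends traffic through the peering connection, which matches the "wrong routes" case.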
+
+### When I access OpenSearch Dashboards through the proxy, the browser shows "Site can't be reached"
+
+![](../images/trouble-shooting/site_cannt_be_reached.png)
+
+##### Possible root cause:
+
+1. The Application Load Balancer is deployed inside a private subnet.
+
+2. The proxy stack has just been redeployed. It takes at least 15 minutes for the DNS server to resolve the new Load Balancer endpoint address.
+
+
+##### Solution:
+
+1. The ALB is deployed in the wrong location. Delete the proxy stack and create a new one.
+
+2. Wait at least 15 minutes.
+
+## I set the log collection path to /log_path/*.log. What is the impact?
+
+!!! Warning "Note"
+
+We normally don't recommend using the wildcard * as a prefix for matching logs. If the directory contains hundreds or even thousands of files, this seriously slows down Fluent Bit's log collection. We recommend removing outdated files on a regular basis.
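
One way to act on the recommendation above is a small cleanup script run on a schedule. This is a sketch under assumptions: it relies on the `-maxdepth` and `-delete` options available in GNU and BSD `find`, and the path and retention period in the usage comment are illustrative, not values from the solution.

```shell
#!/usr/bin/env bash
# Counts the *.log files directly inside a directory,
# i.e. the files a pattern such as /log_path/*.log would match.
count_logs() {
  find "$1" -maxdepth 1 -type f -name '*.log' | wc -l | tr -d ' '
}

# Deletes *.log files directly inside a directory whose last
# modification is more than the given number of days old.
prune_old_logs() {
  find "$1" -maxdepth 1 -type f -name '*.log' -mtime +"$2" -delete
}

# Usage (hypothetical path and retention period):
#   count_logs /log_path
#   prune_old_logs /log_path 7
```

Pick a retention period long enough that files still being written or not yet collected are never removed.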
+
+## The log file names are the same across systems, but the log path contains the system name to tell them apart. I want to create one pipeline to handle this. How should I set the log path?
+
+!!! Info "Note"
+
+#### Let's go through an example:
+
+Suppose there are three environments: dev, staging, and prod. The log paths are /log_path/dev/jvm.log, /log_path/staging/jvm.log, and /log_path/prod/jvm.log. To cover all of them with a single pipeline, set the log path as follows:
+
+![](../images/trouble-shooting/log_path.png)
+
+`/log_path/*/jvm.log`.
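
To preview which files such a pattern picks up, you can expand it in a shell. The sketch below builds a throwaway directory tree that mirrors the example and uses shell globbing as an approximation of how the collector expands the path. That equivalence is an assumption, and `demo_root` and `list_matches` are names introduced here for illustration.

```shell
#!/usr/bin/env bash
# Build a temporary tree mirroring the example, plus one file that should not match.
demo_root=$(mktemp -d)
mkdir -p "$demo_root/log_path/dev" "$demo_root/log_path/staging" "$demo_root/log_path/prod"
touch "$demo_root/log_path/dev/jvm.log" \
      "$demo_root/log_path/staging/jvm.log" \
      "$demo_root/log_path/prod/jvm.log" \
      "$demo_root/log_path/prod/gc.log"

# Prints every file matched by <root>/log_path/*/jvm.log, one per line.
list_matches() {
  local f
  for f in "$1"/log_path/*/jvm.log; do
    if [ -e "$f" ]; then
      echo "$f"
    fi
  done
}

list_matches "$demo_root"
# Clean up when done:
#   rm -rf "$demo_root"
```

The script prints the three `jvm.log` paths and skips `gc.log`, because `*` stands in for exactly one directory level and the file name is fixed.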
+
+## In an EKS environment, I use DaemonSet mode to collect logs, but my logs are not written to standard output. How should I configure the YAML file for deployment?
+
+When you create a pipeline in CLO with EKS as the log source, the system automatically generates YAML content to help you create the deployment file for Fluent Bit. In that YAML file, find the section that matches the log path `/your_log_path/` and remove the `Parser cri_regex` line. See the following screenshot for details:
+
+![](../images/trouble-shooting/without_cri_log.png)
+