Skip to content

Commit f0e3c64

Browse files
authored
Get Alarm Runtime Status: support query the running status for the whole cluster. (#13570)
1 parent f0d3d9b commit f0e3c64

File tree

24 files changed

+916
-264
lines changed

24 files changed

+916
-264
lines changed

dist-material/alarm-settings.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@
1616

1717
# Sample alarm rules.
1818
rules:
19-
# Rule unique name, must be ended with `_rule`.
19+
# Rule unique id, must be ended with `_rule`.
2020
service_resp_time_rule:
2121
# A MQE expression, the result type must be `SINGLE_VALUE` and the root operation of the expression must be a Compare Operation
2222
# which provides `1`(true) or `0`(false) result. When the result is `1`(true), the alarm will be triggered.

dist-material/config-examples/alarm-settings.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@
1616

1717
# Sample alarm rules.
1818
rules:
19-
# Rule unique name, must be ended with `_rule`.
19+
# Rule unique id, must be ended with `_rule`.
2020
endpoint_percent_rule:
2121
expression: sum((endpoint_sla / 100) < 75) >= 3
2222
# The length of time to evaluate the metrics

docs/en/changes/changes.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -117,6 +117,7 @@
117117
* Library-pprof-parser: feat: add PprofSegmentParser.
118118
* Storage: feat: add languageType column to ProfileThreadSnapshotRecord.
119119
* Feat: add go profile analyzer
120+
* Get Alarm Runtime Status: support query the running status for the whole cluster.
120121

121122
#### UI
122123

docs/en/setup/backend/backend-alarm.md

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ Defines the relation between scope and entity name.
1818

1919
## Rules
2020
An alerting rule is made up of the following elements:
21-
- **Rule name**. A unique name shown in the alarm message. It must end with `_rule`.
21+
- **Rule id**. A unique name shown in the alarm message. It must end with `_rule`.
2222
- **Expression**. A [MQE](../../api/metrics-query-expression.md) expression that defines the conditions of the rule.
2323
The result type must be `SINGLE_VALUE` and the root operation of the expression must be a
2424
[Compare Operation](../../api/metrics-query-expression.md#compare-operation) or [Bool Operation](../../api/metrics-query-expression.md#bool-operation) which provides `1`(true) or `0`(false) result.
@@ -34,9 +34,9 @@ The metrics names in the expression could be found in the [list of all potential
3434
- **Exclude names regex**. A regex that excludes entity names. Both rules will take effect if both include-label list and include-label regex are set.
3535
- **Tags**. Tags are key/value pairs that are attached to alarms. Tags are used to specify distinguishing attributes of alarms that are meaningful and relevant to users. If you want to make these tags searchable on the SkyWalking UI, you may set the tag keys in `core/default/searchableAlarmTags` or through the system environment variable `SW_SEARCHABLE_ALARM_TAG_KEYS`. The key `level` is supported by default.
3636
- **Period**. The size of metrics cache in minutes for checking the alarm conditions. This is a time window that corresponds to the backend deployment env time.
37-
- **Hooks**. Binding the specific names of the hooks when the alarm is triggered.
38-
The name format is `{hookType}.{hookName}` (slack.custom1 e.g.) and must be defined in the `hooks` section of the `alarm-settings.yml` file.
39-
If the hook name is not specified, the global hook will be used.
37+
- **Hooks**. Binding the specific ids of the hooks when the alarm is triggered.
38+
The id format is `{hookType}.{hookName}` (slack.custom1 e.g.) and must be defined in the `hooks` section of the `alarm-settings.yml` file.
39+
If the hook id is not specified, the global hook will be used.
4040
- **Silence period**. After the alarm is triggered at Time-N (TN), there will be silence during the **TN -> TN + period**.
4141
By default, it works in the same manner as **period**. The same Alarm (having the same ID in the same metrics name) may only be triggered once within a period.
4242

@@ -61,7 +61,7 @@ the calculation is `((1001 + 10001 + ... + 1001) / 7) > 1000` and the result wou
6161

6262
```yaml
6363
rules:
64-
# Rule unique name, must be ended with `_rule`.
64+
# Rule unique id, must be ended with `_rule`.
6565
endpoint_percent_rule:
6666
# A MQE expression and the root operation of the expression must be a Compare Operation.
6767
expression: sum((endpoint_sla / 100) < 75) >= 3
@@ -145,8 +145,8 @@ is through [AI powered baseline calculation](../ai-pipeline/metrics-baseline-int
145145

146146
## Hooks
147147
Hooks are a way to send alarm messages to the outside world. SkyWalking supports multiple hooks of the same type, each hook can support different configurations.
148-
For example, you can configure two Slack hooks, one named `default` and set `is-default: true` means this hook will apply on all `Alarm Rules` **without config** `hooks`.
149-
Another named `custom1` will only apply on the `Alarm Rules` which **with config** `hooks` and include the name `slack.custom1`.
148+
For example, you can configure two Slack hooks, one id is `default` and set `is-default: true` means this hook will apply on all `Alarm Rules` **without config** `hooks`.
149+
Another id is `custom1` will only apply on the `Alarm Rules` which **with config** `hooks` and include the id `slack.custom1`.
150150

151151
```yaml
152152
hooks:

0 commit comments

Comments
 (0)