You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/system/system-metrics.md
+16-28Lines changed: 16 additions & 28 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1332,14 +1332,10 @@ This metric is [recommended][MetricRecommended].
1332
1332
The metric represents the percentage of time that tasks were stalled on a given resource
1333
1333
over the specified time window.
1334
1334
1335
-
The "some" stall type indicates at least some tasks are stalled on the resource.
1336
-
The "full" stall type indicates all non-idle tasks are stalled simultaneously, representing
1337
-
a more severe state where the system is thrashing and CPU cycles are wasted.
1338
-
1339
1335
PSI is available on Linux systems with kernel 4.20 or later and requires CONFIG_PSI=y.
1340
1336
CPU "full" stall is reported as zero at the system level for backward compatibility (available since 5.13).
1341
1337
1342
-
Values are percentages in the range [0, 100]. The ratios are tracked over 10-second, 60-second,
1338
+
The ratios are tracked over 10-second, 60-second,
1343
1339
and 300-second windows.
1344
1340
1345
1341
See [Linux kernel PSI documentation](https://docs.kernel.org/accounting/psi.html) and
@@ -1349,15 +1345,13 @@ See [Linux kernel PSI documentation](https://docs.kernel.org/accounting/psi.html
1349
1345
1350
1346
| Key | Stability |[Requirement Level](https://opentelemetry.io/docs/specs/semconv/general/attribute-requirement-level/)| Value Type | Description | Example Values |
1351
1347
|---|---|---|---|---|---|
1352
-
|[`system.psi.resource`](/docs/registry/attributes/system.md)||`Required`| string | The resource experiencing pressure (cpu, memory, or io) [1]|`cpu`; `memory`; `io`|
1353
-
|[`system.psi.stall_type`](/docs/registry/attributes/system.md)||`Required`| string | The stall type (some or full) [2]|`some`; `full`|
1354
-
|[`system.psi.window`](/docs/registry/attributes/system.md)||`Required`| string | The time window over which pressure is calculated [3]|`10s`; `60s`; `300s`|
1348
+
|[`system.psi.resource`](/docs/registry/attributes/system.md)||`Required`| string | The resource experiencing pressure [1]|`cpu`; `memory`; `io`|
1349
+
|[`system.psi.stall_type`](/docs/registry/attributes/system.md)||`Required`| string | The PSI stall type |`some`; `full`|
1350
+
|[`system.psi.window`](/docs/registry/attributes/system.md)||`Required`| string | The time window over which pressure is calculated [2]|`10s`; `60s`; `300s`|
1355
1351
1356
1352
**[1]`system.psi.resource`:** Linux PSI (Pressure Stall Information) measures resource pressure for CPU, memory, and I/O. See [Linux kernel PSI documentation](https://docs.kernel.org/accounting/psi.html).
1357
1353
1358
-
**[2]`system.psi.stall_type`:** PSI distinguishes between "some" stall (at least some tasks stalled) and "full" stall (all non-idle tasks stalled simultaneously).
1359
-
1360
-
**[3]`system.psi.window`:** Typically one of: 10s, 60s, or 300s
1354
+
**[2]`system.psi.window`:** PSI tracks pressure as percentages over 10-second, 60-second, and 300-second windows. This attribute identifies which time window the metric represents.
1361
1355
1362
1356
---
1363
1357
@@ -1375,12 +1369,12 @@ See [Linux kernel PSI documentation](https://docs.kernel.org/accounting/psi.html
1375
1369
1376
1370
| Value | Description | Stability |
1377
1371
|---|---|---|
1378
-
|`full`| All non-idle tasks are stalled on the resource simultaneously [4]||
1379
-
|`some`| At least some tasks are stalled on the resource [5]||
1372
+
|`full`| All non-idle tasks are stalled on the resource simultaneously [3]||
1373
+
|`some`| At least some tasks are stalled on the resource [4]||
1380
1374
1381
-
**[4]:** The "full" line indicates the share of time in which all non-idle tasks are stalled on a given resource simultaneously. This represents a state where actual CPU cycles are going to waste and the workload is thrashing. CPU full is undefined at the system level and is set to zero for backward compatibility (available since Linux 5.13).
1375
+
**[3]:** The "full" line indicates the share of time in which all non-idle tasks are stalled on a given resource simultaneously. This represents a state where actual CPU cycles are going to waste and the workload is thrashing. CPU full is undefined at the system level and is set to zero for backward compatibility (available since Linux 5.13).
1382
1376
1383
-
**[5]:** The "some" line indicates the share of time in which at least some tasks are stalled on a given resource.
1377
+
**[4]:** The "some" line indicates the share of time in which at least some tasks are stalled on a given resource.
1384
1378
1385
1379
<!-- markdownlint-restore -->
1386
1380
<!-- prettier-ignore-end -->
@@ -1407,28 +1401,22 @@ Unlike the percentage-based `system.linux.psi.pressure` metric, this allows dete
1407
1401
of latency spikes that wouldn't necessarily make a noticeable impact on time averages.
1408
1402
It also enables calculating average trends over custom time frames.
1409
1403
1410
-
The "some" stall type indicates at least some tasks are stalled on the resource.
1411
-
The "full" stall type indicates all non-idle tasks are stalled simultaneously.
1412
-
1413
1404
PSI is available on Linux systems with kernel 4.20 or later and requires CONFIG_PSI=y.
1414
1405
CPU "full" stall is reported as zero at the system level for backward compatibility (available since 5.13).
1415
1406
1416
1407
This is a monotonically increasing counter that resets on system reboot.
1417
1408
1418
-
See [Linux kernel PSI documentation](https://docs.kernel.org/accounting/psi.html) and
See [Linux kernel PSI documentation](https://docs.kernel.org/accounting/psi.html)
1420
1410
1421
1411
**Attributes:**
1422
1412
1423
1413
| Key | Stability |[Requirement Level](https://opentelemetry.io/docs/specs/semconv/general/attribute-requirement-level/)| Value Type | Description | Example Values |
1424
1414
|---|---|---|---|---|---|
1425
-
|[`system.psi.resource`](/docs/registry/attributes/system.md)||`Required`| string | The resource experiencing pressure (cpu, memory, or io) [1]|`cpu`; `memory`; `io`|
1426
-
|[`system.psi.stall_type`](/docs/registry/attributes/system.md)||`Required`| string | The stall type (some or full) [2]|`some`; `full`|
1415
+
|[`system.psi.resource`](/docs/registry/attributes/system.md)||`Required`| string | The resource experiencing pressure [1]|`cpu`; `memory`; `io`|
1416
+
|[`system.psi.stall_type`](/docs/registry/attributes/system.md)||`Required`| string | The PSI stall type |`some`; `full`|
1427
1417
1428
1418
**[1]`system.psi.resource`:** Linux PSI (Pressure Stall Information) measures resource pressure for CPU, memory, and I/O. See [Linux kernel PSI documentation](https://docs.kernel.org/accounting/psi.html).
1429
1419
1430
-
**[2]`system.psi.stall_type`:** PSI distinguishes between "some" stall (at least some tasks stalled) and "full" stall (all non-idle tasks stalled simultaneously).
1431
-
1432
1420
---
1433
1421
1434
1422
`system.psi.resource` has the following list of well-known values. If one of them applies, then the respective value MUST be used; otherwise, a custom value MAY be used.
@@ -1445,12 +1433,12 @@ See [Linux kernel PSI documentation](https://docs.kernel.org/accounting/psi.html
1445
1433
1446
1434
| Value | Description | Stability |
1447
1435
|---|---|---|
1448
-
|`full`| All non-idle tasks are stalled on the resource simultaneously [3]||
1449
-
|`some`| At least some tasks are stalled on the resource [4]||
1436
+
|`full`| All non-idle tasks are stalled on the resource simultaneously [2]||
1437
+
|`some`| At least some tasks are stalled on the resource [3]||
1450
1438
1451
-
**[3]:** The "full" line indicates the share of time in which all non-idle tasks are stalled on a given resource simultaneously. This represents a state where actual CPU cycles are going to waste and the workload is thrashing. CPU full is undefined at the system level and is set to zero for backward compatibility (available since Linux 5.13).
1439
+
**[2]:** The "full" line indicates the share of time in which all non-idle tasks are stalled on a given resource simultaneously. This represents a state where actual CPU cycles are going to waste and the workload is thrashing. CPU full is undefined at the system level and is set to zero for backward compatibility (available since Linux 5.13).
1452
1440
1453
-
**[4]:** The "some" line indicates the share of time in which at least some tasks are stalled on a given resource.
1441
+
**[3]:** The "some" line indicates the share of time in which at least some tasks are stalled on a given resource.
0 commit comments