Commit ac8532b

Doc: Add Known issue for 8.18.7: Agent stuck on failed upgrade (#1856)
1 parent 8a8c0f7 commit ac8532b

1 file changed: +39 -0 lines changed
docs/en/ingest-management/release-notes/release-notes-8.18.asciidoc

Lines changed: 39 additions & 0 deletions
@@ -33,6 +33,45 @@ Also see:
 [[release-notes-8.18.7]]
 == {fleet} and {agent} 8.18.7
 
+[discrete]
+[[known-issues-8.18.7]]
+=== Known issues
+
+[[known-issue-2978-8-18-7]]
+.Failed upgrades leave {agent} stuck until restart
+[%collapsible]
+====
+
+This known issue applies to {agent} 8.18.7 and 9.0.7. {agent} versions 8.19.x and 9.1.x are not affected.
+
+On September 17, 2025, a known issue was discovered that can cause {agent} upgrades to get stuck if an upgrade attempt fails under specific conditions. This happens because the coordinator's `overrideState` remains set, leaving the agent in a state that appears to be upgrading.
+
+**Conditions**
+
+This issue is triggered if the upgrade fails during one of the early checks inside `Coordinator.Upgrade`, for example:
+
+- The agent is not upgradeable
+- Capabilities check denies the upgrade
+- When {agent} is tamper-protected, Endpoint must validate that the upgrade action was correctly signed by Kibana to allow the upgrade. If the signature is missing, invalid, or the connection between {agent} and Endpoint was interrupted, the validation fails. This causes the agent coordinator's override state to become stuck until the agent is restarted.
+
+**Symptoms**
+
+- {fleet} shows the upgrade action in progress, even though the upgrade remains stuck
+- No further upgrade attempts succeed
+- Elastic Agent status shows an override state indicating upgrade
+
+**Workaround**
+
+Restart the {agent} to clear the coordinator's `overrideState` and allow new upgrade attempts to proceed.
+
+**Resolution**
+
+This issue was fixed in link:https://github.com/elastic/elastic-agent/pull/9992[#9992], which ensures that the coordinator clears its override state whenever an early failure occurs.
+
+The fix is included in versions 9.1.4 and 8.19.4, and planned for versions 9.0.8 and 8.18.8.
+
+====
+
 [discrete]
 [[features-enhancements-8.18.7]]
 === New features and enhancements
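
The failure mode described in the known issue can be illustrated with a small toy model. This is only a sketch under stated assumptions, not the actual elastic-agent code (which is written in Go): the `Coordinator` class, the `clear_on_early_failure` flag, the `"UPGRADING"` value, and the `restart` method are all hypothetical names chosen for illustration. It shows an override state being set before the early checks run, staying set when a check fails without cleanup, and being cleared either by a restart (the workaround) or by cleanup on the failure path (the behavior the fix is described as adding).

```python
class UpgradeError(Exception):
    """Raised when an early upgrade check fails (hypothetical)."""


class Coordinator:
    """Toy model of a coordinator that tracks an upgrade override state."""

    def __init__(self, upgradeable=True, clear_on_early_failure=True):
        self.upgradeable = upgradeable
        # False models the affected behavior, True models the fixed behavior.
        self.clear_on_early_failure = clear_on_early_failure
        self.override_state = None  # None means no override is active

    def upgrade(self, version):
        if self.override_state is not None:
            # A lingering override makes every later attempt fail.
            raise UpgradeError("an upgrade already appears to be in progress")
        # Mark the agent as upgrading before running the early checks.
        self.override_state = "UPGRADING"
        try:
            if not self.upgradeable:
                raise UpgradeError("agent is not upgradeable")
            # Download, verification, and install steps are omitted here.
        except UpgradeError:
            if self.clear_on_early_failure:
                # Fixed behavior: undo the override on early failure.
                self.override_state = None
            raise
        # Toy model only: treat the upgrade as finished and clear the override.
        self.override_state = None
        return version

    def restart(self):
        # Workaround: a restart discards the in-memory override state.
        self.override_state = None


def try_upgrade(coordinator, version):
    """Attempt an upgrade and report whether it succeeded."""
    try:
        coordinator.upgrade(version)
        return True
    except UpgradeError:
        return False
```

With `clear_on_early_failure=False`, a failed `try_upgrade` call leaves `override_state` set and blocks later attempts until `restart()` is called. With the default `True`, the state is cleared as soon as the early check fails.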
