You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/operator-nexus/troubleshoot-bmm-degraded.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,15 +2,15 @@
2
2
title: Troubleshoot BMM Degraded issues in Azure Operator Nexus
3
3
description: Troubleshooting guide for Bare Metal Machines in 'Degraded' status in Azure Operator Nexus.
4
4
ms.service: azure-operator-nexus
5
-
ms.custom: troubleshooting
5
+
ms.custom: azure-operator-nexus
6
6
ms.topic: troubleshooting
7
7
ms.date: 02/03/2025
8
8
author: robertstarling
9
9
ms.author: robstarling
10
10
ms.reviewer: ekarandjeff
11
11
---
12
12
13
-
# Troubleshoot _Degraded_ status errors on an Azure Operator Nexus cluster Bare Metal Machine
13
+
# Troubleshoot _Degraded_ status errors on an Azure Operator Nexus Cluster Bare Metal Machine
14
14
15
15
This document provides basic troubleshooting information for Bare Metal Machine (BMM) resources which are reporting a _Degraded_ status in the BMM detailed status message.
16
16
@@ -29,7 +29,7 @@ Bare Metal Machines (BMM) which are in _Degraded_ state exhibit the following sy
29
29
|`Degraded: LACP status is down`| Yes |[Degraded: `LACP status is down`](#degraded-lacp-status-is-down)|
30
30
|`Degraded: BMM power state doesn't match expected state`| No |[Degraded: `BMM power state doesn't match expected state`](#degraded-bmm-power-state-doesnt-match-expected-state)|
31
31
32
-
_Degraded_ status messages and associated automatic cordoning behavior are present in Azure Operator Nexus version 4.1 and higher.
32
+
_Degraded_ status messages and associated automatic cordoning behavior are present in Azure Operator Nexus version 2502.1 and higher.
33
33
34
34
## Troubleshooting
35
35
@@ -126,7 +126,7 @@ If an uncordoned BMM is in a _Degraded_ state for 15 minutes or more, the node m
126
126
- To uncordon a BMM manually, use the `az networkcloud baremetalmachine uncordon` command or execute the _Uncordon_ action from the Azure portal.
127
127
- Manually uncordoning a BMM which still has a degraded condition has no effect. The _Uncordon_ request will execute successfully, but the node will immediately be automatically cordoned again until 2 hours after the underlying conditions are resolved.
128
128
129
-
To investigate whether a currently cordoned node is due to a recent _Degraded_ state:
129
+
To investigate whether a currently cordoned BMM is due to a recent _Degraded_ state:
130
130
131
131
- Review the `lastTransitionTime` in the `conditions` for the kubernetes `bmm` resource, as described in the [Troubleshooting](#troubleshooting) section, to identify any recently resolved _Degraded_ conditions.
132
132
- Review the Activity Logs for the BMM resource in the Azure portal to check for any user initiated cordon requests.
@@ -171,7 +171,7 @@ To troubleshoot this issue:
171
171
- for more information about diagnosing and fixing LACP issues, see [Troubleshoot LACP Bonding](./troubleshoot-lacp-bonding.md).
172
172
173
173
> [!WARNING]
174
-
> As of version 4.1, there's a known issue where `LACP status is down` can be incorrectly reported in addition to the `port is not functioning as expected` message during a port down scenario. This issue can happen when a BMM is restarted or reimaged while the physical port is down. This issue will be fixed in a future release. In the meantime, the `LACP status is down` warning can be safely ignored if the physical port is also down.
174
+
> As of version 2502.1, there's a known issue where `LACP status is down` can be incorrectly reported in addition to the `port is not functioning as expected` message during a port down scenario. This issue can happen when a BMM is restarted or reimaged while the physical port is down. This issue will be fixed in a future release. In the meantime, the `LACP status is down` warning can be safely ignored if the physical port is also down.
175
175
176
176
**Example `conditions` output for unexpected LACP state**
0 commit comments