Skip to content

Commit f627f45

Browse files
authored
Update troubleshoot-bmm-provisioning.md
Fix typos add MAC address check to table.
1 parent 8cca120 commit f627f45

File tree

1 file changed

+13
-11
lines changed

1 file changed

+13
-11
lines changed

articles/operator-nexus/troubleshoot-bmm-provisioning.md

Lines changed: 13 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@ ms.author: bpinto
1111

1212
# Troubleshoot BMM provisioning in Azure Operator Nexus cluster
1313

14-
As part of cluster deploy action, bare metal machines (BMM) are provisioned with required roles to participate in the cluster. This document supports troubleshooting for common provisioning issues using Azure CLI, Azure portal, and the server baseboard management controller (BMC). For the Azure Operator Nexus platform, the underlying server hardware uses integrated Dell remote access controller (iDRAC) as the BMC. Provisioning uses the Preboot eXecution Environment (PXE) interface to load the Opearating System (OS) on the BMM.
14+
As part of cluster deploy action, bare metal machines (BMM) are provisioned with required roles to participate in the cluster. This document supports troubleshooting for common provisioning issues using Azure CLI, Azure portal, and the server baseboard management controller (BMC). For the Azure Operator Nexus platform, the underlying server hardware uses integrated Dell remote access controller (iDRAC) as the BMC. Provisioning uses the Preboot eXecution Environment (PXE) interface to load the Operating System (OS) on the BMM.
1515

1616
## Prerequisites
1717
1. Install the latest version of the [appropriate CLI extensions](howto-install-cli-extensions.md)
@@ -109,13 +109,15 @@ The following conditions can cause provisioning failures:
109109

110110
| Error Type | Resolution |
111111
| ---------- | ---------- |
112-
| BMC shows `Backplane Comm` critical error | 1) Execute remote flea drain. 2) Perform physical flea drain. 3) Execute BMM `replace` action. |
113-
| Boot network data response empty from BMC | 1) Bounce port on fabric device. 2) Execute remote flea drain. 3) Perform physical flea drain. 4) Execute BMM `replace` action. |
114-
| Disk data response empty from BMC | 1) Remove/replace disk. 2) Remove/replace storage controller. 3) Execute remote flea drain. 4) Perform physical flea drain. 5) Execute BMM `replace` action. |
115-
| BMC unreachable | 1) Bounce port on fabric device. 2) Remove/replace cable. 3) Execute remote flea drain. 4) Perform physical flea drain. 5) Execute BMM `replace` action. |
116-
| BMC fails log in | 1) Update credentials on BMC. 2) Execute BMM `replace` action. |
117-
| Memory, CPU, OEM critical errors | 1) Resolve hardware issue with remove/replace. 2) Execute remote flea drain. 3) Perform physical flea drain. 4) Execute BMM `replace` action. |
118-
| Console stuck at grub menu | 1) Execute NVRAM reset. 2) Execute BMM `replace` action. |
112+
| BMC shows `Backplane Comm` critical error. | 1) Execute BMM remote flea drain. 2) Perform BMM physical flea drain. 3) Execute BMM `replace` action. |
113+
| Boot (PXE) network data response empty from BMC. | 1) Reset port on fabric device. 2) Execute BMM remote flea drain. 3) Perform BMM physical flea drain. 4) Execute BMM `replace` action. |
114+
| Boot (PXE) MAC address mismatch. | 1) Validate BMM MAC address data against BMC data. 2) Execute BMM remote flea drain. 3) Perform BMM physical flea drain. 4) Execute BMM `replace` action. |
115+
| BMC MAC address mismatch | 1) Validate BMM MAC address data against BMC data. 2) Execute BMM remote flea drain. 3) Perform BMM physical flea drain. 4) Execute BMM `replace` action. |
116+
| Disk data response empty from BMC. | 1) Remove/replace disk. 2) Remove/replace storage controller. 3) Execute BMM remote flea drain. 4) Perform BMM physical flea drain. 5) Execute BMM `replace` action. |
117+
| BMC unreachable. | 1) Reset port on fabric device. 2) Remove/replace cable. 3) Execute BMM remote flea drain. 4) Perform BMM physical flea drain. 5) Execute BMM `replace` action. |
118+
| BMC fails log in. | 1) Update credentials on BMC. 2) Execute BMM `replace` action. |
119+
| Memory, CPU, OEM critical errors on BMC. | 1) Resolve hardware issue with remove/replace. 2) Execute BMM remote flea drain. 3) Perform BMM physical flea drain. 4) Execute BMM `replace` action. |
120+
| Console stuck at boot loader (GRUB) menu. | 1) Execute NVRAM reset. 2) Execute BMM `replace` action. |
119121

120122
### Azure BMM activity log
121123

@@ -149,7 +151,7 @@ racadm --nocertwarn -r $IP -u $BMC_USR -p $BMC_PWD getsysinfo | grep "MAC Addres
149151
racadm --nocertwarn -r $IP -u $BMC_USR -p $BMC_PWD getsysinfo | grep "NIC.Embedded.1-1-1" #Boot MAC
150152
```
151153

152-
If the MAC address supplied to the Cluster is incorrect, use the BMM replace action at [BMM actions](howto-baremetal-functions.md) to correct the addresses.
154+
If the MAC address supplied to the cluster is incorrect, use the BMM `replace` action at [BMM actions](howto-baremetal-functions.md) to correct the addresses.
153155

154156
### Ping test BMC connectivity
155157

@@ -217,9 +219,9 @@ If the activity log indicates invalid credentials on the BMC, run the following
217219
racadm -r $BMC_IP -u $BMC_USER -p $CURRENT_PASSWORD set iDRAC.Users.2.Password $BMC_PWD
218220
```
219221

220-
## Adding servers back into the Cluster after a repair
222+
## Adding servers back into the cluster after a repair
221223

222-
After hardware is fixed, run BMM replace action following instructions from the following page [BMM actions](howto-baremetal-functions.md).
224+
After hardware is fixed, run BMM `replace` action following instructions from the following page [BMM actions](howto-baremetal-functions.md).
223225

224226
If you still have questions, [contact support](https://portal.azure.com/?#blade/Microsoft_Azure_Support/HelpAndSupportBlade).
225227
For more information about support plans, see [Azure Support plans](https://azure.microsoft.com/support/plans/response/).

0 commit comments

Comments
 (0)