Skip to content

Commit 95c5127

Browse files
committed
additional feedback comments resolved focused on language and clarity issues
1 parent 99108ef commit 95c5127

File tree

1 file changed

+10
-7
lines changed

1 file changed

+10
-7
lines changed

articles/operator-nexus/howto-baremetal-best-practices.md

Lines changed: 10 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -34,7 +34,7 @@ For this reason, it's essential to understand the available options well when tr
3434

3535
- Familiarize yourself with the relevant documentation, including troubleshooting guides and how-to articles.
3636
Always refer to the latest documentation to stay informed about best practices and updates.
37-
- Attempt to identify the root cause of the failure to avoid repeating the same mistake.
37+
- Avoid repeated failed operations by first attempting to identify the root cause of the failure before retrying the operation.
3838
Perform retry attempts in incremental steps to isolate and address specific issues.
3939
- Wait for Az CLI commands to run to completion and validate the state of the BMM resource before executing other steps.
4040
- Verify that the firmware and software versions are up-to-date before a new greenfield deployment to prevent compatibility issues between hardware and software versions.
@@ -68,7 +68,8 @@ Before initiating any `reimage` operation, ensure the following preconditions ar
6868
- Evaluate any BMM warnings or degraded conditions which could indicate the need to resolve hardware, network, or server configuration problems before a `reimage` operation.
6969
For more information, read [Troubleshoot Degraded Status Errors on Bare Metal Machines] and [Troubleshoot Bare Metal Machine Warning Status].
7070
- Validate that there are no running firmware upgrade jobs through the BMC before initiating a `reimage` operation.
71-
The BMM has `provisioningStatus` in the `Preparing` state. Interrupting an ongoing firmware upgrade can leave the BMM in an inconsistent state.
71+
Interrupting an ongoing firmware upgrade can leave the BMM in an inconsistent state.
72+
Confirm the BMM resource's `detailedStatus` isn't in the `Preparing` state.
7273

7374
## Best Practices for a BMM Replace
7475

@@ -78,8 +79,8 @@ The BMM `replace` action is explained in [BMM Lifecycle Management Commands] and
7879

7980
Hardware failures are a normal occurrence over the life of a server.
8081
Component replacements might be necessary to restore functionality and ensure continued operation.
81-
In cases where one or more hardware components fail on the server, it's necessary to perform a BMM `replace` operation.
82-
The `replace` operation should be executed after any hardware maintenance event. Multiple maintenance events should be done as multiple `replace` operations.
82+
The `replace` operation must be executed after any hardware maintenance/repair event.
83+
When one or more hardware components fail on the server (multiple failures), make the necessary repairs for **all** components before executing a BMM `replace` operation.
8384

8485
> [!IMPORTANT]
8586
> With the `2024-07-01` GA API version, the RAID controller is reset during BMM `replace`, wiping all data from the server's virtual disks.
@@ -89,8 +90,9 @@ The `replace` operation should be executed after any hardware maintenance event.
8990

9091
When a BMM is marked with failed hardware validation, it might indicate that physical repairs are needed.
9192
It's crucial to identify and address these repairs before performing a BMM `replace`.
92-
A hardware validation process is invoked, as part of the `replace` operation, to ensure the physical host's integrity before deploying the OS image.
93-
If the BMM continues to have hardware validation failures, then the BMM can't provision successfully meaning it fails to complete the necessary setup steps to become operational and join the cluster.
93+
A hardware validation process is invoked as part of the `replace` operation to ensure the physical host's integrity before deploying the OS image.
94+
The BMM can't provision successfully when the BMM continues to have hardware validation failures.
95+
As a result, the BMM fails to complete the necessary setup steps to become operational and join the cluster.
9496
Ensure **all hardware validation issues** are cleared before the next `replace` action.
9597

9698
To understand hardware validation result, read through the article [Troubleshoot Hardware Validation Failure](./troubleshoot-hardware-validation-failure.md).
@@ -105,7 +107,8 @@ Before initiating any `replace` operation, ensure the following preconditions ar
105107
- Evaluate any BMM warnings or degraded conditions which could indicate the need to resolve hardware, network, or server configuration problems before a `replace` operation.
106108
For more information, see [Troubleshoot Degraded Status Errors on Bare Metal Machines] and [Troubleshoot Bare Metal Machine Warning Status].
107109
- Validate that there are no running firmware upgrade jobs through the BMC before initiating a `replace` operation.
108-
The BMM has `provisioningStatus` in the `Preparing` state. Interrupting an ongoing firmware upgrade can leave the BMM in an inconsistent state.
110+
Interrupting an ongoing firmware upgrade can leave the BMM in an inconsistent state.
111+
Confirm the BMM resource's `detailedStatus` isn't in the `Preparing` state.
109112

110113
### BMM Replace isn't Required
111114

0 commit comments

Comments
 (0)