Skip to content

Commit 540288b

Browse files
Merge pull request #288729 from vnikolin/vnikolin-hwv-314
Hardware Validation Nexus Release 3.14 Updates
2 parents 2a52fd6 + c75bc56 commit 540288b

File tree

1 file changed

+58
-2
lines changed

1 file changed

+58
-2
lines changed

articles/operator-nexus/troubleshoot-hardware-validation-failure.md

Lines changed: 58 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -195,6 +195,34 @@ Expanding `result_detail` for a given category shows detailed results.
195195
}
196196
```
197197

198+
* Firmware versions are logged as informational. The following component firmware versions are typically logged (depending on hardware model):
199+
* BIOS
200+
* iDRAC
201+
* Complex Programmable Logic Device (CPLD)
202+
* RAID Controller
203+
* Backplane
204+
205+
* The HWV framework identifies problematic firmware versions and attempts to auto fix. The following example shows a successful iDRAC firmware fix (versions and task ID are illustrational only).
206+
207+
```yaml
208+
{
209+
"system_info": {
210+
"system_info_result": "Pass",
211+
"result_detail": [
212+
{
213+
"field_name": "Integrated Dell Remote Access Controller - unsupported_firmware_check",
214+
"comparison_result": "Pass",
215+
"expected": "6.00.30.00 - unsupported_firmware",
216+
"fetched": "7.10.30.00"
217+
}
218+
],
219+
"result_log": [
220+
"Firmware autofix task /redfish/v1/TaskService/Tasks/JID_274085357727 completed"
221+
]
222+
},
223+
}
224+
```
225+
198226
### Drive info category
199227

200228
* Disk Checks Failure
@@ -450,6 +478,17 @@ Expanding `result_detail` for a given category shows detailed results.
450478
}
451479
```
452480

481+
* Allow listed critical alarms and warning alarms are logged as informational starting with Nexus release 3.14.
482+
483+
```yaml
484+
{
485+
"field_name": "LCLog_Warning_Alarms - Non-Failing",
486+
"comparison_result": "Info",
487+
"expected": "Warning Alarm",
488+
"fetched": "104473 2024-07-26T16:05:19-05:00 The Embedded NIC 1 Port 1 network link is down."
489+
}
490+
```
491+
453492
* To check LC logs in BMC webui:
454493

455494
`BMC` -> `Maintenance` -> `Lifecycle Log`
@@ -492,6 +531,17 @@ Expanding `result_detail` for a given category shows detailed results.
492531

493532
* To troubleshoot server power-on failure attempt a flea drain. If problem persists engage vendor.
494533

534+
* Virtual Flea Drain
535+
* HWV attempts a virtual flea drain for most failing checks. Flea drain attempts are logged under `health_info` -> `result_log`.
536+
537+
```yaml
538+
"result_log": [
539+
"flea drain completed successfully",
540+
]
541+
```
542+
543+
* If the virtual flea drain fails, perform a physical flea drain as a first troubleshooting step.
544+
495545
* RAID cleanup failures
496546
* As part of RAID cleanup, the RAID controller configuration is reset. Dell server health check fails for RAID controller reset failure. A failed RAID cleanup action indicates an underlying hardware issue. The following example shows a failed RAID controller reset.
497547

@@ -614,11 +664,17 @@ Expanding `result_detail` for a given category shows detailed results.
614664
### Device login check
615665

616666
* Device Login Check Considerations
617-
* The `device_login` check fails if the iDRAC isn't accessible or if the hardware validation plugin isn't able to sign-in.
667+
* The `device_login` check fails if the iDRAC isn't reachable or if the hardware validation plugin isn't able to sign-in.
668+
669+
```yaml
670+
{
671+
"device_login": "Fail - Unreachable"
672+
}
673+
```
618674

619675
```yaml
620676
{
621-
"device_login": "Fail"
677+
"device_login": "Fail - Unauthorized"
622678
}
623679
```
624680

0 commit comments

Comments
 (0)