Add badfish support for iDRAC cleanup operations #745
openshift-merge-bot[bot] merged 1 commit into redhat-performance:main
I was able to test this updated code against a cluster of old Dell r630 servers. Without these updates, I hit many provisioning issues: nodes would not boot from the virtual media, or provisioning failed on iDRAC-related errors (HTTP 50x, etc.). With this patch, I could deploy OCP twice in a row, a big improvement over my previous experience. Since the new parameter defaults to "false", it is strictly on-demand and shouldn't affect anyone who has not enabled it.
4c7d558 to ee75ab0
akrzos left a comment
I am concerned that some of the changes here have not been tested, such as changing the use of raw to command.
I do want to acknowledge that I completely feel your pain in trying to deploy on the older Dell r630 lab hardware, so I would like to get this in to help alleviate how difficult deployment is on that hardware.
cjeanner left a comment
TL;DR: I need to revert the raw -> command change and add the missing condition.
ee75ab0 to e741e89
akrzos left a comment
Ran the PR in a self-scheduled environment, and it deployed successfully after completing both the job clear and the iDRAC reset.
Additional time:
- job clear - 40s (badfish running)
- iDRAC reset - 26s (badfish running) + 3m 6s waiting for the iDRAC to become responsive again
My final feedback would be to rename one of the tasks noted and remove the word "run" from it. Also consider removing the extra podman install and image pull.
e741e89 to 47b673d
47b673d to 70f1a89
/test ?
/test deploy-mno
@josecastillolema Could you look at this CI failure? It looks like the CI might not be copying and editing the all.sample.yml file, since a var appears to be missing.
This commit adds support for using badfish container to perform iDRAC cleanup operations on Dell hardware. Badfish is used to clear the iDRAC job queue and reset the iDRAC service to improve stability during boot operations. It does not replace redfish_command or URI modules, which continue to be used for standard Redfish operations.

Changes:
- Created new 'badfish' Ansible role with install.yml and call.yml tasks
- Added 'reset_idrac' parameter to control badfish-based iDRAC cleanup operations
- Integrated badfish container installation into bastion bootstrap process
- Updated boot-iso/dell.yml to use badfish for:
  - Clearing iDRAC job queue (always executed)
  - Resetting iDRAC service (when reset_idrac is enabled)
  - Waiting for iDRAC to be available after reset
- Replaced fixed pause with wait_for module to verify host power down
- All badfish operations use quay.io/quads/badfish container image

The badfish role provides a reusable call.yml task file that accepts badfish_host, badfish_user, badfish_password, and badfish_args parameters, making it easy to call badfish commands from other roles.

The 'reset_idrac' parameter pulls and uses the badfish container to perform iDRAC cleanup operations, which helps resolve issues with stuck job queues and improves iDRAC stability during virtual media boot operations.

AI Model: Claude Sonnet 4.5
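
As an illustration of the flow the commit message describes, another role might call the badfish role roughly as sketched below. This is not taken from the PR diff. Only badfish_host, badfish_user, badfish_password, badfish_args, reset_idrac, and the call.yml task file are named in the commit message. The inventory variables (bmc_address, bmc_user, bmc_password), the badfish flags (--clear-jobs, --racreset), the string format of badfish_args, and the port and timing values are assumptions made for this sketch.

```yaml
# Sketch only: variable names other than badfish_* and reset_idrac are assumed.
- name: Clear the iDRAC job queue (always executed)
  ansible.builtin.include_role:
    name: badfish
    tasks_from: call.yml
  vars:
    badfish_host: "{{ bmc_address }}"        # assumed inventory variable
    badfish_user: "{{ bmc_user }}"           # assumed inventory variable
    badfish_password: "{{ bmc_password }}"   # assumed inventory variable
    badfish_args: "--clear-jobs"             # assumed flag and argument format

- name: Reset the iDRAC service (only when reset_idrac is enabled)
  ansible.builtin.include_role:
    name: badfish
    tasks_from: call.yml
  vars:
    badfish_host: "{{ bmc_address }}"
    badfish_user: "{{ bmc_user }}"
    badfish_password: "{{ bmc_password }}"
    badfish_args: "--racreset"               # assumed flag
  when: reset_idrac | default(false) | bool

- name: Wait for the iDRAC to become available after the reset
  ansible.builtin.wait_for:
    host: "{{ bmc_address }}"
    port: 443        # assumed: iDRAC HTTPS port
    delay: 30        # assumed: give the reset time to start
    timeout: 600     # assumed: headroom over the ~3 minutes seen in review
  when: reset_idrac | default(false) | bool
```

Because reset_idrac defaults to false, the last two tasks are skipped unless the parameter is explicitly enabled. The wait_for task checks the port from the host that runs the task, so the sketch assumes that host (for example the bastion) can reach the iDRAC.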
70f1a89 to 0d58b39
/test deploy-mno
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: akrzos
Merged commit d028c3e into redhat-performance:main