Skip to content

Broken Hardware: MOC-R4PCC04U35 (H100) #1697

@hakasapl

Description

@hakasapl

Node Details

  • Node Name: MOC-R4PCC04U35
  • Node Type: H100
  • Cluster Membership: NERC OCP Prod
  • Cluster Node Name (wrk-XY): wrk-127
  • Serial #: J70127BK
  • Physical Location (RX-PX-CX UX): R4-PC-C04 U35

Describe the issue the node is experiencing

Node reports lower than 512 CPUs (might already be fixed but can't boot to check)

Locked out of iDRAC - can't access setup to reset it

Node won't post past DXE INIT

Node Status

No action can be taken on this node until both of these boxes are checked

  • Check this box once this node is no longer in a cluster from a user perspective and can be rebooted and wiped as needed.
  • Maintenance mode is enabled on this node

Vendor Ticket Information

  • A ticket has been opened with a vendor concerning this hardware

  • Ticket Vendor (Lenovo, Dell, etc.): Lenovo

  • Ticket Number: 3000720509

  • Primary Contact Email on the Ticket: hsap@bu.edu

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions