-
Notifications
You must be signed in to change notification settings - Fork 72
Open
Labels
Sled AgentRelated to the Per-Sled Configuration and ManagementRelated to the Per-Sled Configuration and Managementfault-managementEverything related to the fault-management initiative (RFD480 and others)Everything related to the fault-management initiative (RFD480 and others)nexusRelated to nexusRelated to nexus
Description
We'll have to figure this out eventually.
Some questions on my mind:
- Do we do this by just having
sled-agentshell out tofmdump -ejor whatever makes it list ereports and format the NVLists as JSON? Or do we want to have proper Rust bindings for the FMA C libraries? - I suspect that the host is a lot better at spewing out a giant pile of identical ereports than Hubris, so we may actually want to worry about deduplication/debouncing a bit more here...
- Can we reuse the sled UUID as the restart ID? That would make stuff easier. Figure out whether this is practical.
- Note that unlike in Hubris, all this stuff is getting written to disk, so we don't inherently lose data on a sled-agent restart; the restart ID may not need to be tied to the lifetime of a single sled-agent process. Figure out where the restart boundary actually is and where data loss occurs on the host side.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
Sled AgentRelated to the Per-Sled Configuration and ManagementRelated to the Per-Sled Configuration and Managementfault-managementEverything related to the fault-management initiative (RFD480 and others)Everything related to the fault-management initiative (RFD480 and others)nexusRelated to nexusRelated to nexus
Type
Fields
Give feedbackNo fields configured for Enhancement.