
Peer stuck trying to enter a safe state. #5257

@karknu

Description


While trying to reproduce the poison pig attack with cardano-ignite, one of my peers got stuck trying to enter the safe state.

    "targets": {
      "active": 0,
      "activeBigLedgerPeers": 0,
      "established": 0,
      "establishedBigLedgerPeers": 0,
      "knownBigLedgerPeers": 0,
      "knownPeers": 0,
      "roots": 0
    },
    "ledgerStateJudgement": "TooOld",

The debug data claims that 172.16.4.122 is hot, but tcpdump shows no traffic to that peer and ss doesn't list a socket for it.
The peer itself was last mentioned in the logs about 24h ago.
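
The cross-check described above (a peer reported as hot, but with no socket in ss) can be scripted. The following is a minimal sketch, assuming the text output of ss -tn has been captured and the set of peers the node claims as hot has been copied by hand from the debug data. The function names and the 172.16.4.110 address in the sample are hypothetical and are not part of any cardano tooling.

```python
def established_remotes(ss_output: str) -> set[str]:
    """Return remote IP addresses of ESTAB sockets found in `ss -tn` text."""
    remotes = set()
    for line in ss_output.splitlines():
        fields = line.split()
        # Assumed columns: State Recv-Q Send-Q Local:Port Peer:Port
        if len(fields) < 5 or fields[0] != "ESTAB":
            continue  # skips the header line and non-established sockets
        # Drop the port; rsplit keeps bracketed IPv6 addresses intact.
        remotes.add(fields[4].rsplit(":", 1)[0])
    return remotes


def phantom_hot_peers(claimed_hot: set[str], ss_output: str) -> set[str]:
    """Peers reported as hot that have no established TCP socket."""
    return claimed_hot - established_remotes(ss_output)


# Illustrative input only: the ESTAB line below is made up.
SAMPLE_SS = """State Recv-Q Send-Q Local Address:Port Peer Address:Port
ESTAB 0 0 172.16.4.104:3001 172.16.4.110:3001
"""

print(phantom_hot_peers({"172.16.4.122"}, SAMPLE_SS))
```

A non-empty result means the peer selection state and the kernel's socket table disagree, which is the symptom reported here.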

{"app":[],"at":"2025-12-04T11:35:48.893Z","data":{"connectionId":{"localAddress":{"address":"172.16.4.104","port":"3001"},"remoteAddress":{"address":"172.16.4.122","port":"3001"}},"kind":"PeerMonitoringError","reason":"MiniProtocolExceptions [MiniProtocolException {mpeMiniProtocolNumber = MiniProtocolNum 2, mpeMiniProtocolException = ExceededTimeLimit (ChainSync (Header (HardForkBlock (': * ByronBlock (': * (ShelleyBlock (TPraos StandardCrypto) ShelleyEra) (': * (ShelleyBlock (TPraos StandardCrypto) AllegraEra) (': * (ShelleyBlock (TPraos StandardCrypto) MaryEra) (': * (ShelleyBlock (TPraos StandardCrypto) AlonzoEra) (': * (ShelleyBlock (Praos StandardCrypto) BabbageEra) (': * (ShelleyBlock (Praos StandardCrypto) ConwayEra) ('[] *)))))))))) (Tip HardForkBlock (': * ByronBlock (': * (ShelleyBlock (TPraos StandardCrypto) ShelleyEra) (': * (ShelleyBlock (TPraos StandardCrypto) AllegraEra) (': * (ShelleyBlock (TPraos StandardCrypto) MaryEra) (': * (ShelleyBlock (TPraos StandardCrypto) AlonzoEra) (': * (ShelleyBlock (Praos StandardCrypto) BabbageEra) (': * (ShelleyBlock (Praos StandardCrypto) ConwayEra) ('[] *)))))))))) ServerHasAgency (SingNext SingMustReply)}]"},"env":"10.5.3:00000","host":"p3r3","loc":null,"msg":"","ns":["cardano.node.PeerSelectionActions"],"pid":"128","sev":"Error","thread":"5153"}
{"app":[],"at":"2025-12-04T11:35:48.893Z","data":{"kind":"PeerStatusChangeFailure","peerStatusChangeType":"HotToCooling (ConnectionId {localAddress = 172.16.4.104:3001, remoteAddress = 172.16.4.122:3001})","reason":"ApplicationFailure [MiniProtocolException {mpeMiniProtocolNumber = MiniProtocolNum 2, mpeMiniProtocolException = ExceededTimeLimit (ChainSync (Header (HardForkBlock (': * ByronBlock (': * (ShelleyBlock (TPraos StandardCrypto) ShelleyEra) (': * (ShelleyBlock (TPraos StandardCrypto) AllegraEra) (': * (ShelleyBlock (TPraos StandardCrypto) MaryEra) (': * (ShelleyBlock (TPraos StandardCrypto) AlonzoEra) (': * (ShelleyBlock (Praos StandardCrypto) BabbageEra) (': * (ShelleyBlock (Praos StandardCrypto) ConwayEra) ('[] *)))))))))) (Tip HardForkBlock (': * ByronBlock (': * (ShelleyBlock (TPraos StandardCrypto) ShelleyEra) (': * (ShelleyBlock (TPraos StandardCrypto) AllegraEra) (': * (ShelleyBlock (TPraos StandardCrypto) MaryEra) (': * (ShelleyBlock (TPraos StandardCrypto) AlonzoEra) (': * (ShelleyBlock (Praos StandardCrypto) BabbageEra) (': * (ShelleyBlock (Praos StandardCrypto) ConwayEra) ('[] *)))))))))) ServerHasAgency (SingNext SingMustReply)}]"},"env":"10.5.3:00000","host":"p3r3","loc":null,"msg":"","ns":["cardano.node.PeerSelectionActions"],"pid":"128","sev":"Error","thread":"6513"}
{"app":[],"at":"2025-12-04T11:45:48.895Z","data":{"actualActive":1,"kind":"DemoteHotFailed","peer":{"address":"172.16.4.122","port":"3001"},"reason":"DemotionTimeoutException (Just (MiniProtocolExceptions [MiniProtocolException {mpeMiniProtocolNumber = MiniProtocolNum 2, mpeMiniProtocolException = ExceededTimeLimit (ChainSync (Header (HardForkBlock (': * ByronBlock (': * (ShelleyBlock (TPraos StandardCrypto) ShelleyEra) (': * (ShelleyBlock (TPraos StandardCrypto) AllegraEra) (': * (ShelleyBlock (TPraos StandardCrypto) MaryEra) (': * (ShelleyBlock (TPraos StandardCrypto) AlonzoEra) (': * (ShelleyBlock (Praos StandardCrypto) BabbageEra) (': * (ShelleyBlock (Praos StandardCrypto) ConwayEra) ('[] *)))))))))) (Tip HardForkBlock (': * ByronBlock (': * (ShelleyBlock (TPraos StandardCrypto) ShelleyEra) (': * (ShelleyBlock (TPraos StandardCrypto) AllegraEra) (': * (ShelleyBlock (TPraos StandardCrypto) MaryEra) (': * (ShelleyBlock (TPraos StandardCrypto) AlonzoEra) (': * (ShelleyBlock (Praos StandardCrypto) BabbageEra) (': * (ShelleyBlock (Praos StandardCrypto) ConwayEra) ('[] *)))))))))) ServerHasAgency (SingNext SingMustReply)}]))","targetActive":0},"env":"10.5.3:00000","host":"p3r3","loc":null,"msg":"","ns":["cardano.node.PeerSelection"],"pid":"128","sev":"Info","thread":"59"}
{"app":[],"at":"2025-12-04T11:45:48.895Z","data":{"kind":"DemoteAsynchronous","state":[[{"address":"172.16.4.122","port":"3001"},["PeerCooling",null]]]},"env":"10.5.3:00000","host":"p3r3","loc":null,"msg":"","ns":["cardano.node.PeerSelection"],"pid":"128","sev":"Info","thread":"59"}

I suspect that the node failed to enter the safe state after 20 minutes without blocks. The 15-minute timeout for entering safe mode didn't trigger either.

stuck.log

Metadata



    Labels

    bootstrap: Issues / PRs related to bootstrap peers
    bug: Something isn't working
    cardano-ignite: An issue or bug discovered using cardano-ignite.
    high-priority: high priority issues / PRs
