Disconnect invalid and inactive peers by jmozah · Pull Request #431 · Fantom-foundation/go-opera

jmozah · 2023-02-27T01:21:49Z

This PR adds checks to identify and ban peers that pass the P2P handshake and are accepted into the application protocol but have other application-level issues.

Some clients advertise valid caps (opera/63, fsnap/1) but have invalid client names such as Efireal, go-corex, or Geth.
Progress messages are checked to confirm that the epoch increases within a nominal duration.
An application message must be received within a threshold time.
Recurring application errors now result in banning the peer.

These checks have shown that valid, honestly working peers get priority.

Depends on Fantom-foundation/go-ethereum#44

holisticode · 2023-04-14T17:18:02Z

gossip/handler.go

-		useless = true
+
+	// Some clients have compatible caps and thus pass discovery checks and seep in to
+	// protocol handler. We should band these clients immediately.


nit: little typo

holisticode · 2023-04-14T17:19:30Z

gossip/handler.go

 	txChanSize = 4096
+
+	// percentage of useless peer nodes to allow
+	uselessPeerPercentage = 20 // 20%


Nit: Why don't we use just a factor, e.g. 0.2, instead of then having to calculate each time the percentage?
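A related point on the integer percentage: when the constant is an untyped integer, an expression of the form `maxPeers*(uselessPeerPercentage/100)` divides first, and Go's integer division turns 20/100 into 0. The sketch below only illustrates that arithmetic; `limitTruncated` and `limitScaled` are hypothetical helper names, and `maxPeers` is assumed to be an `int`.

```go
package main

import "fmt"

// Same value as in the diff: 20 means 20%.
const uselessPeerPercentage = 20

// limitTruncated (hypothetical) divides first. Both operands of the
// division are integer constants, so 20/100 evaluates to 0.
func limitTruncated(maxPeers int) int {
	return maxPeers * (uselessPeerPercentage / 100)
}

// limitScaled (hypothetical) multiplies first, so the percentage
// survives integer arithmetic.
func limitScaled(maxPeers int) int {
	return maxPeers * uselessPeerPercentage / 100
}

func main() {
	fmt.Println(limitTruncated(50), limitScaled(50)) // prints "0 10"
}
```

A float factor such as 0.2 would avoid the truncation as well, at the cost of converting `maxPeers` to a float for the comparison.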

holisticode · 2023-04-14T17:20:33Z

gossip/handler.go

+
+	// A useless peer is the one which does not support protocols opera/63 & fsnap/1.
+	useless := !eligibleForSnap(p.Peer)
+	if !p.Peer.Info().Network.Trusted && useless && h.peers.UselessNum() >= (h.maxPeers*(uselessPeerPercentage/100)) {


Question: I am not yet familiar with this useless stuff, but why do we even allow a percentage of useless peers at all? Why don't we just disconnect them all?

Well, the peer is useless in the context of sync, i.e. it doesn't support fsnap/1 and opera/63.
But old peers supporting opera/62 should still be allowed to participate.

Ah, so I assume the useless check has then already established that the peer is an opera/62 peer. It's not just any peer. That would make sense.

holisticode · 2023-04-14T17:50:51Z

gossip/handler.go

-			return err
+		// progress and application
+		progressWatchDogTimer := time.NewTimer(noProgressTime)
+		applicationWatchDogTimer := time.NewTimer(noAppMessageTime)


Aren't we recreating the timers on each iteration of the for loop here? If so, the Resets later are useless. It looks to me like we either have to create the timers outside of the for loop and then Reset them as you do now, or recreate them in each loop iteration and just break where we currently Reset, although that would result in a lot of garbage-collected timers. Or am I missing something?

Oops... the timer should be outside the loop.

holisticode · 2023-04-14T17:52:30Z

gossip/handler.go

+			err := h.handleMsg(p)
+			if err != nil {
+				p.Log().Debug("Message handling failed", "err", err)
+				if strings.Contains(err.Error(), errorToString[ErrPeerNotProgressing]) {


Can we use errors.Is here instead of comparing strings?

We can use errors.Is() only to compare errors. But in this place, the error is defined as a string.
If we want to change it, we should define all the errors as errors.New().

Yes agreed. If there are more such string based errors instead of errors.New() based ones (which I believe would be better) - then this should go into a separate PR to address. So up to you if you want to do anything in this PR.

holisticode · 2023-04-14T17:59:13Z

gossip/handler.go

 		p.SetProgress(progress)
+		// If peer has not progressed for noProgressTime minutes, then disconnect the peer.
+		if !p.IsPeerProgressing() {
+			return errResp(ErrPeerNotProgressing, "%v: %v %v", "epoch is not progressing for ", noProgressTime, "minutes")


Nit: As noProgressTime is a duration, this would print "epoch is not progressing for 3m0s minutes", I think
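A small sketch of the formatting behaviour, assuming `noProgressTime` is a `time.Duration` of three minutes (the value is inferred from the "3m0s" above, and the helper names are hypothetical):

```go
package main

import (
	"fmt"
	"time"
)

// Assumed value; the real constant lives in the PR.
const noProgressTime = 3 * time.Minute

// withDuration lets the Duration print its own unit.
func withDuration(d time.Duration) string {
	return fmt.Sprintf("epoch is not progressing for %v", d)
}

// withMinutes converts to a number first, so the word "minutes" is needed.
func withMinutes(d time.Duration) string {
	return fmt.Sprintf("epoch is not progressing for %.0f minutes", d.Minutes())
}

func main() {
	fmt.Println(withDuration(noProgressTime)) // epoch is not progressing for 3m0s
	fmt.Println(withMinutes(noProgressTime))  // epoch is not progressing for 3 minutes
}
```

Either form avoids the doubled unit; the first stays correct if the threshold is later changed to something that is not a whole number of minutes.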

holisticode · 2023-04-14T18:00:06Z

gossip/handler.go

 		return errResp(ErrInvalidMsgCode, "%v", msg.Code)
 	}
+
+	if msg.Code != ProgressMsg {


I am not yet familiar with all message codes, but is ProgressMsg the only message which signals that there is progress?

holisticode · 2023-04-14T19:39:47Z

gossip/peer.go

+
+func (p *peer) setPeerAsProgressing(x PeerProgress) {
 	p.progress = x
+	p.progressTime = time.Now()


Any specific reason why p.appMessageTime is locked, but p.progressTime isn't?

It's locked in SetProgress() where setPeerAsProgressing() is called.

holisticode · 2023-04-14T19:44:09Z

gossip/peer_test.go

+	newPeer := getPeer()
+	ep1 := PeerProgress{Epoch: 1}
+	newPeer.SetProgress(ep1)
+	time.Sleep(2 * time.Second) //set the threshold to 2 second


All these Sleeps accumulate to 9 seconds, making test runs 9 seconds slower as I understand it. Isn't there a different way to test this? Do we actually even need to sleep?

holisticode

I am not sure if I should already be the only person approving, but I want to signal that this looks good to me now (at least).

kick out invalid and inactive peers

0fec79f

jmozah self-assigned this Feb 27, 2023

jmozah added 2 commits March 3, 2023 23:22

Check if peer is progressing by inspecting progress message

acea296

Use timestamp instead of counter for tracking progress

87d1a81

jmozah marked this pull request as ready for review March 16, 2023 11:15

jmozah requested a review from andrecronje as a code owner March 16, 2023 11:15

jmozah requested a review from uprendis March 16, 2023 11:16

jmozah added 2 commits March 25, 2023 11:17

Add few review fixes

11f8b3c

remove unnessary timer definition

433f956

jmozah mentioned this pull request Mar 29, 2023

Added dynamic and static ban #441

Open

holisticode reviewed Apr 14, 2023

Address Fabio's comments

db6b7d7

holisticode approved these changes Apr 18, 2023

2 participants
