BREAKING: Use RTT to optimize request timeouts #69

afterburn · 2025-09-03T07:48:15Z

This PR introduces adaptive RTT estimator for inflight requests.

Benefits

Fewer false timeouts, timeouts are based on measured round-trip times instead of a fixed value, so peers with higher latency aren’t dropped unnecessarily.
Faster recovery from real packet loss, when the network is healthy, the timeout shrinks, allowing retries sooner if a request truly failed.
Stablility under jitter, conservative learning rates smooth out spikes, so one slow response doesn’t cause large swings in timeout.

How
RTT is updated with exponentially weighted moving averages. Timeout is calculated as estimated_rtt + 4 * deviation, clamped to a minimum (to enforce a safe baseline). This mechanism adapts over time to the actual network conditions.

SeverinAlexB

We need to prove first that this algorithm is useful in the context of a DHT. RTT estimates are great when you communicate with a single entity. In mainline, you communicate with a different node more or less every time you send a UDP packet.

I run your code and printed out the calculated request_timeout when using put_immutable. It fluctuates widely, has a even wave pattern in it (see image).

Please explain to me how this estimate should be a good fit for mainline.

SeverinAlexB · 2025-09-08T11:16:01Z

src/rpc/socket.rs

    pub(crate) fn new(config: &Config) -> Result<Self, std::io::Error> {
-        let request_timeout = config.request_timeout;
        let port = config.port;


The request_timeout is still configurable. Is it used somewhere else? Should it be removed? Should the constant timeout time be kept for backward compatability?

SeverinAlexB · 2025-09-08T11:17:01Z

src/rpc/socket/inflight_requests.rs

+            estimated_rtt: Duration::from_millis(INITIAL_ESTIMATED_RTT_MS),
+            deviation_rtt: Duration::from_millis(DEVIATION_RTT_MS),


Can we extract all RTT methods/variable in it's own struct so we have a clear separation between the inflight request struct and RTT?

This way, it would also be easier to write tests.

SeverinAlexB · 2025-09-08T11:33:08Z

src/rpc/socket/inflight_requests.rs

+    /// Updates RTT estimates using exponentially weighted moving averages (EWMA).
+    /// - Estimated RTT = (1-α) * old_estimate + α * sample
+    /// - Deviation RTT = (1-β) * old_deviation + β * |sample - new_estimate|
+    ///
+    /// Conservative learning rates (α=0.05, β=0.1) make the algorithm less sensitive to
+    /// temporary network fluctuations for stable DHT timeout calculations.


Would likely be great to link a RTT guide so the next developer can easily familiarize themselves with the concept
https://how.dev/answers/how-to-compute-devrtt-estimated-rtt-time-out-interval-in-ccn

SeverinAlexB · 2025-09-08T11:36:51Z

src/rpc/socket/inflight_requests.rs

+/// Conservative learning rate for estimated RTT (lower = more stable, higher = faster adaptation)
+const ALPHA: f64 = 0.05;
+/// Conservative learning rate for RTT deviation (lower = more stable, higher = faster adaptation)
+const BETA: f64 = 0.1;


Instead of calling it ALPHA and BETA which nobody except for devs familiar with the RTT calculation actually understands, why not call it something more reasonable?

What comes to my mind: RTT_LEARNING_RATE_ALPHA, or RTT_LEARNING_RATE_ESTIMATE, or RTT_LEARNING_RATE_DEVIATION.

SeverinAlexB · 2025-09-08T11:39:14Z

src/rpc/socket/inflight_requests.rs

+    }
+
+    fn request_timeout(&self) -> Duration {
+        let timeout = self.estimated_rtt + self.deviation_rtt.mul_f64(4.0);


4.0 = Magic number

feat: use rtt to optimize request timeouts

07f5321

afterburn requested review from SHAcollision and SeverinAlexB September 3, 2025 07:48

SeverinAlexB changed the title ~~Use RTT to optimize request timeouts~~ BREAKING: Use RTT to optimize request timeouts Sep 8, 2025

SeverinAlexB requested changes Sep 8, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BREAKING: Use RTT to optimize request timeouts #69

BREAKING: Use RTT to optimize request timeouts #69

Uh oh!

afterburn commented Sep 3, 2025

Uh oh!

SeverinAlexB left a comment

Uh oh!

SeverinAlexB Sep 8, 2025 •

edited

Loading

Uh oh!

SeverinAlexB Sep 8, 2025

Uh oh!

SeverinAlexB Sep 8, 2025

Uh oh!

SeverinAlexB Sep 8, 2025

Uh oh!

SeverinAlexB Sep 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		estimated_rtt: Duration::from_millis(INITIAL_ESTIMATED_RTT_MS),
		deviation_rtt: Duration::from_millis(DEVIATION_RTT_MS),

BREAKING: Use RTT to optimize request timeouts #69

Are you sure you want to change the base?

BREAKING: Use RTT to optimize request timeouts #69

Uh oh!

Conversation

afterburn commented Sep 3, 2025

Uh oh!

SeverinAlexB left a comment

Choose a reason for hiding this comment

Uh oh!

SeverinAlexB Sep 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SeverinAlexB Sep 8, 2025

Choose a reason for hiding this comment

Uh oh!

SeverinAlexB Sep 8, 2025

Choose a reason for hiding this comment

Uh oh!

SeverinAlexB Sep 8, 2025

Choose a reason for hiding this comment

Uh oh!

SeverinAlexB Sep 8, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

SeverinAlexB Sep 8, 2025 •

edited

Loading