feat: Add slow sync strike out logic by sergerad · Pull Request #2196 · 0xMiden/node

sergerad · 2026-06-04T02:13:45Z

Relates to #2176.

Strikes out sync subscriptions / connections when they take longer than block_time seconds to read events. This relies on the capacity of the watch channels, which is currently 32.
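
For illustration only, the strike-out idea can be sketched with a bounded standard-library channel: the producer waits at most a timeout for buffer space and gives up on the subscriber otherwise. This is not the code in this PR. `send_or_strike`, `Strike`, and the polling loop are hypothetical names and choices for the sketch, and the real implementation presumably uses async channels rather than `std::sync::mpsc`.

```rust
use std::sync::mpsc::{SyncSender, TrySendError};
use std::thread;
use std::time::{Duration, Instant};

/// Why a subscriber was struck out (hypothetical type for this sketch).
#[derive(Debug, PartialEq)]
pub enum Strike {
    /// The buffer stayed full for the whole timeout.
    TooSlow,
    /// The receiving side has gone away.
    Disconnected,
}

/// Try to enqueue `event`, waiting at most `timeout` for buffer space.
/// Polls with `try_send`, since the std channel has no stable send-with-timeout.
pub fn send_or_strike<T>(
    tx: &SyncSender<T>,
    mut event: T,
    timeout: Duration,
) -> Result<(), Strike> {
    let deadline = Instant::now() + timeout;
    loop {
        match tx.try_send(event) {
            Ok(()) => return Ok(()),
            Err(TrySendError::Disconnected(_)) => return Err(Strike::Disconnected),
            Err(TrySendError::Full(returned)) => {
                if Instant::now() >= deadline {
                    return Err(Strike::TooSlow);
                }
                // Take the event back and retry shortly.
                event = returned;
                thread::sleep(Duration::from_millis(1));
            }
        }
    }
}
```

With a channel created by `sync_channel(32)`, a subscriber that stops reading would fill the buffer, and the next send would return `Strike::TooSlow` once the timeout (block_time in the description above) elapses.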

Changelog

changelog = "none"
reason    = "Internal change only."

sergerad · 2026-06-04T02:21:15Z

@Mirko-von-Leipzig does this seem like a sensible way of doing it? We might want to lower the capacity from 32.

Mirko-von-Leipzig

Code itself looks good; but I think there is a problem with using the block interval conceptually.

kkovaacs

Looks good to me, modulo the Docker cache ID changes, which we should fix somewhere else?

Mirko-von-Leipzig · 2026-06-09T14:48:43Z

-                    proof,
-                    proven_chain_tip: tip,
-                }))
-                .await


I still think we have a problem 😓 I think having this is good. It applies a maximum bound on handling a single block by the client so the loop doesn't get stuck.

But this also means a client can fall behind more and more so long as they are faster than the timeout, but slower than the block interval.

I also think my previous idea was too complex and had quite a few edge cases. I've got a new proposal that I hope is simpler.

We keep the send timeout. This keeps us from locking up the loop.

Then every N blocks that the chain tip advances:

    // The current client gap
    let current_gap = tip - next;
    if current_gap > previous_gap {
        bad_windows += 1;
    } else {
        bad_windows = bad_windows.saturating_sub(1);
    }
    // Give some grace when near the chain tip.
    previous_gap = current_gap.min(10);
    if bad_windows > 3 {
        // drop client;
    }

Values are placeholders for some better constants of course.
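
A self-contained sketch of the same proposal, wrapped in a small struct so it can be exercised in isolation. `GapTracker`, `observe`, and the two constants are hypothetical names, and the values 10 and 3 are the placeholders from the snippet, not tuned settings. The only deliberate deviation is `saturating_sub` for the gap, assuming block numbers are unsigned.

```rust
/// Placeholder constants mirroring the snippet in this comment; not tuned values.
const GRACE_GAP: u64 = 10;
const MAX_BAD_WINDOWS: u32 = 3;

/// Per-client state, updated once every N blocks that the chain tip advances.
#[derive(Debug, Default)]
pub struct GapTracker {
    previous_gap: u64,
    bad_windows: u32,
}

impl GapTracker {
    /// Records one window. `tip` is the chain tip and `next` is the next block
    /// the client will receive. Returns `true` if the client should be dropped.
    pub fn observe(&mut self, tip: u64, next: u64) -> bool {
        // The current client gap (saturating, in case `next` is ahead of `tip`).
        let current_gap = tip.saturating_sub(next);
        if current_gap > self.previous_gap {
            self.bad_windows += 1;
        } else {
            self.bad_windows = self.bad_windows.saturating_sub(1);
        }
        // Same cap as the snippet: the remembered gap never exceeds GRACE_GAP.
        self.previous_gap = current_gap.min(GRACE_GAP);
        self.bad_windows > MAX_BAD_WINDOWS
    }
}
```

One property worth noting when picking real constants: because the remembered gap is capped at `GRACE_GAP`, every window in which the gap is above that cap counts as a bad window, whether or not the gap shrank since the previous window.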

> But this also means a client can fall behind more and more so long as they are faster than the timeout, but slower than the block interval.

I don't think this is a problem, or at least not one we solve by disconnecting the client. And I think this logic introduces a problem: we disconnect clients that could otherwise have burst and caught up to the tip without needing to reconnect.

Disconnecting on the timeout makes sense on the basis that such clients are probably dead / dying and so the connection shouldn't exist anymore. I don't think the connection should cease to exist when clients experience temporary slowdown / lag.

> I don't think the connection should cease to exist when clients experience temporary slowdown / lag.

Agreed; but I also don't want clients that will never catch up to the chain tip. And as it stands a single timeout allows for that.

What is the behavior of the client if they are disconnected? Basically, let's say we have a client that is slow and will never catch up. We give it some grace period, but let's say it is falling further and further behind, and we drop the connection. Would the client just try to connect again at that point? Or would we put this client on some kind of list of "bad" clients so that if it tries to connect again, we won't accept the connection in the first place?

It tries to reconnect yeah.

Currently our only restriction here afaik is that we limit the total number of subscriptions available. So this at least gives opportunity for another client to take its place.

Better would be some sort of timeout e.g. client can't resubscribe for N hours or minutes. We could even include that information in the error and we can log it on the client.

Regardless though we need something like this PR first.

My main concern is that clients that aren't near the tip (and therefore load data from disk) are expensive. So if they're never going to be near the tip then I don't want them.

Makes sense. Agreed that we should probably split this up into two parts:

1. Detect and disconnect clients that are falling behind. I think the logic here could be something like you described here - i.e., if the client is far away from the tip, and the gap is not closing for some time, drop the client.

2. Put such a client on a list that would prevent immediate re-connection. Otherwise, the effect of the first step is pretty limited.
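
The second part could look roughly like the sketch below: a cooldown list that rejects re-subscription until a deadline passes and reports how long the client still has to wait, so the error can be logged on the client side. Everything here is hypothetical (`Cooldowns`, `CoolingDown`, string client IDs); the thread does not specify how clients are identified or how long the cooldown should be.

```rust
use std::collections::HashMap;
use std::time::{Duration, Instant};

/// Error returned to a client that tries to resubscribe too early.
#[derive(Debug, PartialEq)]
pub struct CoolingDown {
    /// How long the client still has to wait.
    pub retry_after: Duration,
}

/// Maps a client identifier (a plain string here, as an assumption) to the
/// instant at which it may subscribe again.
#[derive(Debug, Default)]
pub struct Cooldowns {
    until: HashMap<String, Instant>,
}

impl Cooldowns {
    /// Called when a client is dropped for falling behind.
    pub fn penalize(&mut self, client: &str, now: Instant, cooldown: Duration) {
        self.until.insert(client.to_owned(), now + cooldown);
    }

    /// Called when a client asks to subscribe.
    pub fn check(&mut self, client: &str, now: Instant) -> Result<(), CoolingDown> {
        match self.until.get(client).copied() {
            Some(until) if until > now => Err(CoolingDown { retry_after: until - now }),
            Some(_) => {
                // Cooldown expired: forget the entry and accept the client.
                self.until.remove(client);
                Ok(())
            }
            None => Ok(()),
        }
    }
}
```

Passing `now` in explicitly keeps the sketch deterministic to test; production code would more likely read the clock internally.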

I have added logic similar to the snippet above in this thread. The main difference is that instead of a bad-window count, we use the size (in blocks) of the gap to decide whether to disconnect or not. My thinking was that a series of small gaps (a single block each) could cause a client to disconnect, which would be undesirable.
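
One possible reading of this, sketched for discussion and not taken from the PR's code: keep a running total of how many blocks the gap has grown by, let it shrink again when the client catches up, and disconnect only once the total exceeds a budget. `RunningGap`, `observe`, and `MAX_GAP_GROWTH` are hypothetical, and the budget of 32 is an arbitrary placeholder.

```rust
/// Hypothetical budget: how many blocks of net gap growth are tolerated.
const MAX_GAP_GROWTH: u64 = 32;

#[derive(Debug, Default)]
pub struct RunningGap {
    previous_gap: u64,
    total_growth: u64,
}

impl RunningGap {
    /// Records one window; returns `true` if the client should be disconnected.
    pub fn observe(&mut self, tip: u64, next: u64) -> bool {
        let gap = tip.saturating_sub(next);
        if gap > self.previous_gap {
            // The gap widened: add the number of blocks it grew by.
            self.total_growth += gap - self.previous_gap;
        } else {
            // The gap held or narrowed: give the blocks back, never below zero.
            self.total_growth = self.total_growth.saturating_sub(self.previous_gap - gap);
        }
        self.previous_gap = gap;
        self.total_growth > MAX_GAP_GROWTH
    }
}
```

Under this reading, a client that slips by a single block in each of several windows accumulates only a small total and stays connected, while a client whose gap grows by dozens of blocks is dropped.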

sergerad added 2 commits June 4, 2026 14:07

Add slow sync strike out logic

b6fefaa

Add const for capacity

a042c4f

sergerad requested a review from Mirko-von-Leipzig June 4, 2026 02:14

sergerad added the "no changelog" label (This PR does not require an entry in the `CHANGELOG.md` file) Jun 4, 2026

Lint

f002548

Mirko-von-Leipzig reviewed Jun 4, 2026

Comment thread crates/store/src/state/subscription.rs Outdated

Comment thread crates/store/src/state/subscription.rs

sergerad added 3 commits June 4, 2026 17:06

Add SubscriptionSource trait

15ab59c

Use block limit logic

f5b4438

RM whitespace

1734193

sergerad requested review from Mirko-von-Leipzig and kkovaacs June 5, 2026 00:37

Changelog

409dd2d

sergerad removed the "no changelog" label (This PR does not require an entry in the `CHANGELOG.md` file) Jun 5, 2026

sergerad added 3 commits June 8, 2026 10:39

RM gap logic

ef7f63c

Merge branch 'next' of github.com:0xMiden/miden-node into sergerad-dc…

d09b0d9

…-slow-sync

Update docker cache line

7385ce5

kkovaacs approved these changes Jun 8, 2026

sergerad added 2 commits June 9, 2026 13:27

Merge branch 'next' of github.com:0xMiden/miden-node into sergerad-dc…

b7c9754

…-slow-sync

Update Dockerfile

5cd5a32

Mirko-von-Leipzig reviewed Jun 9, 2026

Merge branch 'next' of github.com:0xMiden/miden-node into sergerad-dc…

3cf34a2

…-slow-sync

Mirko-von-Leipzig mentioned this pull request Jun 10, 2026

Slow stream subscribers should receive a timeout #2234 (Open)

sergerad added 2 commits June 11, 2026 10:40

Merge branch 'next' of github.com:0xMiden/miden-node into sergerad-dc…

ff48e3c

…-slow-sync

Add running total gap logic

ce66cd3

sergerad added the "no changelog" label (This PR does not require an entry in the `CHANGELOG.md` file) Jun 11, 2026

sergerad requested a review from Mirko-von-Leipzig June 11, 2026 00:19

sergerad wants to merge 15 commits into next from sergerad-dc-slow-sync


Review thread comments, in order:

Mirko-von-Leipzig · Jun 9, 2026 (edited)
sergerad · Jun 9, 2026
Mirko-von-Leipzig · Jun 10, 2026
bobbinth · Jun 10, 2026
Mirko-von-Leipzig · Jun 10, 2026 (edited)
bobbinth · Jun 10, 2026
Mirko-von-Leipzig · Jun 10, 2026
sergerad · Jun 11, 2026


4 participants
