
fix: fail fast when postage block is ahead of chain tip #5460

Open

martinconic wants to merge 2 commits into master from fix/4941-chainstate-self-heal

Conversation


@martinconic (Contributor) commented May 13, 2026

Checklist

  • I have read the coding guide.
  • My change requires a documentation update, and I have done it.
  • I have added tests to cover my changes.
  • I have filled out the description and linked the related issues.

Description

  • Refuses to start when the persisted chainstate.Block is ahead of
    the block number reported by blockchain-rpc-endpoint, with a
    clear error pointing at the RPC misconfiguration.
  • Replaces a 10-minute "syncing in progress" stall plus
    lightnode-shutdown / fullnode-init-failure with an immediate, actionable
    failure.

Why

The reporter on #4941 observed /stamps returning 503 "syncing in
progress" indefinitely, with /chainstate showing a stored block ~1.18M
blocks ahead of chainTip. The two symptoms are one bug: once
chainstate.Block > chainTip, the sync window computed by the postage
listener loop in pkg/postage/listener/listener.go always ends below
from, so the loop makes no progress; syncStatusFn never returns
done=true and /stamps stays in 503. After
postageSyncingStallingTimeout (10 min) the loop exits with
ErrPostageSyncingStalled; for light nodes this triggers
b.syncingStopped.Signal() and the node shuts down, while for full
nodes init fails.
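The stall can be reproduced in miniature. The sketch below is a
hypothetical simplification of the listener's window computation;
nextSyncWindow and its parameters are illustrative names, not the real
code in pkg/postage/listener/listener.go:

```go
package main

import "fmt"

// nextSyncWindow mimics the per-iteration window computation (simplified).
// from is the next block to process (stored chainstate.Block + 1), tip is
// the head reported by the RPC, and tail is a confirmation lag.
func nextSyncWindow(from, tip, tail uint64) (to uint64, ok bool) {
	if tip < tail {
		return 0, false
	}
	to = tip - tail
	if to < from {
		// Nothing to sync yet: the loop sleeps and retries. With a
		// persisted block ahead of the tip, it spins here forever.
		return 0, false
	}
	return to, true
}

func main() {
	// Stored block ~1.18M ahead of the tip, as in #4941:
	// no window is ever produced, so sync status never reports done.
	_, ok := nextSyncWindow(2_180_000, 1_000_000, 12)
	fmt.Println(ok) // false

	// Normal operation: from well below tip yields a window.
	to, ok := nextSyncWindow(100, 1_000, 12)
	fmt.Println(to, ok) // 988 true
}
```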

chainstate.Block is only ever advanced by UpdateBlockNumber, from
events the listener receives over the RPC. So a stored block ahead of
the current chain tip means the configured RPC is now serving a
different chain than it did on a previous run: a misrouted public
endpoint, a changed blockchain-rpc-endpoint, or a load balancer
pointing at the wrong backend. The chain-ID check at startup
(pkg/node/chain.go:109) does not catch this if the wrong backend
happens to report the configured chain ID. This is an RPC /
operational problem, not local DB corruption, and not something Bee
should auto-heal; a silent rebuild would mask the misconfiguration and
trigger long resyncs on every restart.

Change

Before batchSvc.Start runs, query chainBackend.BlockNumber(ctx)
once. If the stored chainstate.Block is strictly greater, return an
error naming both block numbers and explaining the likely cause and
the recovery path (verify the RPC, then --resync). If the
BlockNumber call itself fails, log a warning and continue — we don't
want to block startup on a transient RPC hiccup.

No tolerance is applied: the listener only ever writes
cs.Block <= blockNumber - tailSize, so under normal operation cs.Block is
strictly below the live tip. The only way the check can trip is the
misconfiguration scenario above.

Open API Spec Version Changes (if applicable)

Motivation and Context (Optional)

Related Issue (Optional)

Screenshots (if appropriate):

AI Disclosure

  • This PR contains code that has been generated by an LLM.
  • I have reviewed the AI generated code thoroughly.
  • I possess the technical expertise to responsibly review the code generated in this PR.

@martinconic martinconic self-assigned this May 13, 2026
@martinconic martinconic added this to the 2026 milestone May 13, 2026