[WIP] core, trie: refactor trie API #26995 #1147

gzliudan · 2025-06-24T09:16:33Z

Proposed changes

Ref: ethereum#26995

Types of changes

What types of changes does your code introduce to XDC network?
Put an ✅ in the boxes that apply

Bugfix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation Update (if none of the other choices apply)
Regular KTLO or any maintenance work, e.g. code style
CICD Improvement

Impacted Components

Which part of the codebase will this PR touch?

Put an ✅ in the boxes that apply

Checklist

Put an ✅ in the boxes once you have confirmed the actions below (or provide reasons for not doing so):

This PR has sufficient test coverage (unit/integration test) OR I have provided reason in the PR description for not having test coverage
Provide an end-to-end test plan in the PR description on how to manually test it on the devnet/testnet.
Tested the backwards compatibility.
Tested with XDC nodes running this version co-exist with those running the previous version.
Relevant documentation has been updated as part of this PR
N/A

* eth/downloader: refactor downloader + queue
  downloader, fetcher: throttle-metrics, fetcher filter improvements, standalone resultcache
  downloader: more accurate deliverytime calculation, less mem overhead in state requests
  downloader/queue: increase underlying buffer of results, new throttle mechanism
  eth/downloader: updates to tests
  eth/downloader: fix up some review concerns
  eth/downloader/queue: minor fixes
  eth/downloader: minor fixes after review call
  eth/downloader: testcases for queue.go
  eth/downloader: minor change, don't set progress unless progress...
  eth/downloader: fix flaw which prevented useless peers from being dropped
  eth/downloader: try to fix tests
  eth/downloader: verify non-deliveries against advertised remote head
  eth/downloader: fix flaw with checking closed-status causing hang
  eth/downloader: hashing avoidance
  eth/downloader: review concerns + simplify resultcache and queue
  eth/downloader: add back some locks, address review concerns
  downloader/queue: fix remaining lock flaw
* eth/downloader: nitpick fixes
* eth/downloader: remove the *2*3/4 throttling threshold dance
* eth/downloader: print correct throttle threshold in stats
Co-authored-by: Péter Szilágyi <[email protected]>

…m#21427

This slightly changes how the downloader works. Previously, when block sync started, we immediately started filling up to 8192 blocks. Usually this is fine, since blocks at early heights are small. The threshold is then lowered as we measure the size of the blocks being filled. However, if the node is shut down and resumes syncing in the middle of a heavy segment, that initial limit may be too high. This PR introduces a more conservative initial threshold of 2K blocks instead.
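The threshold behaviour described above can be sketched roughly as follows. This is an illustration only, not the downloader's code: the constant names, the `throttleThreshold` helper and the 256 MiB memory budget are assumptions made for the example; only the 8192 ceiling and the 2K starting value come from the text.

```go
package main

import "fmt"

const (
	maxItems     = 8192      // upper bound on queued blocks, as stated in the text
	initialItems = 2048      // conservative starting threshold ("2K blocks")
	memoryBudget = 256 << 20 // ASSUMED memory budget for queued blocks, in bytes
)

// throttleThreshold returns how many blocks may be queued for a measured
// average block size in bytes. A non-positive size means "no measurement
// yet", in which case the conservative initial value is used.
func throttleThreshold(avgBlockSize int) int {
	if avgBlockSize <= 0 {
		return initialItems
	}
	limit := memoryBudget / avgBlockSize
	if limit > maxItems {
		limit = maxItems
	}
	if limit < 1 {
		limit = 1
	}
	return limit
}

func main() {
	for _, size := range []int{0, 500, 1 << 20} {
		fmt.Printf("avg block size %d bytes -> threshold %d\n", size, throttleThreshold(size))
	}
}
```

Starting from the smaller value means a restart inside a heavy segment cannot queue thousands of large blocks before the first size measurement arrives.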

…21504

…um#21987

Fixes a special case when the trie only has a single trie node and the range proof only contains a single element.

…um#22667 * core/state/snapshot: reuse memory data instead of hitting disk when generating * trie: minor nitpicks wrt the resolver optimization * core/state/snapshot, trie: use key/value store for resolver * trie: fix linter Co-authored-by: Péter Szilágyi <[email protected]>

…eum#22760 * trie: add benchmark for proofless range * trie: remove unused returns + use stacktrie

…m#23415 Some tests take quite some time during exit, which I think causes some appveyor fails like this: https://ci.appveyor.com/project/ethereum/go-ethereum/builds/39511210/job/xhom84eg2e4uulq3 One of the things that seem to take time during exit is waiting (up to 100ms) for the syncbloom to close. This PR changes it to use a channel, instead of looping with a 100ms wait. This also includes some unrelated changes improving the reliability of eth/fetcher tests, which fail a lot because they are time-dependent.
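The change from a polling loop to a channel can be illustrated with a small sketch. The `worker` type below is hypothetical and only stands in for a background component such as the sync bloom; it is not the go-ethereum implementation.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// worker is a HYPOTHETICAL background component used for illustration.
type worker struct {
	quit      chan struct{} // closed to ask the background loop to stop
	done      chan struct{} // closed by the loop once it has exited
	closeOnce sync.Once
}

func newWorker() *worker {
	w := &worker{quit: make(chan struct{}), done: make(chan struct{})}
	go w.loop()
	return w
}

// loop simulates periodic background work until quit is closed.
func (w *worker) loop() {
	defer close(w.done)
	ticker := time.NewTicker(time.Millisecond)
	defer ticker.Stop()
	for {
		select {
		case <-ticker.C:
			// A unit of background work would run here.
		case <-w.quit:
			return
		}
	}
}

// Close returns as soon as the loop has exited, instead of checking a flag
// in a loop with a 100ms sleep between checks. It is safe to call twice.
func (w *worker) Close() {
	w.closeOnce.Do(func() { close(w.quit) })
	<-w.done
}

func main() {
	w := newWorker()
	start := time.Now()
	w.Close()
	fmt.Println("close took", time.Since(start))
}
```

With the channel, shutdown latency is bounded by how quickly the loop observes `quit` rather than by the polling interval.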

…nt to trie ethereum#23567

This functionality is needed by the new path-based storage scheme, though it can be implemented in a separate PR. When an account is deleted, all of its storage slots should be removed from disk as well. In the hash-based storage scheme they are left on disk, but in the new scheme they will be iterated and marked as deleted. Why is the NodeBlob API needed in this scenario? Because when a node is marked as deleted, its previous value must also be recorded in order to construct the reverse diff.

…thereum#24392

* trie: fix memory leak in trie iterator
  In the trie iterator, live nodes are tracked in a stack while iterating. Popped node states should be explicitly set to nil in order to get garbage-collected.
* trie: fix empty trie iterator

Trie tracer is an auxiliary tool to capture all deleted nodes which can't be captured by trie.Committer. The deleted nodes can be removed from the disk later.

* core, trie, eth, cmd: rework preimage store * trie: address comment

* all: rework trie and trie committer
* all: get rid of internal cache in trie
* all: fixes
* trie: polish
* core, trie: address comments
* trie: fix imports
* core/state: address comments
* core/state/snapshot: polish
* trie: remove unused code
* trie: update tests
* trie: don't set db as nil
* trie: address comments
* trie: unskip test

…written ethereum#25458
* core: use TryGetAccount to read where TryUpdateAccount has been used to write
* Gary's review feedback
* implement Gary's suggestion
* fix bug + rename NewSecure into NewStateTrie
* trie: add backwards-compatibility aliases for SecureTrie
* Update database.go
* make the linter happy
Co-authored-by: Felix Lange <[email protected]>
Co-authored-by: rjl493456442 <[email protected]>

This is a trivial PR that hides the error log emitted when a trie node is not found in the database. The idea is that all TryXXX functions already return the error, so there is no need to log it explicitly as well.

Recently a few tickets (ethereum#25613, ethereum#25589) reported trie nodes going missing because of debug.SetHead. The root cause is that after the reset, the chain rewinds to a historical point and re-imports the blocks on top of it. Since the node was already synced and had started accepting transactions, those transactions are still kept in the txpool and are verified by the txpool against a live state. This live state is constructed from the live trie database, which changes quickly through node referencing and dereferencing. Unfortunately, when we construct a live state (like the state in the txpool), we don't reference the state we hold. The blockchain garbage-collects the intermediate node versions in another thread, which leads to a broken live state.

The best solution is to forcibly obtain a reference for every live state we create and to call a release function once it is no longer used. But this might end up persisting more junk to disk. We will try to find an elegant solution in a following PR.

…25694

This PR includes minor updates to comments in trie/committer that reference insertion to the db, and adds an err != nil check for the return value of preimages.commit.

Co-authored-by: Martin Holst Swende <[email protected]>

This PR introduces a node scheme abstraction. The interface is only implemented by `hashScheme` at the moment, but will be extended by `pathScheme` very soon. Apart from that, a few other changes are worth mentioning:
- port the changes in the stacktrie, tracking the path prefix of nodes during commit
- use ethdb.Database for constructing trie.Database. This is not necessary right now, but it is required by the path-based scheme to open the reverse diff freezer

This PR fixes an error in trie commit. If trie.root is nil, there are two possible scenarios:
- The trie was empty, and no change happened
- The trie was non-empty and all nodes were dropped
In the latter case, we should collect the deletions and apply them to the database (e.g. in PBSS).
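The two nil-root cases can be told apart by whether any deletions were tracked since the last commit. The sketch below is a simplified illustration under that assumption; `toyTrie`, `nodeSet` and their fields are hypothetical and do not mirror the real trie types.

```go
package main

import "fmt"

// nodeSet is a HYPOTHETICAL collection of changes produced by a commit.
// An entry with a nil value marks the node at that path as deleted.
type nodeSet struct {
	nodes map[string][]byte
}

// toyTrie is a HYPOTHETICAL, heavily simplified trie that models only what
// is needed to illustrate the nil-root case.
type toyTrie struct {
	root    []byte   // nil when the trie currently holds no nodes
	deleted []string // paths of nodes removed since the last commit
}

// commit returns the changes to persist. With a nil root it returns nil if
// nothing was deleted (the trie was empty all along) and a set of deletion
// markers if a previously non-empty trie was emptied.
func (t *toyTrie) commit() *nodeSet {
	if t.root == nil {
		if len(t.deleted) == 0 {
			return nil
		}
		set := &nodeSet{nodes: make(map[string][]byte)}
		for _, path := range t.deleted {
			set.nodes[path] = nil
		}
		return set
	}
	// Non-empty trie: the regular commit path is reduced to a placeholder.
	return &nodeSet{nodes: map[string][]byte{"": t.root}}
}

func main() {
	empty := &toyTrie{}
	fmt.Println("always empty, nothing to write:", empty.commit() == nil)

	emptied := &toyTrie{deleted: []string{"a", "ab"}}
	fmt.Println("deletions collected:", len(emptied.commit().nodes))
}
```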

* all: cleanup trie interface * eth, trie: address comments

This PR moves some trie-related db accessor methods to a different file, and also removes the schema type. Instead of the schema type, a string is used to distinguish between hashbased/pathbased db accessors. This also moves some code from trie package to rawdb package. This PR is intended to be a no-functionality-change prep PR for ethereum#25963 . --------- Co-authored-by: Gary Rong <[email protected]>

ethereum#26641

This changes the Trie interface to add the plain account address as a parameter to all storage-related methods. After the introduction of the TryAccount* functions, TryGet, TryUpdate and TryDelete are now only meant to read an account's storage. In their current form, they assume that an account storage is stored in a separate trie, and that the hashing of the slot is independent of its account's address. The proposed structure for a stateless storage breaks these two assumptions: the hashing of a slot key requires the address and all slots and accounts are stored in a single trie. This PR therefore adds an address parameter to the interface. It is ignored in the MPT version, so this change has no functional impact, however it will reduce the diff size when merging verkle trees.
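To illustrate why the address parameter matters, here is a minimal sketch with two toy backends. All names (`storageTrie`, `mptStorage`, `singleTrieStorage`, the method names and the key derivation) are assumptions made for the example and are not the interface defined by this PR.

```go
package main

import "fmt"

// address stands in for common.Address so the sketch stays self-contained.
type address [20]byte

// storageTrie is a HYPOTHETICAL, reduced view of the storage-related methods,
// each taking the plain account address as an extra parameter.
type storageTrie interface {
	GetStorage(addr address, key []byte) ([]byte, error)
	UpdateStorage(addr address, key, value []byte) error
}

// mptStorage models one storage trie per account: the trie already belongs
// to a single account, so the address argument is ignored.
type mptStorage struct{ slots map[string][]byte }

func (m *mptStorage) GetStorage(_ address, key []byte) ([]byte, error) {
	return m.slots[string(key)], nil
}

func (m *mptStorage) UpdateStorage(_ address, key, value []byte) error {
	m.slots[string(key)] = value
	return nil
}

// singleTrieStorage models a layout where all accounts share one structure,
// so the lookup key must be derived from both the address and the slot key.
// Plain concatenation is a toy derivation, not a real hashing scheme.
type singleTrieStorage struct{ slots map[string][]byte }

func (s *singleTrieStorage) flatKey(addr address, key []byte) string {
	return string(addr[:]) + string(key)
}

func (s *singleTrieStorage) GetStorage(addr address, key []byte) ([]byte, error) {
	return s.slots[s.flatKey(addr, key)], nil
}

func (s *singleTrieStorage) UpdateStorage(addr address, key, value []byte) error {
	s.slots[s.flatKey(addr, key)] = value
	return nil
}

func main() {
	var alice, bob address
	alice[0], bob[0] = 1, 2

	var st storageTrie = &singleTrieStorage{slots: map[string][]byte{}}
	st.UpdateStorage(alice, []byte("slot"), []byte("A"))
	st.UpdateStorage(bob, []byte("slot"), []byte("B"))

	a, _ := st.GetStorage(alice, []byte("slot"))
	b, _ := st.GetStorage(bob, []byte("slot"))
	fmt.Printf("alice=%s bob=%s\n", a, b)
}
```

Callers written against the interface do not need to change when the backend switches from per-account tries to a single shared structure.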

This change renames StateTrie methods to remove the Try* prefix. We added the Trie methods with prefix 'Try' a long time ago, working around the problem that most existing methods of Trie did not return the database error. This weird naming convention has persisted until now. Co-authored-by: Gary Rong <[email protected]>

In this PR, all TryXXX APIs of the trie (e.g. TryGet) are renamed to XXX (e.g. Get) and return an error. The original XXX APIs (e.g. Get) are renamed to MustXXX (e.g. MustGet) and do not return an error; instead they log it. A future PR will change this behaviour to panic on errors.

holiman and others added 30 commits June 9, 2025 14:00

eth/downloader: save the correct delivery time for state sync ethereu…

cdc466d

…m#21427

core, eth, trie: prepare trie sync for path based operation ethereum#…

03185ec

…21504

trie: extend range proof ethereum#21250

9ae59fd

all: disable recording preimage of trie keys ethereum#21402

d69a20e

core, trie: speed up some tests with quadratic processing flaw ethere…

081eb10

…um#21987

trie: upgrade for snap protocol ethereum#21482

85318b5

trie: fix range prover ethereum#22210

55f8714

Fixes a special case when the trie only has a single trie node and the range proof only contains a single element.

trie: faster snapshot generation ethereum#22504

26986da

trie: remove redundant returns + use stacktrie where applicable ether…

3f93c3b

…eum#22760 * trie: add benchmark for proofless range * trie: remove unused returns + use stacktrie

trie: simplify range proofs ethereum#22762

a6d6af0

trie: remove the duplicate write for preimage ethereum#23001

ca17b9d

trie: small optimization of delete in fullNode case ethereum#22979

5ba7d94

core, trie: add state metrics ethereum#23433

d61a159

core/state: move state account to core/types + abstracted write accou…

951a98b

…nt to trie ethereum#23567

trie: better error-handling ethereum#23657

ef2a862

trie: reject deletions when verifying range proofs ethereum#23960

128a1a0

trie: remove the sync bloom, used by fast sync ethereum#24047

b94c988

core, trie: use db.has over db.get where possible ethereum#24117

ff42149

trie: readonly interface for trie iterator resolver ethereum#24221

77299f3

trie: fix range prover ethereum#24266

c0d5ac1

trie: test for edgecase in VerifyRangeProof ethereum#24257

8982fc4

core, ethdb, tests, trie: implement NewBatchWithSize API for batcher e…

d95bd09

…thereum#24392

trie: fix two issues in trie iterator ethereum#24539

84080aa

* trie: fix memory leak in trie iterator
  In the trie iterator, live nodes are tracked in a stack while iterating. Popped node states should be explicitly set to nil in order to get garbage-collected.
* trie: fix empty trie iterator
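The memory-leak fix follows a common Go pattern for slice-backed stacks of pointers. A minimal sketch, with `nodeState` and `iterStack` as hypothetical stand-ins for the iterator's internal types:

```go
package main

import "fmt"

// nodeState is a HYPOTHETICAL stand-in for the per-node state the trie
// iterator keeps on its stack; the real type holds more fields.
type nodeState struct {
	blob []byte
}

// iterStack is a slice-backed stack of node states.
type iterStack struct {
	items []*nodeState
}

func (s *iterStack) push(n *nodeState) { s.items = append(s.items, n) }

// pop removes and returns the top element. The vacated slot is set to nil
// first: otherwise the backing array would keep a pointer to the popped
// state and the garbage collector could not reclaim it.
func (s *iterStack) pop() *nodeState {
	if len(s.items) == 0 {
		return nil
	}
	last := len(s.items) - 1
	n := s.items[last]
	s.items[last] = nil
	s.items = s.items[:last]
	return n
}

func main() {
	s := &iterStack{}
	s.push(&nodeState{blob: []byte{1}})
	s.push(&nodeState{blob: []byte{2}})
	top := s.pop()
	fmt.Println("popped:", top.blob, "remaining:", len(s.items))
}
```

Shrinking the slice alone only changes its length; the pointer beyond the new length stays reachable through the backing array until it is overwritten.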

core, trie: implement trie tracer ethereum#24403

494fc45

Trie tracer is an auxiliary tool to capture all deleted nodes which can't be captured by trie.Committer. The deleted nodes can be removed from the disk later.
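One way to picture such a tracer is as a pair of sets keyed by node path, where an insert and a delete of the same path cancel each other out. The sketch below is an assumption-laden illustration; the `tracer` type and its method names are hypothetical and may differ from the real implementation.

```go
package main

import (
	"fmt"
	"sort"
)

// tracer is a HYPOTHETICAL sketch that records which node paths were
// created and which were removed between two commits.
type tracer struct {
	inserts map[string]struct{}
	deletes map[string]struct{}
}

func newTracer() *tracer {
	return &tracer{
		inserts: make(map[string]struct{}),
		deletes: make(map[string]struct{}),
	}
}

// onInsert records a newly created node. If the same path was deleted
// earlier in this session, the two events cancel out.
func (t *tracer) onInsert(path string) {
	if _, ok := t.deletes[path]; ok {
		delete(t.deletes, path)
		return
	}
	t.inserts[path] = struct{}{}
}

// onDelete records a removed node. A node that was created in this session
// never reached disk, so deleting it leaves nothing to clean up.
func (t *tracer) onDelete(path string) {
	if _, ok := t.inserts[path]; ok {
		delete(t.inserts, path)
		return
	}
	t.deletes[path] = struct{}{}
}

// deletedNodes returns, in sorted order, the paths that existed before this
// session and were removed during it.
func (t *tracer) deletedNodes() []string {
	paths := make([]string, 0, len(t.deletes))
	for p := range t.deletes {
		paths = append(paths, p)
	}
	sort.Strings(paths)
	return paths
}

func main() {
	tr := newTracer()
	tr.onDelete("a") // existed on disk before: must be removed later
	tr.onInsert("b") // created in this session...
	tr.onDelete("b") // ...and removed again: nothing to do
	fmt.Println(tr.deletedNodes())
}
```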

trie: remove unused makeHashNode ethereum#24702

4fd205b

rjl493456442 and others added 30 commits June 11, 2025 17:19

core, eth, trie: rework preimage store ethereum#25287

ebc293b

* core, trie, eth, cmd: rework preimage store * trie: address comment

core, trie: flush preimages to db on database close ethereum#25533

100369b

core/state, trie: add DeleteAccount method ethereum#25531

18f37f9

trie: improve node rlp-decoding ethereum#25357

8dd4f62

trie: fix some typos ethereum#25551 ethereum#25648

a10342c

core/state, trie: fix trie flush order for proper pruning ethereum#25581

99ea20b

trie: better error reporting ethereum#25645

352ba2e

trie: fix unhandled error in test ethereum#25628

34fefc5

trie: check childrens' existence concurrently for snap heal ethereum#…

e84bf3f

…25694

core, trie: remove DiskDB function from trie database ethereum#25690

d1c9323

trie: update comments + err check for preimages ethereum#25672

6272be0

This PR includes minor updates to comments in trie/committer that reference insertion to the db, and adds an err != nil check for the return value of preimages.commit.

trie: handle more batch commit errors in Database ethereum#25674

b602833

cmd, core, eth, trie: track deleted nodes ethereum#22225 ethereum#25757

0bcc063

Co-authored-by: Martin Holst Swende <[email protected]>

trie: fix spelling mistakes

5de2b78

core, trie: clean up trie interface ethereum#26388

efa21ac

* all: cleanup trie interface * eth, trie: address comments

core, trie: port changes from pbss ethereum#26637

ef2e944

core/state, trie: remove unused error-return from trie Commit operation

2e42355

ethereum#26641

trie: remove deprecated uses of math.rand

66e04cb

core/state, trie: port changes from PBSS ethereum#26763

721925d

trie: add error-checks ethereum#26914

4eea75c

trie: reduce unit test time ethereum#26918

567a3ef
