nimbus-eth1

Commit Graph

Author	SHA1	Message	Date
Jacek Sieka	08e98eb385	restore a few tests, cleanup (#2234 ) * remove `compensateLegacySetup`, `localDbOnly` * enable trivially fixable tests	2024-05-28 14:49:35 +02:00
Jordan Hrycaj	7d9e1d8607	Misc updates for full sync (#2140 ) * Code cosmetics * Aristo+Kvt: Fix api wrappers why: Api setup killed the backend descriptor when backend mapping was disabled. * Aristo: Implement masked profiling entries why: Database backend should be listed but not counted in tally * CoreDb: Simplify backend() methods why: DBMS backend access Was provided very early and over engineered. Now there are only two backend machines, one for `Kvt` and the other one for an `Mpt` available only via new API. * CoreDb: Code cleanup regarding descriptor types * CoreDb: Refactor/redefine `persistent()` methods why: There were `persistent()` methods for any type of caching storage facilities `Kvt`, `Mpt`, `Phk`, and `Acc`. Now there is only a single `persistent()` method storing all facilities in tandem (similar to how transactions work.) For non shared `Kvt` tables, there is now an extra storage method `saveOffSite()`. * CoreDb lingo update: `trie` becomes `column` why: Notion of a `trie` is pretty much hidden by the new `CoreDb` api. Revealed are sort of database columns for accounts an storage data, any of which have an internal state represented by a Keccack hash. So a `trie` or `MPT` becomes a `column` and a `rootHash` becomes a column state. * Aristo: rename backend filed `filters` => `journal` * Update full sync logging details: + Disable eth handler noise while syncing + Log journal depth (if available) * Fix copyright year * Fix cruft and unwanted imports	2024-04-19 18:37:27 +00:00
andri lim	c41206be39	Fix styles and reduce compiler warnings (#2086 ) * Fix styles and reduce compiler warnings * Fix copyright year	2024-03-20 14:35:38 +07:00
web3-developer	11691c33e9	Update Nimbus codebase to use the new nim-rocksdb API. (#2054 ) * Bump nim-rocksdb. * Update codebase to use latest nim-rocksdb API. * Update copyright notices. * Fix memory leak due to allocCStringArray without deallocCStringArray. * Improve kvstore_rocksdb code. * Refactor and cleanup RocksStoreRef. * Update nim-rocksdb submodule to latest.	2024-03-05 12:54:42 +08:00
andri lim	6ff2edc416	Fix styles (#2046 ) * Fix styles * Fix copyright year	2024-02-21 23:04:59 +07:00
andri lim	bea558740f	Reduce compiler warnings (#2030 ) * Reduce compiler warnings * Reduce compiler warnings in test code	2024-02-16 16:08:07 +07:00
Jordan Hrycaj	c47f021596	Core db and aristo updates for destructor and tx logic (#1894 ) * Disable `TransactionID` related functions from `state_db.nim` why: Functions `getCommittedStorage()` and `updateOriginalRoot()` from the `state_db` module are nowhere used. The emulation of a legacy `TransactionID` type functionality is administratively expensive to provide by `Aristo` (the legacy DB version is only partially implemented, anyway). As there is no other place where `TransactionID`s are used, they will not be provided by the `Aristo` variant of the `CoreDb`. For the legacy DB API, nothing will change. * Fix copyright headers in source code * Get rid of compiler warning * Update Aristo code, remove unused `merge()` variant, export `hashify()` why: Adapt to upcoming `CoreDb` wrapper * Remove synced tx feature from `Aristo` why: + This feature allowed to synchronise transaction methods like begin, commit, and rollback for a group of descriptors. + The feature is over engineered and not needed for `CoreDb`, neither is it complete (some convergence features missing.) * Add debugging helpers to `Kvt` also: Update database iterator, add count variable yield argument similar to `Aristo`. * Provide optional destructors for `CoreDb` API why; For the upcoming Aristo wrapper, this allows to control when certain smart destruction and update can take place. The auto destructor works fine in general when the storage/cache strategy is known and acceptable when creating descriptors. * Add update option for `CoreDb` API function `hash()` why; The hash function is typically used to get the state root of the MPT. Due to lazy hashing, this might be not available on the `Aristo` DB. So the `update` function asks for re-hashing the gurrent state changes if needed. * Update API tracking log mode: `info` => `debug * Use shared `Kvt` descriptor in new Ledger API why: No need to create a new descriptor all the time	2023-11-16 19:35:03 +00:00
Jordan Hrycaj	786263c0b8	Core db update api and fix tracer methods (#1816 ) * CoreDB: Re-org API details: Legacy API internally uses vertex ID for root node abstraction * Cosmetics: Move some unit test helpers to common sub-directory * Extract constant from `accouns_cache.nim` => `constants.nim` * Fix tracer methods why: Logger dump data were wrongly dumped from the production database. This caused an assert exception when iterating over the persistent database (instead of the memory logger.) This event in turn was enabled after fixing another inconsistency which just set up an empty iterator. Unit tests failed to detect that.	2023-10-11 20:09:11 +01:00
Jordan Hrycaj	6d132811ba	Core db update providing additional results code interface (#1776 ) * Split `core_db/base.nim` into several sources * Rename `core_db/legacy.nim` => `core_db/legacy_db.nim` * Update `CoreDb` API, dual methods returning `Result[]` or plain value detail: Plain value methods implemet the legacy API, they defect on error results * Redesign `CoreDB` direct backend access why: Made the `backend` directive integral part of the API * Discontinue providing unused or otherwise available functions details: + setTransactionID() removed, not used and not easily replicable in Aristo + maybeGet() removed, available via direct backend access + newPhk() removed, never used & was experimental anyway * Update/reorg backend API why: + Added error print function `$$()` + General descriptor completion (and optional validation) via `bless()` * Update `Aristo`/`Kvt` exception handling why: Avoid `CatchableError` exceptions, rather pass them as error code where appropriate. * More `CoreDB` compliant `Aristo` and `Kvt` methods details: + Providing functions like `contains()`, `getVtxRc()` (returns `Result[]`). + Additional error code: `NotImplemented` * Rewrite/reorg of Aristo DB constructor why: Previously used global object `DefaultQidLayoutRef` as default initialiser. This object was created at compile time which lead to non-gc safe functions. * Update nimbus/db/core_db/legacy_db.nim Co-authored-by: Kim De Mey <kim.demey@gmail.com> * Update nimbus/db/aristo/aristo_transcode.nim Co-authored-by: Kim De Mey <kim.demey@gmail.com> * Update nimbus/db/core_db/legacy_db.nim Co-authored-by: Kim De Mey <kim.demey@gmail.com> --------- Co-authored-by: Kim De Mey <kim.demey@gmail.com>	2023-09-26 10:21:13 +01:00
Jordan Hrycaj	6bc55d4e6f	Core db aristo and kvt updates preparing for integration (#1760 ) * Kvt: Implemented multi-descriptor access on the same backend why: This behaviour mirrors the one of Aristo and can be used for simultaneous transactions on Aristo + Kvt * Kvt: Update database iterators why: Forgot to run on the top layer first * Kvt: Misc fixes * Aristo, use `openArray[byte]` rather than `Blob` in prototype * Aristo, by default hashify right after cloning descriptor why: Typically, a completed descriptor is expected after cloning. Hashing can be suppressed by argument flag. * Aristo provides `replicate()` iterator, similar to legacy `replicate()` * Aristo API fixes and updates * CoreDB: Rename `legacy_persistent` => `legacy_rocksdb` why: More systematic, will be in line with Aristo DB which might have more than one persistent backends * CoreDB: Prettify API sources why: Better to read and maintain details: Annotating with custom pragmas which cleans up the prototypes * CoreDB: Update MPT/put() prototype allowing `CatchableError` why: Will be needed for Aristo API (legacy is OK with `RlpError`)	2023-09-18 21:20:28 +01:00
Jordan Hrycaj	221e6c9e2f	Unified database frontend integration (#1670 ) * Nimbus folder environment update details: * Integrated `CoreDbRef` for the sources in the `nimbus` sub-folder. * The `nimbus` program does not compile yet as it needs the updates in the parallel `stateless` sub-folder. * Stateless environment update details: * Integrated `CoreDbRef` for the sources in the `stateless` sub-folder. * The `nimbus` program compiles now. * Premix environment update details: * Integrated `CoreDbRef` for the sources in the `premix` sub-folder. * Fluffy environment update details: * Integrated `CoreDbRef` for the sources in the `fluffy` sub-folder. * Tools environment update details: * Integrated `CoreDbRef` for the sources in the `tools` sub-folder. * Nodocker environment update details: * Integrated `CoreDbRef` for the sources in the `hive_integration/nodocker` sub-folder. * Tests environment update details: * Integrated `CoreDbRef` for the sources in the `tests` sub-folder. * The unit tests compile and run cleanly now. * Generalise `CoreDbRef` to any `select_backend` supported database why: Generalisation was just missed due to overcoming some compiler oddity which was tied to rocksdb for testing. * Suppress compiler warning for `newChainDB()` why: Warning was added to this function which must be wrapped so that any `CatchableError` is re-raised as `Defect`. * Split off persistent `CoreDbRef` constructor into separate file why: This allows to compile a memory only database version without linking the backend library. * Use memory `CoreDbRef` database by default detail: Persistent DB constructor needs to import `db/core_db/persistent why: Most tests use memory DB anyway. This avoids linking `-lrocksdb` or any other backend by default. * fix `toLegacyBackend()` availability check why: got garbled after memory/persistent split. * Clarify raw access to MPT for snap sync handler why: Logically, `kvt` is not the raw access for the hexary trie (although this holds for the legacy database)	2023-08-04 12:10:09 +01:00
Jordan Hrycaj	4c865ec884	Snap sync update pivot updating via rpc (#1583 ) * Unit tests update, code cosmetics * Fix segfault with zombie handling why: In order to save memory, the data records of zombie entries are removed and only the key (aka peer node) is kept. Consequently, logging these zombies can only be done by the key. * Allow to accept V2 payload without `shanghaiTime` set while syncing why: Currently, `shanghaiTime` is missing (alt least) while snap syncing. So beacon node headers can be processed regardless. Normal (aka strict) processing will be automatically restored when leaving snap sync mode.	2023-05-16 14:52:44 +01:00
Jordan Hrycaj	f40a066cc6	Update snap sync ready to succeed at lab test (#1556 ) * Extract RocksDB timing tests from snap unit tests as separate module why: Declutter, make space for more snap related unit tests. * Renamed `undumpNextGroup()` => `undumpBlocks()` why: Source file name is called `undump_blocks.nim` which should be sort of in sync with the method name(s). * Implement snap/1 server method `getByteCodes()` * Implement snap/1 client method `getByteCodes()` * Implement faculty for handling contract code fetching via snap/1 * Provide persistent storage for contract code records * Implement contract code snap sync fetch & store * Code massage, cosmetics * Unit tests for verifying snap sync snapshot dump details: Use `undump_kvp.dumpAllDb()` to dump any database.	2023-04-21 22:11:04 +01:00
Jordan Hrycaj	0a3bc102eb	Pre functional snap to full sync (#1546 ) * Update sync scheduler pool mode why: The pool mode allows to loop over active peers one after another. This is ideal for soft re-starting peers. As this is a two tier experience (start/stop, setup/release) the loop must be run twice. This is controlled by a more rigid re-definition of how to use the `poolMode` flag. * Mitigate RLP serialiser deficiency why: Currently, serialising the `BlockBody` in not conevrtible and need to be checked in the `eth` module. Currently a local fix for the wire protocol applies. Unit tests will stay (after this local solution will have been removed.) * Code cosmetics and massage details: Main part is `types.toStr()` as a unified function for logging block numbers. * Allow to use a logical genesis replacement (start of history) why: Snap sync will set up an arbitrary pivot at a block number different from zero. In fact, the higher the block number the better. details: A non-genesis start of history will currently only affect the score values which were derived from the difficulty. * Provide function to store the snap pivot block header in chain db why: Together with the start of history facility, this allows to proceed with full syncing once snap has finished. details: Snap db storage was switched from a sub-tables to the flat chain db. * Provide database completeness and sanity checker details: For debugging on smaller databases, only * Implement snap -> full sync switch	2023-04-14 23:28:57 +01:00
Jordan Hrycaj	33023aaf39	Update snap server client test scenario (#1518 ) * Redesign snap1 message GetTrieNodes argument prototypes why: A list of sub-objects `seq[SnapTriePath]` is more intuitive to work with than an opaque definition `seq[seq[Blob]]` because the inner object `SnapTriePath` object has a dedicated inner structure (for how to interprete `seq[Blob]`.) * Collect some public constants into `constants.nim` file * Reorg `hexary_paths.nim` why: + Collecting nodes following a partial path properly ending at an extension node failed to collect this last node. + Merged the nodes collecting algorithm for persistent and in-memory into a single generic function `hexary_paths.rootPathExtend()` info: Extracted common tasks to `hexary_nodes_helper.nim` * Implement `StorageRanges` message handler for snap/1 protocol	2023-03-22 20:11:49 +00:00
Jordan Hrycaj	15d0ccb39c	Prepare snap server client test scenario cont4 (#1507 ) * Add state root to node steps path register `RPath` or `XPath` why: Typically, the first node in the path register is the state root. There are occasions, when the path register is empty (i.e. there are no node references) which typically applies to a zero node key. In order to find the next node key greater than zero, the state root is is needed which is now part of the `RPath` or `XPath` data types. * Extracted hexary tree debugging functions into separate files * Update empty path fringe case for left/right node neighbour why: When starting at zero, the node steps path register would be empty. So will any path that is before the fist non-zero link of a state root (if it is a `Branch` node.) The `hexaryNearbyRight()` or `hexaryNearbyLeft()` function required a non-zero node steps path register. Now the first node is to be advanced starting at the first state root link if necessary. * Simplify/reorg neighbour node finder why: There was too mach code repetition for the cases * persistent or in-memory database * left or right move details: Most algorithms apply for persistent and in-memory alike. Using templates/generic functions most of these algorithms can be stated in a unified way * Update storage slots snap/1 handler details: Minor changes to be more debugging friendly. * Fix detection of full database for snap sync * Docu: Snap sync test & debugging scenario	2023-03-17 14:46:50 +00:00
Jordan Hrycaj	fe3a6d67c6	Prepare snap server client test scenario cont2 (#1487 ) * Clean up some function prototypes why: Simplify polymorphic prototype variances for easier maintenance. * Fix fringe condition crash when importing bogus RLP node why: Accessing non-list RLP entry as a list causes `Defect` * Fix left boundary proof at range extractor why: Was insufficient. The main problem was that there was no unit test for the validity of the generated left boundary. * Handle incomplete left boundary proofs early why: Attempt to do it later leads to overly complex code in order to prevent looping when the same peer repeats to send the same incomplete proof. Contrary, gaps in the leaf sequence can be handled gracefully with registering the gaps * Implement a manual pivot setup mechanism for snap sync why: For a test scenario it is convenient to set the pivot to something lower than the beacon header from the consensus layer. This does not need rely on any RPC mechanism. details: The file containing the pivot specs is specified by the `--sync-ctrl-file` option. It is regularly parsed for updates. * Fix calculation error why: Prevent from calculating negative square root	2023-03-07 14:23:22 +00:00
Jordan Hrycaj	bf53226c2c	Minor updates for testing and cosmetics (#1476 ) * Fix locked database file annoyance with unit tests on Windows why: Need to clean up old files first from previous session as files remain locked despite closing of database. * Fix initialisation order detail: Apparently this has no real effect as the ticker is only initialised here but started later. This possible bug has been in all for a while and was running with the previous compiler and libraries. * Better naming of data fields for sync descriptors details: * BuddyRef[S,W]: buddy.data -> buddy.only * CtxRef[S]: ctx.data -> ctx.pool	2023-02-23 13:13:02 +00:00
Jordan Hrycaj	b793f0de8d	Snap sync extractor and sub range proofs cont1 (#1468 ) * Redefine `seq[Blob]` => `seq[SnapProof]` for `snap/1` protocol why: Proof nodes are traded as `Blob` type items rather than Nim objects. So the RLP transcoder must not extra wrap proofs which are of type seq[Blob]. Without custom encoding one would produce a `list(blob(item1), blob(item2) ..)` instead of `list(item1, item2 ..)`. * Limit leaf extractor by RLP size rather than number of items why: To be used serving `snap/1` requests, the result of function `hexaryRangeLeafsProof()` is limited by the maximal space needed to serialise the result which will be part of the `snap/1` repsonse. * Let the range extractor `hexaryRangeLeafsProof()` return RLP list sizes why: When collecting accounts, the size oft the accounts list when encoded as RLP is continually updated. So the summed up value is available anyway. For the proof nodes list, there are not many (~ 10) so summing up is not expensive here.	2023-02-15 10:14:40 +00:00
Jordan Hrycaj	df1217b7ca	Silence compiler gossip after nim upgrade cont3 (#1466 ) * Removed some Windows specific unit test annoyances details: + Short put()/get() cycles on persistent database have a race condition with vendor rocksdb. On a specific (and slow) qemu/win7 a 50ms `sleep()` in between will mostly do the job (i.e. unless heavy CPU load.) This issue was not observed on github/ci. + Removed annoyances when qemu/Win7 keeps the rocksdb database files locked even after closing the db. The problem is solved by strictly using fresh names for each test. No assumption made to be able to properly clean up. This issue was not observed on github/ci. * Silence some compiler gossip -- part 7, misc/non(sync or graphql) details: Adding some missing exception annotation	2023-02-14 20:27:17 +00:00
Jordan Hrycaj	c2fc46a99a	Snap sync extractor test sub range proofs (#1460 ) * Unit tests to verify calculations based on hard coded constants why: Sizes of RLP encoded objects are available at run time only. * Changed argument order for `hexaryRangeLeafsProof()` prototype why: Better to read as a stand-alone function (arguments were optimised for functional pipelines) * Run sub-range proof tests for extracted ranges	2023-02-02 13:27:09 +00:00
Jordan Hrycaj	6ca6bcd96f	Snap sync fix trie interpolation fringe condition (#1457 ) * Cosmetics details: + Update doc generator + Fix key type representation in `hexary_desc` for debugging + Redefine `isImportOk()` as template for better `check()` line reporting * Fix fringe condition when interpolating Merkle-Patricia tries details: Small change with profound effect fixing some pathological condition that haunted the unit test set on large data sers. There is still one condition left which might well be due to an incomplete data set. * Unit test proof nodes for node range extractor * Unit tests to run on full extraction set why: Left over from troubleshooting, range length was only 5	2023-02-01 18:56:06 +00:00
Jordan Hrycaj	197d2b16dd	Snap sync interval range extractor (#1449 ) * Update comments and test noise * Fix boundary proofs why: Where neither used in production, nor unit tested. For production, other methods apply to test leaf range integrity directly based of the proof nodes. * Added `hexary_range()`: interval range + proof extractor details: + Will be used for `snap/1` protocol handler + Unit tests added (also for testing left boundary proof) todo: Need to verify completeness of proof nodes * Reduce some nim 1.6 compiler noise * Stop unit test gossip for ci tests	2023-01-30 17:50:58 +00:00
Jordan Hrycaj	e093fa452d	Declutter snap sync unit tests (#1444 ) * Extracted RocksDB timing unit tests into separate file why: make space for more in main module :) * Extracted `inspectionRunner()` unit tests into separate file why: make space for more in main module :) * Extracted `storagesRunner()` unit tests into separate file why: make space for more in main module :) * Extracted pivot checkpoint store/retrieval unit tests into separate file why: make space for more in main module :) * Extract helper functions into separate source file * Extracted account import unit tests into separate file why: make space for more in main module :) * Rename `test_decompose()` => `test_NodeRangeDecompose()` why: There will be more functions with `test_NodeRange` prefix.	2023-01-23 16:09:12 +00:00
Jordan Hrycaj	3766eddf5a	Some updates to the envelope module (#1353 ) details: + Add detailed error return codes + Remove cruft + Some prototype wrappers	2022-12-06 20:13:31 +00:00
Jordan Hrycaj	85de03fd6e	Rename and update dismantle => hexaryEnvelopeDecompose() (#1351 ) * Rename and update dismantle => hexaryEnvelopeDecompose() why: + As for naming, a positive connotation is prefered + The unit tests were really insufficient + The function result was wrong on a few boundry conditions detail: + Extracted the function from `hexary_paths.nim` and re-implemented it together with other envelope functions => `hexary_envelope.nim` + Re-wrote docu for `hexaryEnvelopeDecompose()` * Relaxed right condition for `hexaryEnvelopeDecompose()` range argument why; Previously, the right point of the argument interval had to be a path to an allocated leaf node. While this is typically a given for accounts, it is easier to require an arbitrary range of paths (or keys) with the requirement of a `boundary proof` for left and right (i.e. enough nodes in the database to find the end points.) also: Bug fixes for related functions (typos, missing conditions etc.) * Add missing unit tests include file	2022-12-06 17:35:56 +00:00
jangko	94a94c5b65	implement better hardfork management	2022-12-02 13:51:42 +07:00
Jordan Hrycaj	44a57496d9	Snap sync interval complement method to speed up trie perusal (#1328 ) * Add quick hexary trie inspector, called `dismantle()` why: + Full hexary trie perusal is slow if running down leaf nodes + For known range of leaf nodes, work out the UInt126-complement of partial sub-trie paths (for existing nodes). The result should cover no (or only a few) sub-tries with leaf nodes. * Extract common healing methods => `sub_tries_helper.nim` details: Also apply quick hexary trie inspection tool `dismantle()` Replace `inspectAccountsTrie()` wrapper by `hexaryInspectTrie()` * Re-arrange task dispatching in main peer worker * Refactor accounts and storage slots downloaders * Rename `HexaryDbError` => `HexaryError`	2022-11-28 09:03:23 +00:00
Jordan Hrycaj	7688148565	Snap sync can start on saved checkpoint (#1327 ) * Stop negotiating pivot if peer repeatedly replies w/usesless answers why: There is some fringe condition where a peer replies with legit but useless empty headers repetely. This goes on until somebody stops. We stop now. * Rename `missingNodes` => `sickSubTries` why: These (probably missing) nodes represent in reality fully or partially missing sub-tries. The top nodes may even exist, e.g. as a shallow sub-trie. also: Keep track of account healing on/of by bool variable `accountsHealing` controlled in `pivot_helper.execSnapSyncAction()` * Add `nimbus` option argument `snapCtx` for starting snap recovery (if any) also: + Trigger the recovery (or similar) process from inside the global peer worker initialisation `worker.setup()` and not by the `snap.start()` function. + Have `runPool()` returned a `bool` code to indicate early stop to scheduler. * Can import partial snap sync checkpoint at start details: + Modified what is stored with the checkpoint in `snapdb_pivot.nim` + Will be loaded within `runDaemon()` if activated * Forgot to import total coverage range why: Only the top (or latest) pivot needs coverage but the total coverage is the list of all ranges for all pivots -- simply forgotten.	2022-11-25 14:56:42 +00:00
Jordan Hrycaj	bba1bea4c8	Snap sync state save (#1302 ) * Piecemeal trie inspection details: Trie inspection will stop after maximum number of nodes visited. The inspection can be resumed using the returned state from the last session. why: This feature allows for task switch between `piecemeal` sessions. * Extract pivot helper code from `worker.nim` => `pivot_helper.nim` * Accounts import will now return dangling paths from `proof` nodes why: With proper bookkeeping, this can be used to start healing without analysing the the probably full trie. * Update `unprocessed` account range handling why: More generally, the API of a pairs of unprocessed intervals favours the first set and not before that is exhausted the second set comes into play. This was unfortunately implemented which caused the ranges to be unnecessarily fractioned. Now the number of range interval typically remains in the lower single digit numbers. * Save sync state after end of downloading some accounts details: restore/resume to be implemented later	2022-11-16 23:51:06 +00:00
Jordan Hrycaj	e14fd4b96c	Prep for full sync after snap make 6 (#1291 ) * Update log ticker, using time interval rather than ticker count why: Counting and logging ticker occurrences is inherently imprecise. So time intervals are used. * Use separate storage tables for snap sync data * Left boundary proof update why: Was not properly implemented, yet. * Capture pivot in peer worker (aka buddy) tasks why: The pivot environment is linked to the `buddy` descriptor. While there is a task switch, the pivot may change. So it is passed on as function argument `env` rather than retrieved from the buddy at the start of a sub-function. * Split queues `fetchStorage` into `fetchStorageFull` and `fetchStoragePart` * Remove obsolete account range returned from `GetAccountRange` message why: Handler returned the wrong right value of the range. This range was for convenience, only. * Prioritise storage slots if the queue becomes large why: Currently, accounts processing is prioritised up until all accounts are downloaded. The new prioritisation has two thresholds for + start processing storage slots with a new worker + stop account processing and switch to storage processing also: Provide api for `SnapTodoRanges` pair of range sets in `worker_desc.nim` * Generalise left boundary proof for accounts or storage slots. why: Detailed explanation how this works is documented with `snapdb_accounts.importAccounts()`. Instead of enforcing a left boundary proof (which is still the default), the importer functions return a list of `holes` (aka node paths) found in the argument ranges of leaf nodes. This in turn is used by the book keeping software for data download. * Forgot to pass on variable in function wrapper also: + Start healing not before 99% accounts covered (previously 95%) + Logging updated/prettified	2022-11-08 18:56:04 +00:00
Jordan Hrycaj	1b4572ed3b	Prep for full sync after snap make 4 (#1282 ) * Re-arrange fetching storage slots in batch module why; Previously, fetching partial slot ranges first has a chance of terminating the worker peer 9due to network error) while there were many inheritable storage slots on the queue. Now, inheritance is checked first, then full slot ranges and finally partial ranges. * Update logging * Bundled node information for healing into single object `NodeSpecs` why: Previously, partial paths and node keys were kept in separate variables. This approach was error prone due to copying/reassembling function argument objects. As all partial paths, keys, and node data types are more or less handled as `Blob`s over the network (using Eth/6x, or Snap/1) it makes sense to hold these `Blob`s as named field in a single object (even if not all fields are active for the current purpose.) * For good housekeeping, using `NodeKey` type only for account keys why: previously, a mixture of `NodeKey` and `Hash256` was used. Now, only state or storage root keys use the `Hash256` type. * Always accept latest pivot (and not a slightly older one) why; For testing it was tried to use a slightly older pivot state root than available. Some anecdotal tests seemed to suggest an advantage so that more peers are willing to serve on that older pivot. But this could not be confirmed in subsequent tests (still anecdotal, though.) As a side note, the distance of the latest pivot to its predecessor is at least 128 (or whatever the constant `minPivotBlockDistance` is assigned to.) * Reshuffle name components for some file and function names why: Clarifies purpose: "storages" becomes: "storage slots" "store" becomes: "range fetch" * Stash away currently unused modules in sub-folder named "notused"	2022-10-27 14:49:28 +01:00
Jordan Hrycaj	c0d580715e	Remodel persistent snapdb access (#1274 ) * Re-model persistent database access why: Storage slots healing just run on the wrong sub-trie (i.e. the wrong key mapping). So get/put and bulk functions now use the definitions in `snapdb_desc` (earlier there were some shortcuts for `get()`.) * Fixes: missing return code, typo, redundant imports etc. * Remove obsolete debugging directives from `worker_desc` module * Correct failing unit tests for storage slots trie inspection why: Some pathological cases for the extended tests do not produce any hexary trie data. This is rightly detected by the trie inspection and the result checks needed to adjusted.	2022-10-20 17:59:54 +01:00
Jordan Hrycaj	85fdb61699	Prep for full sync after snap make 3 (#1270 ) * For snap sync, publish `EthWireRef` in sync descriptor why: currently used for noise control * Detect and reuse existing storage slots * Provide healing module for storage slots * Update statistic ticker (adding range factor for unprocessed storage) * Complete mere function for work item ranges why: Merging interval into existing partial item was missing * Show av storage queue lengths in ticker detail; Previous attempt shows average completeness which did not tell much * Correct the meaning of the storage counter (per pivot) detail: Is the # accounts that have a storage saved	2022-10-19 11:04:06 +01:00
Jordan Hrycaj	8c7d91512b	Prep for full sync after snap mark2 (#1263 ) * Rename `LeafRange` => `NodeTagRange` * Replacing storage slot partition point by interval why: The partition point only allows to describe slots `[point,high(Uint256)]` for fetching interval slot ranges. This has been generalised for any interval. * Replacing `SnapAccountRanges` by `SnapTrieRangeBatch` why: Generalised healing status for accounts, and later for storage slots. * Improve accounts healing loop * Split `snap_db` into accounts and storage modules why: It is cleaner to have separate session descriptors for accounts and storage slots (based on a common base descriptor.) Also, persistent storage handling might be changed in future which requires the storage slot implementation disentangled from the accounts handling. * Re-model worker queues for storage slots why: There is a dynamic list of storage sub-tries, each one has to be treated similar to the accounts database. This applied to slot interval downloads as well as to healing * Compress some return value report lists for snapdb methods why: No need to report all handling details for work items that are filteres out and discarded, anyway. * Remove inner loop frame from healing function why: The healing function runs as a loop body already.	2022-10-14 17:40:32 +01:00
Jordan Hrycaj	d53eacb854	Prep for full sync after snap (#1253 ) * Split fetch accounts into sub-modules details: There will be separated modules for accounts snapshot, storage snapshot, and healing for either. * Allow to rebase pivot before negotiated header why: Peers seem to have not too many snapshots available. By setting back the pivot block header slightly, the chances might be higher to find more peers to serve this pivot. Experiment on mainnet showed that setting back too much (tested with 1024), the chances to find matching snapshot peers seem to decrease. * Add accounts healing * Update variable/field naming in `worker_desc` for readability * Handle leaf nodes in accounts healing why: There is no need to fetch accounts when they had been added by the healing process. On the flip side, these accounts must be checked for storage data and the batch queue updated, accordingly. * Reorganising accounts hash ranges batch queue why: The aim is to formally cover as many accounts as possible for different pivot state root environments. Formerly, this was tried by starting the accounts batch queue at a random value for each pivot (and wrapping around.) Now, each pivot environment starts with an interval set mutually disjunct from any interval set retrieved with other pivot state roots. also: Stop fishing for more pivots in `worker` if 100% download is reached * Reorganise/update accounts healing why: Error handling was wrong and the (math. complexity of) whole process could be better managed. details: Much of the algorithm is now documented at the top of the file `heal_accounts.nim`	2022-10-08 18:20:50 +01:00
Jordan Hrycaj	4ff0948fed	Snap sync accounts healing (#1225 ) * Added inspect module why: Find dangling references for trie healing support. details: + This patch set provides only the inspect module and some unit tests. + There are also extensive unit tests which need bulk data from the `nimbus-eth1-blob` module. * Alternative pivot finder why: Attempt to be faster on start up. Also tying to decouple pivot finder somehow by providing different mechanisms (this one runs in `single` mode.) * Use inspect module for healing details: + After some progress with account and storage data, the inspect facility is used to find dangling links in the database to be filled nose-wise. + This is a crude attempt to cobble together functional elements. The set up needs to be honed. * fix scheduler to avoid starting dead peers why: Some peers drop out while in `sleepAsync()`. So extra `if` clauses make sure that this event is detected early. * Bug fixes causing crashes details: + prettify.toPC(): int/intToStr() numeric range over/underflow + hexary_inspect.hexaryInspectPath(): take care of half initialised step with branch but missing index into branch array * improve handling of dropped peers in alternaive pivot finder why: Strange things may happen while querying data from the network. Additional checks make sure that the state of other peers is updated immediately. * Update trace messages * reorganise snap fetch & store schedule	2022-09-16 08:24:12 +01:00
Jordan Hrycaj	72a31593a9	Snap fetch account storage data (#1211 ) * Removed database write comparison statistics * Provide life storage tests data details: database dumps on external repo `nimbus-eth1`-blobs` * Update hexary tree interpolation for storage bulk tests * fetch storage update	2022-09-02 19:16:09 +01:00
Jordan Hrycaj	de2c13e136	Update snap offline tests (#1199 ) * Re-implemented `hexaryFollow()` in a more general fashion details: + New name for re-implemented `hexaryFollow()` is `hexaryPath()` + Renamed `rTreeFollow()` as `hexaryPath()` why: Returning similarly organised structures, the results of the `hexaryPath()` functions become comparable when running over the persistent and the in-memory databases. * Added traversal functionality for persistent ChainDB * Using `Account` values as re-packed Blob * Repack samples as compressed data files * Produce test data details: + Can force pivot state root switch after minimal coverage. + For emulating certain network behaviour, downloading accounts stops for a particular pivot state root if 30% (some static number) coverage is reached. Following accounts are downloaded for a later pivot state root.	2022-08-24 14:44:18 +01:00
Jordan Hrycaj	f07945d37b	Misc snap sync updates (#1192 ) * Bump nim-stew why: Need fixed interval set * Keep track of accumulated account ranges over all state roots * Added comments and explanations to unit tests * typo	2022-08-17 08:30:11 +01:00
Jordan Hrycaj	7489784ba8	Snap sync accounts db code reorg (#1189 ) * Extracted functionality into sub-modules for maintainability * Setting SST bulk load as default in `accounts_db` details: + currently, the same data are stored via rocksdb if available, and the same via embedded `storage_type` with (non-standard) prefix 200 for time comparisons + fallback to normal `put()` unless rocksdb is accessible	2022-08-15 16:51:50 +01:00
Jordan Hrycaj	7d7e26d45f	Experimental bulk loader tests (#1187 ) why: Rocksdb bulk loading might provide a slight advantage when loading larger data sets into the system	2022-08-12 16:42:07 +01:00
Jordan Hrycaj	5f0e89a41e	Snap accounts bulk import preparer (#1183 ) * Provided common scheduler API, applied to `full` sync * Use hexary trie as storage for proofs_db records also: + Store metadata with account for keeping track of account state + add iterator over accounts * Common scheduler API applied to `snap` sync * Prepare for accounts bulk import details: + Added some ad-hoc checks for proving accounts data received from the snap/1 (will be replaced by proper database version when ready) + Added code that dumps some of the received snap/1 data into a file (turned of by default, see `worker_desc.nim`)	2022-08-04 09:04:30 +01:00
Jordan Hrycaj	134fe26997	Store proved snap accounts (#1145 ) * Relocated `IntervalSets` to nim-stew repo * Accumulate accounts on temporary kv-DB why: Explore the data as returned from snap/1. Will be converted to a `eth/db` next. details: Verify and accumulate per/state-root accounts downloaded via snap. also: Some unit tests * Replace `Table` by `TrieDatabaseRef` for accounts accumulator * update ticker statistics details: mean/variance based counter update * allow persistent db for proved accounts * rebase, and globally activate unit test * fix statistics	2022-07-01 12:42:17 +01:00

44 Commits