nimbus-eth1

mirror of https://github.com/status-im/nimbus-eth1.git synced 2025-03-01 04:10:45 +00:00

Author	SHA1	Message	Date
Jacek Sieka	caca11b30b	Simplify txFrame protocol, improve persist performance (#3077 ) * Simplify txFrame protocol, improve persist performance To prepare forked-layers for further surgery to avoid the nesting tax, the commit/rollback style of interacting must first be adjusted, since it does not provide a point in time where the frame is "done" and goes from being actively written to, to simply waiting to be persisted or discarded. A collateral benefit of this change is that the scheme removes some complexity from the process by moving the "last saved block number" into txframe along with the actual state changes thus reducing the risk that they go "out of sync" and removing the "commit" consolidation responsibility from ForkedChain. * commit/rollback become checkpoint/dispose - since these are pure in-memory constructs, there's less error handling and there's no real "rollback" involved - dispose better implies that the instance cannot be used and we can more aggressively clear the memory it uses * simplified block number handling that moves to become part of txFrame just like the data that the block number references * avoid reparenting step by replacing the base instead of keeping a singleton instance * persist builds the set of changes from the bottom which helps avoid moving changes in the top layers through each ancestor level of the frame stack * when using an in-memory database in tests, allow the instance to be passed around to enable testing persist and reload logic	2025-02-17 01:51:56 +00:00
Jacek Sieka	42bb640443	Simplify shared rocksdb instance / write batch handling (#3063 ) By introducing the "shared rocksdb instance" concept to the backend, we can remove the "piggybacking" mode, thus reducing the complexity of database initialisation and opening the possibility of extending how write batching works across kvt/aristo. The change makes explicit the hidden shared state that was previously hiding in closures and provides the first step towards simplifying the "commit/persist" interface of coredb, preparing it for optimizations to reduce the "layering tax" that `forked-layers` introduced.	2025-02-14 09:40:22 +01:00
pmmiranda	411a3cadfa	Renamed 'nimbus' directory and its references to 'execution_chain' (#3052 ) * renamed nimbus folder to execution_chain * Renamed "nimbus" references to "execution_chain" * fixed wrongly changed http reference * delete snap types file given that it was deleted before this PR merge * missing 'execution_chain' replacement --------- Co-authored-by: pmmiranda <pedro.miranda@nimbus.team>	2025-02-11 22:28:42 +00:00
Jacek Sieka	2961905a95	aristo: fork support via layers/txframes (#2960 ) * aristo: fork support via layers/txframes This change reorganises how the database is accessed: instead of holding a "current frame" in the database object, a dag of frames is created based on the "base frame" held in `AristoDbRef` and all database access happens through this frame, which can be thought of as a consistent point-in-time snapshot of the database based on a particular fork of the chain. In the code, "frame", "transaction" and "layer" are used to denote more or less the same thing: a dag of stacked changes backed by the on-disk database. Although this is not a requirement, in practice each frame holds the change set of a single block - as such, the frame and its ancestors leading up to the on-disk state represent the state of the database after that block has been applied. "committing" means merging the changes to its parent frame so that the difference between them is lost and only the cumulative changes remain - this facility enables frames to be combined arbitrarily wherever they are in the dag. In particular, it becomes possible to consolidate a set of changes near the base of the dag and commit those to disk without having to re-do the in-memory frames built on top of them - this is useful for "flattening" a set of changes during a base update and sending those to storage without having to perform a block replay on top. 
Looking at abstractions, a side effect of this change is that the KVT and Aristo are brought closer together by considering them to be part of the "same" atomic transaction set - the way the code gets organised, applying a block and saving it to the kvt happens in the same "logical" frame - therefore, discarding the frame discards both the aristo and kvt changes at the same time - likewise, they are persisted to disk together - this makes reasoning about the database somewhat easier but has the downside of increased memory usage, something that perhaps will need addressing in the future. Because the code reasons more strictly about frames and the state of the persisted database, it also makes it more visible where ForkedChain should be used and where it is still missing - in particular, frames represent a single branch of history while forkedchain manages multiple parallel forks - user-facing services such as the RPC should use the latter, ie until it has been finalized, a getBlock request should consider all forks and not just the blocks in the canonical head branch. Another advantage of this approach is that `AristoDbRef` conceptually becomes more simple - removing its tracking of the "current" transaction stack simplifies reasoning about what can go wrong since this state now has to be passed around in the form of `AristoTxRef` - as such, many of the tests and facilities in the code that were dealing with "stack inconsistency" are now structurally prevented from happening. The test suite will need significant refactoring after this change. 
Once this change has been merged, there are several follow-ups to do: * there's no mechanism for keeping frames up to date as they get committed or rolled back - TODO * naming is confused - many names for the same thing for legacy reasons * forkedchain support is still missing in lots of code * clean up redundant logic based on previous designs - in particular the debug and introspection code no longer makes sense * the way change sets are stored will probably need revisiting - because it's a stack of changes where each frame must be interrogated to find an on-disk value, with a base distance of 128 we'll at minimum have to perform 128 frame lookups for every database interaction - regardless, the "dag-like" nature will stay * dispose and commit are poorly defined and perhaps redundant - in theory, one could simply let the GC collect abandoned frames etc, though it's likely an explicit mechanism will remain useful, so they stay for now More about the changes: * `AristoDbRef` gains a `txRef` field (todo: rename) that "more or less" corresponds to the old `balancer` field * `AristoDbRef.stack` is gone - instead, there's a chain of `AristoTxRef` objects that hold their respective "layer" which has the actual changes * No more reasoning about "top" and "stack" - instead, each `AristoTxRef` can be a "head" that "more or less" corresponds to the old single-history `top` notion and its stack * `level` still represents "distance to base" - it's computed from the parent chain instead of being stored * one has to be careful not to use frames where forkedchain was intended - layers are only for a single branch of history! 
* fix layer vtop after rollback * engine fix * Fix test_txpool * Fix test_rpc * Fix copyright year * fix simulator * Fix copyright year * Fix copyright year * Fix tracer * Fix infinite recursion bug * Remove aristo and kvt empty files * Fix copyright year * Fix fc chain_kvt * ForkedChain refactoring * Fix merge master conflict * Fix copyright year * Reparent txFrame * Fix test * Fix txFrame reparent again * Cleanup and fix test * UpdateBase bugfix and fix test * Fix newPayload bug discovered by hive * Fix engine api fcu * Clean up call template, chain_kvt, and txguid * Fix copyright year * work around base block loading issue * Add test * Fix updateHead bug * Fix updateBase bug * Change func commitBase to proc commitBase * Touch up and fix debug mode crash --------- Co-authored-by: jangko <jangko128@gmail.com>	2025-02-06 14:04:50 +07:00
andri lim	aba9b582db	Rename stateDB to ledger (#2966 ) * Rename stateDB to ledger * Fix readOnlyLedger	2024-12-21 20:46:13 +07:00
Jacek Sieka	d45d03ce0c	reduce tx naming overload (#2952 ) * if it's a db function, use `txFrame...` * if it's not a db function, don't use `txFrame...`	2024-12-18 23:03:51 +07:00
Jacek Sieka	7bbb0f4421	Stream blocks during import (#2937 ) When running the import, currently blocks are loaded in batches into a `seq` then passed to the importer as such. In reality, blocks are still processed one by one, so the batching does not offer any performance advantage. It does however require that the client wastes memory, up to several GB, on the block sequence while they're waiting to be processed. This PR introduces a persister that accepts these potentially large blocks one by one and at the same time removes a number of redundant / unnecessary copies, assignments and resets that were slowing down the import process in general.	2024-12-18 13:21:20 +01:00
andri lim	45bc6422a0	Reduce getCanonicalHead usage, and delegate to ForkedChain (#2948 ) The current getCanonicalHead of core db should not be confused with ForkedChain.latestHeader. Therefore we need to restrict getCanonicalHead to specific cases only, e.g. initializing ForkedChain.	2024-12-18 11:04:23 +07:00
andri lim	847cc311eb	Remove verifyFrom, vmState, and checkSeal from ChainRef (#2932 )	2024-12-13 12:12:57 +07:00
Jacek Sieka	3d58393b4c	Offload signature checking to taskpools (#2927 ) In block processing, depending on the complexity of a transaction and hotness of caches etc, signature checking can actually make up the majority of time needed to process a transaction (60% observed in some randomly sampled block ranges). Fortunately, this is a task that trivially can be offloaded to a task pool similar to how nimbus-eth2 does it. This PR introduces taskpools in the most simple way possible, by performing signature checking concurrently with other TX processing, assigning a taskpool task per TX effectively. With this little trick, we're in gigagas land 🎉 on my laptop! ``` INF 2024-12-10 21:05:35.170+01:00 Imported blocks blockNumber=3874817 b... mgps=1222.707 ... ``` Tests don't use the taskpool for now because it needs manual cleanup and we don't have a good mechanism in place. Future PR:s should address this by creating a common shutdown sequence that also closes and cleans up other resources like the DB. Co-authored-by: andri lim <jangko128@gmail.com>	2024-12-13 11:53:41 +07:00
andri lim	6b86acfb8d	Cleanup db/core_apps error handling (#2838 ) * Cleanup db/core_apps error handling * Fix persistHeader * Fix getUncles	2024-11-07 08:24:21 +07:00
andri lim	89fac051cd	Reduce declared but not used warnings (#2822 )	2024-11-03 00:11:24 +00:00
Chirag Parmar	2838191c4f	replace deprecated types (#2704 ) * partial commit * fixes * remove converters too * revert changes on nimbus_verified_proxy * revert changes in converter * revert changes(re-export) in rpc_types * update copyright year * replace types in other binaries * chain config bug * fix rebase conflict incomplete buffer * fix more rebase buffers * remove ditto types and converters * fix the tests * update copyright year	2024-10-16 08:34:12 +07:00
andri lim	76c2a75a53	Proof-of-stakiness based on block header (#2682 ) * Proof-of-stakiness based on block header * Remove unnecessary PoS check from test_txpool2 * Fix engine api simulator * Fix indentation * Fix vmstate debug util * Fix MainNet ForkId calculation issue	2024-10-08 09:37:36 +07:00
andri lim	a70bb78d27	Fix engine_sim compilation issue (#2594 )	2024-09-06 11:06:31 +07:00
andri lim	4d9e288340	Wiring ForkedChainRef to other components (#2423 ) * Wiring ForkedChainRef to other components - Disable majority of hive simulators - Only enable pyspec_sim for the moment - The pyspec_sim is using a smaller RPC service wired to ForkedChainRef - The RPC service will gradually grow * Addressing PR review * Fix test_beacon/setup_env * Enable consensus_sim (#2441) * Enable consensus_sim * Remove isFile check * Enable Engine API jwt auth tests and exchange cap tests * Enable engine api in build_sim.sh * Wire ForkedChainRef to Engine API newPayload * Wire Engine API getBodies to ForkedChainRef * Wire Engine API api_forkchoice to ForkedChainRef * Wire more RPC methods to ForkedChainRef * Implement eth_syncing * Implement eth_call and eth_getlogs * TxPool: simplify smartHead * Fix smartHead usage * Fix txpool headDiff * Remove hasBlockHeader and use headerExists * Addressing review	2024-09-04 09:54:54 +00:00
Jacek Sieka	43d93bcdab	Don't write slot hashes on import (#2564 ) The reverse slot hash mechanism causes quite a bit of database traffic but is broadly not useful except for iterating the storage of an account, something that a validator never does (it's used by the tracers). This flag adds one more thing that is not stored in the database, to be explored more comprehensively when designing full, validator and archive modes with different pruning options in the future. `ldb` says this is 60gb of data (!): ``` ldb --db=. --ignore_unknown_options --column_family=KvtGen approxsize --hex --from=0x05 --to=0x05ffffffffffffffffffffffffffffffffffffffffffffff 66488353954 ```	2024-08-16 08:22:51 +02:00
Jordan Hrycaj	800fd77333	Core db remove legacy phrases (#2468 ) * Rename `newKvt()` -> `ctx.getKvt()` why: Clean up legacy shortcut. Also, the `KVT` returned is not instantiated but refers to the shared `KVT` that resides in a context which is a generalisation of an in-memory database fork. The function `ctx` retrieves the default context. * Rename `newTransaction()` -> `ctx.newTransaction()` why: Clean up legacy shortcut. The transaction is applied to a context as a generalisation of an in-memory database fork. The function `ctx` retrieves the default context. * Rename `getColumn(CtGeneric)` -> `getGeneric()` why: No more a list of well known sub-tries needed, a single one is enough. In fact, `getColumn()` did only support a single sub-tree by now. * Reduce TODO list	2024-07-10 12:19:35 +00:00
andri lim	f04f30c72b	Reduce EVM complexity by removing forkOverride (#2448 ) * Reduce EVM complexity by removing forkOverride * Fixes	2024-07-04 15:48:36 +02:00
andri lim	c24affadee	Use simpler schema when writing transactions, receipts, and withdrawals (#2420 ) * Use simpler schema when writing transactions, receipts, and withdrawals Using MPT is not only slow but also takes up more space than needed. Aristo will remove older tries and only keep the last block tries. Using simpler schema will avoid those problems. * Rename getTransaction to getTransactionByIndex	2024-06-29 12:43:17 +07:00
andri lim	61a809cf4d	Remove EVM indirect imports and unused EVM errors (#2370 ) Those indirect imports were used when there were two EVMs.	2024-06-17 09:56:39 +02:00
andri lim	5a18537450	Bump nim-eth, nim-web3, nimbus-eth2 (#2344 ) * Bump nim-eth, nim-web3, nimbus-eth2 - Replace std.Option with results.Opt - Fields name changes * More fixes * Fix Portal stream async raises and portal testnet Opt usage * Bump eth + nimbus-eth2 + more fixes related to eth_types changes * Fix in utp test app and nimbus-eth2 bump * Fix test_blockchain_json rebase conflict * Fix EVMC block_timestamp conversion plus commentary --------- Co-authored-by: kdeme <kim.demey@gmail.com>	2024-06-14 14:31:08 +07:00
Jacek Sieka	189a20bbae	Avoid recomputing hashes when persisting data (#2350 )	2024-06-14 07:10:00 +02:00
Jacek Sieka	c48b527eea	simplify error handling in block processing (#2337 ) * ValidationResult -> Result * get rid of mixed exception / other styles	2024-06-11 17:50:22 +02:00
Jacek Sieka	0b32078c4b	Consolidate block type for block processing (#2325 ) This PR consolidates the split header-body sequences into a single EthBlock sequence and cleans up the fallout from that which significantly reduces block processing overhead during import thanks to less garbage collection and fewer copies of things all around. Notably, since the number of headers must always match the number of bodies, we also get rid of a pointless degree of freedom that in the future could introduce unnecessary bugs. * only read header and body from era file * avoid several unnecessary copies along the block processing way * simplify signatures, cleaning up unused arguments and returns * use `stew/assign2` in a few strategic places where the generated nim assignment is slow and add a few `move` to work around poor analysis in nim 1.6 (will need to be revisited for 2.0) ``` stats-20240607_2223-a814aa0b.csv vs stats-20240608_0714-21c1d0a9.csv bps_x bps_y tps_x tps_y bpsd tpsd timed block_number (498305, 713245] 1,540.52 1,809.73 2,361.58 2775.340189 17.63% 17.63% -14.92% (713245, 928185] 730.36 865.26 1,715.90 2028.973852 18.01% 18.01% -15.21% (928185, 1143126] 663.03 789.10 2,529.26 3032.490771 19.79% 19.79% -16.28% (1143126, 1358066] 393.46 508.05 2,152.50 2777.578119 29.13% 29.13% -22.50% (1358066, 1573007] 370.88 440.72 2,351.31 2791.896052 18.81% 18.81% -15.80% (1573007, 1787947] 283.65 335.11 2,068.93 2441.373402 17.60% 17.60% -14.91% (1787947, 2002888] 287.29 342.11 2,078.39 2474.179448 18.99% 18.99% -15.91% (2002888, 2217828] 293.38 343.16 2,208.83 2584.77457 17.16% 17.16% -14.61% (2217828, 2432769] 140.09 167.86 1,081.87 1296.336926 18.82% 18.82% -15.80% blocks: 1934464, baseline: 3h13m1s, contender: 2h43m47s bpsd (mean): 19.55% tpsd (mean): 19.55% Time (total): -29m13s, -15.14% ```	2024-06-09 16:32:20 +02:00
Jordan Hrycaj	e9eae4df70	Core db disable legacy api n remove distinct tries (#2299 ) * CoreDb: Remove crufty second/off-site KVT why: Was used to allow late `Clique` to store directly to disk * CoreDb: Remove prune flag related functionality why: Is completely legacy stuff * CoreDb: Remove dependence on legacy API (tests unsupported yet) why: Does not fully support Aristo * Re-factoring `state_db` using new API details: Only minimum changes needed to compile `nimbus` * Update tests and aux modules * Turn off legacy API and remove `distinct_tries` comment: The legacy API has now cruft status, will be removed soon * Fix copyright years * Update rpc for verified proxy --------- Co-authored-by: Jacek Sieka <jacek@status.im>	2024-06-05 20:52:04 +00:00
Jacek Sieka	7f76586214	Speed up account ledger a little (#2279 ) `persist` is a hotspot when processing blocks because it is run at least once per transaction and loops over the entire account cache every time. Here, we introduce an extra `dirty` map that keeps track of all accounts that need checking during `persist` which fixes the immediate inefficiency, though probably this could benefit from a more thorough review - we also get rid of the unused clearCache flag - we start with a fresh cache on every fresh vmState. * avoid unnecessary code hash comparisons * avoid unnecessary copies when iterating * use EMPTY_CODE_HASH throughout for code hash comparison	2024-06-02 21:21:29 +02:00
Jacek Sieka	a375720c16	import: read from era files (#2254 ) This PR extends the `nimbus import` command to also allow reading from era files - this command allows creating or topping up an existing database with data coming from era files instead of network sync. * add `--era1-dir` and `--max-blocks` options to command line * make `persistBlocks` report basic stats like transactions and gas * improve error reporting in several API * allow importing multiple RLP files in one go * clean up logging options to match nimbus-eth2 * make sure database is closed properly on shutdown	2024-05-31 09:13:56 +02:00
tersec	e895c0baeb	rm Clique consensus method support and Goerli network (#2219 ) * rm Clique consensus method support and Goerli network * rm a few more SealingEngineRef and GoerliNets	2024-05-25 16:12:14 +02:00
jangko	053fc79a8b	Engine-API simulator: allow testee client to import invalid block	2024-05-19 10:08:05 +07:00

30 Commits
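
The frame model described in commits 2961905a95 and caca11b30b (a dag of in-memory change layers over the on-disk state, lookups that walk the parent chain, checkpoint/dispose instead of commit/rollback, and a persist that builds its change set from the bottom) can be illustrated with a small sketch. This is not the nimbus-eth1 code, which is written in Nim. Every name below (`TxFrame`, `put`, `get`, `checkpoint`, `dispose`, `level`, `persist`, the `last_saved_block` key) is hypothetical, and a plain dict stands in for the on-disk database.

```python
from __future__ import annotations

from typing import Optional


class TxFrame:
    """Hypothetical in-memory layer of changes stacked on a parent frame."""

    def __init__(self, parent: Optional[TxFrame] = None) -> None:
        self.parent = parent
        self.changes: dict[str, str] = {}
        self.block_number: Optional[int] = None
        self.done = False      # set by checkpoint(): no more writes
        self.disposed = False  # set by dispose(): frame must not be used

    def put(self, key: str, value: str) -> None:
        assert not self.done and not self.disposed, "frame is read-only"
        self.changes[key] = value

    def get(self, key: str, disk: dict) -> Optional[str]:
        # Walk from this frame towards the base; fall back to "disk".
        frame: Optional[TxFrame] = self
        while frame is not None:
            if key in frame.changes:
                return frame.changes[key]
            frame = frame.parent
        return disk.get(key)

    def checkpoint(self, block_number: int) -> None:
        # Marks the point where the frame stops being written to and
        # records the block number alongside the changes it refers to.
        self.block_number = block_number
        self.done = True

    def dispose(self) -> None:
        # Purely in-memory: drop the data so it can be reclaimed.
        self.changes.clear()
        self.parent = None
        self.disposed = True

    def level(self) -> int:
        # "Distance to base", computed from the parent chain.
        depth, frame = 0, self.parent
        while frame is not None:
            depth, frame = depth + 1, frame.parent
        return depth


def persist(frame: TxFrame, disk: dict) -> None:
    """Flatten `frame` and its ancestors into `disk`, oldest layer first."""
    chain = []
    node: Optional[TxFrame] = frame
    while node is not None:
        chain.append(node)
        node = node.parent
    batch: dict = {}
    for layer in reversed(chain):  # bottom-up: newer layers overwrite older
        batch.update(layer.changes)
    batch["last_saved_block"] = frame.block_number
    disk.update(batch)
```

Two frames built on the same parent model two competing forks: each sees the shared ancestor's changes but not the other's, and persisting one of them leaves the other free to be disposed. The linear walk in `get` is also why a long frame chain makes every lookup more expensive, which is the cost the commit messages call the layering or nesting tax.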