nimbus-eth1

Commit Graph

Author	SHA1	Message	Date
Jacek Sieka	7bbb0f4421	Stream blocks during import (#2937 ) When running the import, currently blocks are loaded in batches into a `seq` then passed to the importer as such. In reality, blocks are still processed one by one, so the batching does not offer any performance advantage. It does however require that the client wastes memory, up to several GB, on the block sequence while they're waiting to be processed. This PR introduces a persister that accepts these potentially large blocks one by one and at the same time removes a number of redundant / unnecessary copies, assignments and resets that were slowing down the import process in general.	2024-12-18 13:21:20 +01:00
andri lim	1101895f92	Move rlp block import into its own subcommand (#2904 ) * Move rlp block import into its own subcommand * Fix test_configuration	2024-12-04 20:36:07 +07:00
Advaita Saha	3519b4a38c	fix: show correct blockNumber and log in end of era files (#2691 )	2024-10-04 01:37:50 +00:00
Jacek Sieka	ce331b4de8	post-merge nrpc fix (#2685 ) * post-merge nrpc fix * bump * bump * bump * bump * bump * bump * bump * bump * bump * bump * bump	2024-10-03 11:42:24 +00:00
Advaita Saha	379592e711	Fix import stuck with era history behind (#2629 ) * fix: nimbus state ahead of era history * comments * fix: suggestions * fix: messages * fix edge case resume * check from last file * formatting * fix: typo * fix: unwanted quit before rlp import	2024-09-21 08:38:38 +02:00
andri lim	4d9e288340	Wiring ForkedChainRef to other components (#2423 ) * Wiring ForkedChainRef to other components - Disable majority of hive simulators - Only enable pyspec_sim for the moment - The pyspec_sim is using a smaller RPC service wired to ForkedChainRef - The RPC service will gradually grow * Addressing PR review * Fix test_beacon/setup_env * Enable consensus_sim (#2441) * Enable consensus_sim * Remove isFile check * Enable Engine API jwt auth tests and exchange cap tests * Enable engine api in build_sim.sh * Wire ForkedChainRef to Engine API newPayload * Wire Engine API getBodies to ForkedChainRef * Wire Engine API api_forkchoice to ForkedChainRef * Wire more RPC methods to ForkedChainRef * Implement eth_syncing * Implement eth_call and eth_getlogs * TxPool: simplify smartHead * Fix smartHead usage * Fix txpool headDiff * Remove hasBlockHeader and use headerExists * Addressing review	2024-09-04 09:54:54 +00:00
Jacek Sieka	dbabe7e0a7	import: reduce stack usage (#2575 ) Because EthBlock is quite large, the stack usage that results from the multiple copies (temporary and not) present in the import command is larger than it should be - this PR moves some of that data to a closure environment allocated once per EthBlock - a larger restructuring of the code is due but in the meantime, this simple change speeds up garbage collection a little bit.	2024-08-22 10:06:45 +02:00
Jacek Sieka	d72a73de8b	avoid digest when loading era block (#2572 ) Computing the digest is unnecessary but takes a little bit of time - remove computation and reduce mem usage slightly when loading era blocks	2024-08-20 15:23:14 +02:00
Jacek Sieka	43d93bcdab	Don't write slot hashes on import (#2564 ) The reverse slot hash mechanism causes quite a bit of database traffic but is broadly not useful except for iterating the storage of an account, something that a validator never does (it's used by the tracers). This flag adds one more thing that is not stored in the database, to be explored more comprehensively when designing full, validator and archive modes with different pruning options in the future. `ldb` says this is 60gb of data (!): ``` ldb --db=. --ignore_unknown_options --column_family=KvtGen approxsize --hex --from=0x05 --to=0x05ffffffffffffffffffffffffffffffffffffffffffffff 66488353954 ```	2024-08-16 08:22:51 +02:00
Jacek Sieka	bdc86b3fd4	small cleanups (#2526 ) * remove some redundant EH * avoid pessimising move (introduces a copy in this case!) * shift less data around when reading era files (reduces stack usage)	2024-07-26 12:32:01 +07:00
Advaita Saha	08bbb0079f	faster slot finding in nimbus import (#2491 ) * faster slot finding in nimbus import * feat: blocknumber based slot finding * fix: formatting * added comments * fix: added is_execution_block * added comment	2024-07-22 21:17:07 +00:00
Advaita Saha	25af347dfd	Shift era helpers to a different file (#2475 ) * shift helpers to a different file * fix: few logic fixed for transition from era1 to era	2024-07-12 03:15:14 +00:00
Advaita Saha	9a499eb45f	Era support for nimbus import (#2429 ) * add the era-dir option * feat: support for era files in nimbus import * fix: metric logs * fix: eraDir check * fix: redundant code and sepolia support * fix: remove dependency from csv + formatting * fix: typo * fix: RVO * fix: parseBiggestInt * fix: opt impl * fix: network agnostic loading * fix: shift to int64	2024-07-09 15:28:01 +02:00
andri lim	4fa3756860	Convert GasInt to uint64, bump nim-eth and nimbus-eth2 (#2461 ) * Convert GasInt to uint64, bump nim-eth and nimbus-eth2 * Bump nimbus-eth2 * int64.high.GasInt instead of 0x7fffffffffffffff.GasInt	2024-07-07 06:52:11 +00:00
Jacek Sieka	79788c01d4	Add debug mode for disabling per-chunk state root validation (#2453 ) This significantly speeds up block import at the cost of less protection against invalid data, potentially resulting in an invalid database getting stored. The risk is small given that import is used only for validated data - evaluating the right level of validation vs performance is left for a future PR. A side effect of this approach is that there is no cached state root in the database - computing it currently requires a lot of memory since the intermediate roots get cached in memory in full while the computation is ongoing - a future PR will need to address this deficiency, for example by streaming the already-computed hashes directly to the database.	2024-07-04 16:51:50 +02:00
Jacek Sieka	9521582005	avoid closure environment for mpt methods (#2408 ) An instance of `CoreDbMptRef` is created for and stored in every account - when we are processing blocks and have many accounts in memory, this closure environment takes up hundreds of mb of memory (around block 5M, it is the 4:th largest memory consumer!) - incidentally, this also removes a circular reference in the setup that causes the `AristoCodeDbMptRef` to linger in memory much longer than it has to which is the core reason why it takes so much. The real solution here is to remove the methods indirection entirely, but this PR provides relief until that has been done. Similar treatment is given to some of the other core api functions to avoid circulars there too.	2024-06-24 07:56:41 +02:00
Jacek Sieka	83b3eeeb18	metrics: enable during import (#2401 ) This allows monitoring the import process using prometheus/grafana/etc	2024-06-20 19:06:58 +02:00
Jacek Sieka	242bbf03fc	Light verification and storage mode for import (#2367 ) When performing block import, we can batch state root verifications and header checks, doing them only once per chunk of blocks, assuming that the other blocks in the batch are valid by extension. When we're not generating receipts, we can also skip per-transaction state root computation pre-byzantium, which is what provides a ~20% speedup in this PR, at least on those early blocks :) We also stop storing transactions, receipts and uncles redundantly when importing from era1 - there is no need to waste database storage on this when we can load it from the era1 file (eventually).	2024-06-15 11:22:37 +02:00
andri lim	5a18537450	Bump nim-eth, nim-web3, nimbus-eth2 (#2344 ) * Bump nim-eth, nim-web3, nimbus-eth2 - Replace std.Option with results.Opt - Fields name changes * More fixes * Fix Portal stream async raises and portal testnet Opt usage * Bump eth + nimbus-eth2 + more fixes related to eth_types changes * Fix in utp test app and nimbus-eth2 bump * Fix test_blockchain_json rebase conflict * Fix EVMC block_timestamp conversion plus commentary --------- Co-authored-by: kdeme <kim.demey@gmail.com>	2024-06-14 14:31:08 +07:00
Jacek Sieka	189a20bbae	Avoid recomputing hashes when persisting data (#2350 )	2024-06-14 07:10:00 +02:00
Jacek Sieka	0b32078c4b	Consolidate block type for block processing (#2325 ) This PR consolidates the split header-body sequences into a single EthBlock sequence and cleans up the fallout from that which significantly reduces block processing overhead during import thanks to less garbage collection and fewer copies of things all around. Notably, since the number of headers must always match the number of bodies, we also get rid of a pointless degree of freedom that in the future could introduce unnecessary bugs. * only read header and body from era file * avoid several unnecessary copies along the block processing way * simplify signatures, cleaning up unused arguments and returns * use `stew/assign2` in a few strategic places where the generated nim assignment is slow and add a few `move` to work around poor analysis in nim 1.6 (will need to be revisited for 2.0) ``` stats-20240607_2223-a814aa0b.csv vs stats-20240608_0714-21c1d0a9.csv bps_x bps_y tps_x tps_y bpsd tpsd timed block_number (498305, 713245] 1,540.52 1,809.73 2,361.58 2775.340189 17.63% 17.63% -14.92% (713245, 928185] 730.36 865.26 1,715.90 2028.973852 18.01% 18.01% -15.21% (928185, 1143126] 663.03 789.10 2,529.26 3032.490771 19.79% 19.79% -16.28% (1143126, 1358066] 393.46 508.05 2,152.50 2777.578119 29.13% 29.13% -22.50% (1358066, 1573007] 370.88 440.72 2,351.31 2791.896052 18.81% 18.81% -15.80% (1573007, 1787947] 283.65 335.11 2,068.93 2441.373402 17.60% 17.60% -14.91% (1787947, 2002888] 287.29 342.11 2,078.39 2474.179448 18.99% 18.99% -15.91% (2002888, 2217828] 293.38 343.16 2,208.83 2584.77457 17.16% 17.16% -14.61% (2217828, 2432769] 140.09 167.86 1,081.87 1296.336926 18.82% 18.82% -15.80% blocks: 1934464, baseline: 3h13m1s, contender: 2h43m47s bpsd (mean): 19.55% tpsd (mean): 19.55% Time (total): -29m13s, -15.14% ```	2024-06-09 16:32:20 +02:00
Jacek Sieka	0268093fcc	import: add csv debug option (#2301 ) This new option saves a CSV to disk while performing `import` such that the performance of one import can be compared with the other. This early version is likely to change in the future	2024-06-06 07:03:11 +02:00
Jacek Sieka	99f2ba75f7	import: nicer stats (#2283 )	2024-06-02 13:00:05 +02:00
Jordan Hrycaj	bda760f41d	Run coredb without journal (#2266 ) * Add persistent last state stamp feature why: This allows to run `CoreDb` without journal * Start `CoreDb` without journal * Remove journal related functions from `CoreDb`	2024-05-31 17:32:22 +00:00
Jacek Sieka	a375720c16	import: read from era files (#2254 ) This PR extends the `nimbus import` command to also allow reading from era files - this command allows creating or topping up an existing database with data coming from era files instead of network sync. * add `--era1-dir` and `--max-blocks` options to command line * make `persistBlocks` report basic stats like transactions and gas * improve error reporting in several API * allow importing multiple RLP files in one go * clean up logging options to match nimbus-eth2 * make sure database is closed properly on shutdown	2024-05-31 09:13:56 +02:00

25 Commits