nimbus-eth1

Commit Graph

Author	SHA1	Message	Date
Jacek Sieka	81e75622cf	storage: store root id together with vid, for better locality of refe… (#2449 ) The state and account MPT:s currenty share key space in the database based on that vertex id:s are assigned essentially randomly, which means that when two adjacent slot values from the same contract are accessed, they might reside at large distance from each other. Here, we prefix each vertex id by its root causing them to be sorted together thus bringing all data belonging to a particular contract closer together - the same effect also happens for the main state MPT whose nodes now end up clustered together more tightly. In the future, the prefix given to the storage keys can also be used to perform range operations such as reading all the storage at once and/or deleting an account with a batch operation. Notably, parts of the API already supported this rooting concept while parts didn't - this PR makes the API consistent by always working with a root+vid.	2024-07-04 15:46:52 +02:00
Jacek Sieka	b23795ab39	remove pPrf, fRpp (#2445 ) No longer used now that hashify is gone	2024-07-03 22:21:57 +02:00
Jordan Hrycaj	ea7c756a9d	Core db reorg (#2444 ) * CoreDb: Merged all sub-descriptors into `base_desc` module * Dissolve `aristo_db/common_desc.nim` * No need to export `Aristo` methods in `CoreDb` * Resolve/tighten methods in `aristo_db` sub-moduled why: So they can be straihgt implemented into the `base` module * Moved/re-implemented `KVT` methods into `base` module * Moved/re-implemented `MPT` methods into `base` module * Moved/re-implemented account methods into `base` module * Moved/re-implemented `CTX` methods into `base` module * Moved/re-implemented `handler_{aristo,kvt}` into `aristo_db` module * Moved/re-implemented `TX` methods into `base` module * Moved/re-implemented base methods into `base` module * Replaced `toAristoSavedStateBlockNumber()` by proper base method why: Was the last for keeping reason for keeping low level backend access methods * Remove dedicated low level access to `Aristo` backend why: Not needed anymore, for debugging the descriptors can be accessed directly also: some clean up stuff * Re-factor `CoreDb` descriptor layout and adjust base methods * Moved/re-implemented iterators into `base_iterator` modules Update docu	2024-07-03 15:50:27 +00:00
Jacek Sieka	1f60e8e453	Use `Hash256` directly for account path (#2439 ) Account paths are always a hash - passing it around as such helps avoid confusion as to how long it is	2024-07-03 10:14:26 +02:00
Jordan Hrycaj	2c87fd1636	Aristo code cosmetics and tests update (#2434 ) * Update some docu * Resolve obsolete compile time option why: Not optional anymore * Update checks why: The notion of what constitutes a valid `Aristo` db has changed due to (even more) lazy calculating Merkle hash keys. * Disable redundant unit test for production	2024-07-01 10:59:18 +00:00
andri lim	740882d8ce	Import forked_chain_test in all_tests (#2433 )	2024-07-01 09:57:42 +07:00
andri lim	401537ad38	Add ForkedChainRef tests (#2430 ) ForkedChainRef have become quite complex. test_blockchain_json is not sufficient cover for edge cases or synthetic cases.	2024-06-30 14:40:14 +07:00
andri lim	c24affadee	Use simpler schema when writing transactions, receipts, and withdrawals (#2420 ) * Use simpler schema when writing transactions, receipts, and withdrawals Using MPT not only slow but also take up more spaces than needed. Aristo will remove older tries and only keep the last block tries. Using simpler schema will avoid those problems. * Rename getTransaction to getTransactionByIndex	2024-06-29 12:43:17 +07:00
andri lim	b751d3adee	Combine smaller tests into bigger one (#2425 ) 1. test_state_db and test_ledger -> test_ledger. They are the same thing now. 2. stack, memory, code_stream, gas_meter, misc, overflow -> test_evm_support. They are small tests and fall into the same area.	2024-06-29 08:57:30 +07:00
Jordan Hrycaj	8dd038144b	Some cleanups (#2428 ) * Remove `dirty` set from structural objects why: Not used anymore, the tree is dirty by default. * Rename `aristo_hashify` -> `aristo_compute` * Remove cruft, update comments, cosmetics, etc. * Simplify `SavedState` object why: The key chaining have become obsolete after extra lazy hashing. There is some available space for a state hash to be maintained in future. details: Accept the legacy `SavedState` object serialisation format for a while (which will be overwritten by new format.)	2024-06-28 18:43:04 +00:00
Jordan Hrycaj	14c3772545	On demand mpt revisited (#2426 ) * rebased from `github/on-demand-mpt` ackn: wip: on-demand mpt construction Given that actual data is stored in the `Vertex` structure, it's useful to think of the MPT as a cache for computing roots rather than being a functional requirement on its own. This PR engenders this line of thinking by incrementally computing the MPT only when it's needed, ie when a state (or similar) root is needed. This has the effect of siginficantly reducing memory usage as well as improving performance: * no need for dirty-mpt-node book-keeping * no need to build complex forest of upcoming hashing work * only hashes that are functionally needed are ever computed - intermediate nodes whose MTP root is not observed are never computed / processed * Unit test hot fixes * Unit test hot fixes cont. (somehow lost that part) --------- Co-authored-by: Jacek Sieka <jacek@status.im>	2024-06-28 15:03:12 +00:00
andri lim	44deff9b28	Enable test_txpool by disabling failing cases (#2421 ) * Enable test_txpool by disabling failing cases Because we cannot use goerli replay to feed the txpool anymore, we use only a list of transactions. But some test cases still failing because it requires block state replay. * Fix tx info	2024-06-28 11:53:25 +07:00
Jordan Hrycaj	6dc2773957	Only use pre hashed addresses as account keys (#2424 ) * Normalised storage tree addressing in function prototypes detail: Argument list is always `<db> <account-path> <slot-path> ..` with both path arguments as `openArray[]` * Remove cruft * CoreDb internally Use full account paths rather than addresses * Update API logging * Use hashed account address only in prototypes why: This avoids unnecessary repeated hashing of the same account address. The burden of doing that is upon the application. In the case here, the ledger caches all kinds of stuff anyway so it is common sense to exploit that for account address hashes. caveat: Using `openArray[byte]` argument types for hashed accounts is inherently fragile. In non-release mode, a length verification `doAssert` is enabled by default. * No accPath in data record (use `AristoAccount` as `CoreDbAccount`) * Remove now unused `eAddr` field from ledger `AccountRef` type why: Is duplicate of lookup key * Avoid merging the account record/statement in the ledger twice.	2024-06-27 19:21:01 +00:00
Jordan Hrycaj	61bbf40014	Update storage tree admin (#2419 ) * Tighten `CoreDb` API for accounts why: Apart from cruft, the way to fetch the accounts state root via a `CoreDbColRef` record was unnecessarily complicated. * Extend `CoreDb` API for accounts to cover storage tries why: In future, this will make the notion of column objects obsolete. Storage trees will then be indexed by the account address rather than the vertex ID equivalent like a `CoreDbColRef`. * Apply new/extended accounts API to ledger and tests details: This makes the `distinct_ledger` module obsolete * Remove column object constructors why: They were needed as an abstraction of MPT sub-trees including storage trees. Now, storage trees are handled by the account (e.g. via address) they belong to and all other trees can be identified by a constant well known vertex ID. So there is no need for column objects anymore. Still there are some left-over column object methods wnich will be removed next. * Remove `serialise()` and `PayloadRef` from default Aristo API why: Not needed. `PayloadRef` was used for unstructured/unknown payload formats (account or blob) and `serialise()` was used for decodng `PayloadRef`. Now it is known in advance what the payload looks like. * Added query function `hasStorageData()` whether a storage area exists why: Useful for supporting `slotStateEmpty()` of the `CoreDb` API * In the `Ledger` replace `storage.stateEmpty()` by `slotStateEmpty()` * On Aristo, hide the storage root/vertex ID in the `PayloadRef` why: The storage vertex ID is fully controlled by Aristo while the `AristoAccount` object is controlled by the application. With the storage root part of the `AristoAccount` object, there was a useless administrative burden to keep that storage root field up to date. * Remove cruft, update comments etc. * Update changed MPT access paradigms why: Fixes verified proxy tests * Fluffy cosmetics	2024-06-27 09:01:26 +00:00
andri lim	b80521a84d	ForkedChain become ForkedChainRef (#2417 ) * ForkedChain become ForkedChainRef It will be shared between engine API, RPC, and txPool * Fix ForkedChainRef constructor	2024-06-27 12:54:52 +07:00
andri lim	cd21c4fbec	ForkedChain implementation (#2405 ) * ForkedChain implementation - revamp test_blockchain_json using ForkedChain - re-enable previously failing test cases. * Remove excess error handling * Avoid reloading parent header * Do not force base update * Write baggage to database * Add findActiveChain to finalizedSegment * Create new stagingTx in addBlock * Check last stateRoot existence in test_blockchain_json * Resolve rebase conflict * More precise nomenclature for block import cursor * Ensure bad block nor imported and good block not rejected * finalizeSegment become forkChoice and align with engine API forkChoice spec * Display reason when good block rejected * Fix comments * Put BaseDistance into CalculateNewBase equation * Separate finalizedHash from baseHash * Add more doAssert constraint * Add push raises: []	2024-06-26 07:27:48 +07:00
Jacek Sieka	768307d91d	Cache code and invalid jump destination tables (fixes #2268 ) (#2404 ) It is common for many accounts to share the same code - at the database level, code is stored by hash meaning only one copy exists per unique program but when loaded in memory, a copy is made for each account. Further, every time we execute the code, it must be scanned for invalid jump destinations which slows down EVM exeuction. Finally, the extcodesize call causes code to be loaded even if only the size is needed. This PR improves on all these points by introducing a shared CodeBytesRef type whose code section is immutable and that can be shared between accounts. Further, a dedicated `len` API call is added so that the EXTCODESIZE opcode can operate without polluting the GC and code cache, for cases where only the size is requested - rocksdb will in this case cache the code itself in the row cache meaning that lookup of the code itself remains fast when length is asked for first. With 16k code entries, there's a 90% hit rate which goes up to 99% during the 2.3M attack - the cache significantly lowers memory consumption and execution time not only during this event but across the board.	2024-06-21 09:44:10 +02:00
Jacek Sieka	41cf81f80b	Fix dboptions init (#2391 ) For the block cache to be shared between column families, the options instance must be shared between the various column families being created. This also ensures that there is only one source of truth for configuration options instead of having two different sets depending on how the tables were initialized. This PR also removes the re-opening mechanism which can double startup time - every time the database is opened, the log is replayed - a large log file will take a long time to open. Finally, several options got correclty implemented as column family options, including an one that puts a hash index in the SST files.	2024-06-19 10:55:57 +02:00
andri lim	83f6f89869	Add t8n debugging tool and fix EVM regression (#2386 ) - fix blockNumber overflow in blockHash op code - reenable 3 test cases of test_blockchain_json - fix t8n crash when creating invalid tracer stream	2024-06-19 08:58:08 +07:00
Jordan Hrycaj	8727307ef4	Aristo uses pre classified tree types cont1 (#2389 ) * Provide dedicated functions for deleteing accounts and storage trees why: Storage trees are always linked to an account, so there is no need for an application to fiddle about (e.g. re-cycling, unlinking) storage tree vertex IDs. * Remove `delete()` and other cruft from API, `aristo_delete`, etc. * clean up delete functions details: The delete implementations `deleteImpl()` and `delTreeImpl()` do not need to be super generic anymore as all the edge cases are covered by the specialised `deleteAccountPayload()`, `deleteGenericData()`, etc. * Avoid unnecessary re-calculations of account keys why: The function `registerAccountForUpdate()` did extract the storage ID (if any) and automatically marked the Merkle keys along the account path for re-hashing. This would also apply if there was later detected that the account or the storage tree did not need to be updated. So the `registerAccountForUpdate()` function was split into a part which retrieved the storage ID, and another one which marked the Merkle keys for re-calculation to be applied only when needed.	2024-06-18 19:30:01 +00:00
Jordan Hrycaj	51f02090b8	Aristo uses pre classified tree types (#2385 ) * Remove unused `merge()` functions (for production) details: Some functionality moved to test suite Make sure that only `AccountData` leaf type is exactly used on VertexID(1) * clean up payload type * Provide dedicated functions for merging accounts and storage trees why: Storage trees are always linked to an account, so there is no need for an application to fiddle about (e.e. creating, re-cycling) with storage tree vertex IDs. * CoreDb: Disable tracer functionality why: Must be updated to accommodate new/changed `Aristo` functions. * CoreDb: Use new `mergeXXX()` functions why: Makes explicit vertex ID management obsolete for creating new storage trees. * Remove `mergePayload()` and other cruft from API, `aristo_merge`, etc. * clean up merge functions details: The merge implementation `mergePayloadImpl()` does not need to be super generic anymore as all the edge cases are covered by the specialised functions `mergeAccountPayload()`, `mergeGenericData()`, and `mergeStorageData()`. * No tracer available at the moment, so disable offending tests	2024-06-18 11:14:02 +00:00
Jacek Sieka	8926da02b6	Fix lowest-hanging fruit in VM (#2382 ) * replace set with bitseq for code validity test * remove unusued code from CodeStream * avoid unnecessary byte-by-byte copies	2024-06-18 07:55:35 +07:00
andri lim	a6960c3d0a	Enable test_accounts_cache (#2373 ) The module name is a misnomer, because AccountsCache have been replaced by LedgerRef. But the test still applicable. Instead of replaying unsupported goerli blocks, we generate our own transactions and block.	2024-06-17 14:19:12 +02:00
andri lim	61a809cf4d	Remove EVM indirect imports and unused EVM errors (#2370 ) Those indirect imports are used when there was two EVMs.	2024-06-17 09:56:39 +02:00
tersec	e1bb65fdfa	rm PoW hash function and validation support (#2372 )	2024-06-16 10:22:06 +07:00
andri lim	69044dda60	Remove AccountStateDB (#2368 ) * Remove AccountStateDB AccountStateDB should no longer be used. It's usage have been reduce to read only operations. Replace it with LedgerRef to reduce maintenance burden. * remove extra spaces Co-authored-by: tersec <tersec@users.noreply.github.com> --------- Co-authored-by: tersec <tersec@users.noreply.github.com>	2024-06-16 10:21:02 +07:00
Jordan Hrycaj	debba5a620	Coeredb related clean up and maint fixes (#2360 ) * Fix initialiser why: Possible crash (app profiling, tracer etc.) * Update column family options processing why: Same for kvt as for aristo * Move `AristoDbDualRocks` backend type to the test suite why: So it is not available for production * Fix typos in API jump table why: Used for tracing and app profiling only. Needed some update * Purged CoreDb legacy API why: Not needed anymore, was transitionary and disabled. * Rename `flush` argument to `eradicate` in a DB close context why: The word `eradicate` leaves no doubt what is meant * Rename `stoFlush()` -> `stoDelete()` * Rename `core_apps_newapi` -> `core_apps` (not so new anymore)	2024-06-14 11:19:48 +00:00
andri lim	5a18537450	Bump nim-eth, nim-web3, nimbus-eth2 (#2344 ) * Bump nim-eth, nim-web3, nimbus-eth2 - Replace std.Option with results.Opt - Fields name changes * More fixes * Fix Portal stream async raises and portal testnet Opt usage * Bump eth + nimbus-eth2 + more fixes related to eth_types changes * Fix in utp test app and nimbus-eth2 bump * Fix test_blockchain_json rebase conflict * Fix EVMC block_timestamp conversion plus commentary --------- Co-authored-by: kdeme <kim.demey@gmail.com>	2024-06-14 14:31:08 +07:00
andri lim	329a8f05bb	Add Cancun timestamp to MainNet preset (#2342 ) * Add Cancun timestamp to MainNet preset * Fix forkid test: add Cancun forkid	2024-06-14 05:29:09 +00:00
Jacek Sieka	189a20bbae	Avoid recomputing hashes when persisting data (#2350 )	2024-06-14 07:10:00 +02:00
Jordan Hrycaj	5a5cc6295e	Triggered write event for kvt (#2351 ) * bump rockdb * Rename `KVT` objects related to filters according to `Aristo` naming details: filter* => delta* roFilter => balancer * Compulsory error handling if `persistent()` fails * Add return code to `reCentre()` why: Might eventually fail if re-centring is blocked. Some logic will be added in subsequent patch sets. * Add column families from earlier session to rocksdb in opening procedure why: All previously used CFs must be declared when re-opening an existing database. * Update `init()` and add rocksdb `reinit()` methods for changing parameters why: Opening a set column families (with different open options) must span at least the ones that are already on disk. * Provide write-trigger-event interface into `Aristo` backend why: This allows to save data from a guest application (think `KVT`) to get synced with the write cycle so the guest and `Aristo` save all atomically. * Use `KVT` with new column family interface from `Aristo` * Remove obsolete guest interface * Implement `KVT` piggyback on `Aristo` backend * CoreDb: Add separate `KVT`/`Aristo` backend mode for debugging * Remove `rocks_db` import from `persist()` function why: Some systems (i.p `fluffy` and friends) use the `Aristo` memory backend emulation and do not link against rocksdb when building the application. So this should fix that problem.	2024-06-13 18:15:11 +00:00
andri lim	689834517b	Enable test_blockchain_json by disable some problematic cases (#2346 )	2024-06-13 09:19:07 +00:00
Jacek Sieka	c48b527eea	simplify error handling in block processing (#2337 ) * ValidationResult -> Result * get rid of mixed exception / other styles	2024-06-11 17:50:22 +02:00
Jordan Hrycaj	a347291413	Aristo use rocksdb cf instead of key pfx (#2332 ) * Use RocksDb column families instead of a prefixed single column why: Better performance * Use structural objects `VertexRef` and `HashKey` in LRU cache for RocksDb why: Avoids repeated de/serialisation	2024-06-10 12:04:22 +00:00
Jacek Sieka	f6be4bd0ec	avoid initTable (#2328 ) `initTable` is obsolete since nim 0.19 and can introduce significant memory overhead while providing no benefit (since the table will be grown to the default initial size on first use anyway). In particular, aristo layers will not necessarily use all tables they initialize, for exampe when many empty accounts are being created.	2024-06-10 11:05:30 +02:00
Jacek Sieka	0b32078c4b	Consolidate block type for block processing (#2325 ) This PR consolidates the split header-body sequences into a single EthBlock sequence and cleans up the fallout from that which significantly reduces block processing overhead during import thanks to less garbage collection and fewer copies of things all around. Notably, since the number of headers must always match the number of bodies, we also get rid of a pointless degree of freedom that in the future could introduce unnecessary bugs. * only read header and body from era file * avoid several unnecessary copies along the block processing way * simplify signatures, cleaning up unused arguemnts and returns * use `stew/assign2` in a few strategic places where the generated nim assignent is slow and add a few `move` to work around poor analysis in nim 1.6 (will need to be revisited for 2.0) ``` stats-20240607_2223-a814aa0b.csv vs stats-20240608_0714-21c1d0a9.csv bps_x bps_y tps_x tps_y bpsd tpsd timed block_number (498305, 713245] 1,540.52 1,809.73 2,361.58 2775.340189 17.63% 17.63% -14.92% (713245, 928185] 730.36 865.26 1,715.90 2028.973852 18.01% 18.01% -15.21% (928185, 1143126] 663.03 789.10 2,529.26 3032.490771 19.79% 19.79% -16.28% (1143126, 1358066] 393.46 508.05 2,152.50 2777.578119 29.13% 29.13% -22.50% (1358066, 1573007] 370.88 440.72 2,351.31 2791.896052 18.81% 18.81% -15.80% (1573007, 1787947] 283.65 335.11 2,068.93 2441.373402 17.60% 17.60% -14.91% (1787947, 2002888] 287.29 342.11 2,078.39 2474.179448 18.99% 18.99% -15.91% (2002888, 2217828] 293.38 343.16 2,208.83 2584.77457 17.16% 17.16% -14.61% (2217828, 2432769] 140.09 167.86 1,081.87 1296.336926 18.82% 18.82% -15.80% blocks: 1934464, baseline: 3h13m1s, contender: 2h43m47s bpsd (mean): 19.55% tpsd (mean): 19.55% Time (total): -29m13s, -15.14% ```	2024-06-09 16:32:20 +02:00
andri lim	4497b5f4f1	Enable test_tracer_json (#2326 )	2024-06-08 11:36:51 +00:00
web3-developer	db8c5b90bd	Cleanup stateless and block witness code. (#2295 ) * Cleanup unneeded stateless and block witness code. Keeping MultiKeys which is used in the eth_getProofsByBlockNumber RPC endpoint which is needed for the Fluffy state network bridge. * Rename generateWitness flag to collectWitnessData to better describe what the flag does. We only collect the keys of the touched accounts and storage slots but no block witness generation is supported for now. * Move remaining stateless code into nimbus directory. * Add vmstate parameter to ChainRef to fix test. * Exclude *.in from check copyright year --------- Co-authored-by: jangko <jangko128@gmail.com>	2024-06-08 15:05:00 +07:00
tersec	5008b89185	EIP-2537 BLS12-381 G1 add/mul/exp and G2 add/mul support with tests (#2315 )	2024-06-08 07:39:53 +07:00
tersec	4e50b05564	re-disable test_tracer_json (#2320 )	2024-06-07 13:29:55 +00:00
Jordan Hrycaj	392088e5e9	Coredb fix storage tree issues (#2317 ) * Code cosmetics * Re-org `aristo_merge`, internally split into sub-modules why: Became a burden for maintenance because it hosts two different functionalities under the same merge paradigm: account/data merge and snap proof merge where the latter produces a partial trie. * Fix CoreDb tracer * Ledger: fix potential account vs. storage tree sync problems * Remove bound on the size of removable whole storage trees * Activate `test_tracer_json`	2024-06-07 10:56:31 +00:00
andri lim	b3a5c67532	Remove exceptions from EVM (#2314 ) * Remove exception from evm memory * Remove exception from gas meter * Remove exception from stack * Remove exception from precompiles * Remove exception from gas_costs * Remove exception from op handlers * Remove exception from op dispatcher * Remove exception from call_evm * Remove exception from EVM * Fix tools and tests * Remove exception from EVMC * fix evmc * Fix evmc * Remove remnants of async evm stuff * Remove superflous error handling * Proc to func * Fix errors detected by CI * Fix EVM op call stack usage * REmove exception handling from getVmState * Better error message instead of just doAssert * Remove unused validation * Remove superflous catchRaise * Use results.expect instead of unsafeValue	2024-06-07 15:24:32 +07:00
tersec	41ea2c0c0a	rm Goerli test replay (#2313 )	2024-06-07 09:01:59 +07:00
tersec	0f6c8f6e6a	enable JWT auth tests (#2312 )	2024-06-07 09:01:45 +07:00
tersec	d9375858fe	rm withdrawn EIP-2315 (#2309 ) * rm withdrawn EIP-2315 * copyright year linting * EVMC	2024-06-07 08:59:05 +07:00
Jordan Hrycaj	1e65093b3e	Remove obsolete tests (#2307 ) * Remove `test_sync_snap` why: Snap sync needs to be re-factored. All the interesting database parts from this test suite has been recycled into `Aristo` * Remove `test_rocksdb_timing` * Update `all_tests`	2024-06-06 09:29:38 +00:00
Jacek Sieka	32c7fe74be	Remove keyed_queue rlp support (#2300 ) It's unused and causing trouble because of unhandled exception effects - if we were to use it, it would need re-implementation such that it doesn't reallocate the whole queue on writing.	2024-06-06 00:01:18 +02:00
Jordan Hrycaj	e9eae4df70	Core db disable legacy api n remove distinct tries (#2299 ) * CoreDb: Remove crufty second/off-site KVT why: Was used to allow late `Clique` to store directly to disk * CoreDb: Remove prune flag related functionality why: Is completely legacy stuff * CoreDb: Remove dependence on legacy API (tests unsupported yet) why: Does not fully support Aristo * Re-factoring `state_db` using new API details: Only minimum changes needed to compile `nimbus` * Update tests and aux modules * Turn off legacy API and remove `distinct_tries` comment: The legacy API has now cruft status, will be removed soon * Fix copyright years * Update rpc for verified proxy --------- Co-authored-by: Jacek Sieka <jacek@status.im>	2024-06-05 20:52:04 +00:00
Jacek Sieka	c876729c4d	Add some basic rocksdb options to command line (#2286 ) These options are there mainly to drive experiments, and are therefore hidden. One thing that this PR brings in is an initial set of caches and buffers for rocksdb - the set that I've been using during various performance tests to get to a viable baseline performance level.	2024-06-05 17:08:29 +02:00
Jordan Hrycaj	69a158864c	Remove vid recycling feature (#2294 )	2024-06-04 15:05:13 +00:00

1 2 3 4 5 ...

977 Commits