Commit Graph

1793 Commits

Author SHA1 Message Date
Jordan Hrycaj 14c3772545
On demand mpt revisited (#2426)
* rebased from `github/on-demand-mpt`

ackn:
  wip: on-demand mpt construction

  Given that actual data is stored in the `Vertex` structure, it's useful
  to think of the MPT as a cache for computing roots rather than being a
  functional requirement on its own.

  This PR embodies this line of thinking by computing the MPT incrementally,
  only when it is needed, i.e. when a state (or similar) root is requested.

  This has the effect of significantly reducing memory usage as well as
  improving performance:

  * no need for dirty-mpt-node book-keeping
  * no need to build complex forest of upcoming hashing work
  * only hashes that are functionally needed are ever computed -
  intermediate nodes whose MPT root is not observed are never computed /
  processed

* Unit test hot fixes

* Unit test hot fixes cont.

(somehow lost that part)

---------

Co-authored-by: Jacek Sieka <jacek@status.im>
2024-06-28 15:03:12 +00:00
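As an illustration of the lazy-hashing idea described in this commit, here is a minimal Nim sketch (hypothetical `Tree`/`Vertex` types and a stand-in hash function, not the actual Aristo code): writes only touch vertex data, and Merkle keys are computed on demand when a root is requested.

```nim
import std/[tables, options]

type
  VertexId = int
  Vertex = object
    data: string            # the actual payload, always up to date
    key: Option[string]     # cached hash, cleared whenever data changes

  Tree = object
    vertices: Table[VertexId, Vertex]
    children: Table[VertexId, seq[VertexId]]

proc hashOf(s: string): string =
  # stand-in for keccak; only the caching pattern matters here
  $s.len & ":" & s

proc update(t: var Tree, id: VertexId, data: string) =
  ## Writes only touch the vertex - no hashing, no dirty-node book-keeping.
  ## (A fuller version would also clear cached keys on the path to the root.)
  t.vertices[id] = Vertex(data: data, key: none(string))

proc computeKey(t: var Tree, id: VertexId): string =
  ## Hashes are computed on demand and only for the sub-tree that is
  ## actually reachable from the requested root.
  if t.vertices[id].key.isSome:
    return t.vertices[id].key.get
  var acc = t.vertices[id].data
  for child in t.children.getOrDefault(id):
    acc.add t.computeKey(child)
  result = hashOf(acc)
  t.vertices[id].key = some(result)

when isMainModule:
  var t: Tree
  t.update(1, "root")
  t.update(2, "leaf")
  t.children[1] = @[2]
  discard t.computeKey(1)   # the only place where any hashing happens
```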
Jordan Hrycaj 6dc2773957
Only use pre hashed addresses as account keys (#2424)
* Normalised storage tree addressing in function prototypes

detail:
  Argument list is always `<db> <account-path> <slot-path> ..` with
  both path arguments as `openArray[]`

* Remove cruft

* CoreDb: internally use full account paths rather than addresses

* Update API logging

* Use hashed account address only in prototypes

why:
  This avoids unnecessary repeated hashing of the same account address.
  The burden of doing that is upon the application. In the case here,
  the ledger caches all kinds of stuff anyway so it is common sense to
  exploit that for account address hashes.

caveat:
  Using `openArray[byte]` argument types for hashed accounts is inherently
  fragile. In non-release mode, a length verification `doAssert` is
  enabled by default.

* No accPath in data record (use `AristoAccount` as `CoreDbAccount`)

* Remove now unused `eAddr` field from ledger `AccountRef` type

why:
  Is duplicate of lookup key

* Avoid merging the account record/statement in the ledger twice.
2024-06-27 19:21:01 +00:00
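To illustrate the "hash the address once and reuse it" idea from this commit, a minimal Nim sketch with hypothetical `Ledger`/`CachedAccount` types and a stand-in for keccak256 (not the actual nimbus ledger code):

```nim
import std/tables

type
  Address = array[20, byte]
  AccountPath = array[32, byte]   # hashed address, used as the account key

  CachedAccount = object
    path: AccountPath             # computed exactly once per account
    balance: uint64

  Ledger = object
    accounts: Table[Address, CachedAccount]

proc hashAddress(adr: Address): AccountPath =
  ## Stand-in for keccak256(address); only the hash-once pattern matters.
  for i, b in adr:
    result[i] = b xor 0x5a

proc account(l: var Ledger, adr: Address): var CachedAccount =
  ## The first access pays for the hash; later accesses reuse the cached path.
  if adr notin l.accounts:
    l.accounts[adr] = CachedAccount(path: hashAddress(adr))
  l.accounts[adr]

proc accountKey(l: var Ledger, adr: Address): AccountPath =
  ## What a CoreDb-style call would receive: the pre-hashed account path,
  ## so the database layer never re-hashes the address itself.
  l.account(adr).path

when isMainModule:
  var ledger: Ledger
  var adr: Address
  adr[0] = 0xaa
  let k1 = ledger.accountKey(adr)   # hashes here
  let k2 = ledger.accountKey(adr)   # reuses the cached path
  doAssert k1 == k2
```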
Jordan Hrycaj 61bbf40014
Update storage tree admin (#2419)
* Tighten `CoreDb` API for accounts

why:
  Apart from cruft, the way to fetch the accounts state root via a
  `CoreDbColRef` record was unnecessarily complicated.

* Extend `CoreDb` API for accounts to cover storage tries

why:
  In future, this will make the notion of column objects obsolete. Storage
  trees will then be indexed by the account address rather than the vertex
  ID equivalent like a `CoreDbColRef`.

* Apply new/extended accounts API to ledger and tests

details:
  This makes the `distinct_ledger` module obsolete

* Remove column object constructors

why:
  They were needed as an abstraction of MPT sub-trees including storage
  trees. Now, storage trees are handled by the account (e.g. via address)
  they belong to and all other trees can be identified by a constant well
  known vertex ID. So there is no need for column objects anymore.

  Still, there are some left-over column object methods which will be
  removed next.

* Remove `serialise()` and `PayloadRef` from default Aristo API

why:
  Not needed. `PayloadRef` was used for unstructured/unknown payload
  formats (account or blob) and `serialise()` was used for decoding
  `PayloadRef`. Now it is known in advance what the payload looks
  like.

* Added query function `hasStorageData()` to check whether a storage area exists

why:
  Useful for supporting `slotStateEmpty()` of the `CoreDb` API

* In the `Ledger` replace `storage.stateEmpty()` by `slotStateEmpty()`

* On Aristo, hide the storage root/vertex ID in the `PayloadRef`

why:
  The storage vertex ID is fully controlled by Aristo while the
  `AristoAccount` object is controlled by the application. With the
  storage root being part of the `AristoAccount` object, there was a needless
  administrative burden to keep that storage root field up to date.

* Remove cruft, update comments etc.

* Update changed MPT access paradigms

why:
  Fixes verified proxy tests

* Fluffy cosmetics
2024-06-27 09:01:26 +00:00
web3-developer ea94e8a351
Use RocksDb column family handles instead of name strings. (#2418)
* Bump RocksDb to latest and update Nimbus database to pass column family handles to RocksDb API.

* Bump RocksDb version.
2024-06-27 16:51:43 +08:00
andri lim b80521a84d
ForkedChain become ForkedChainRef (#2417)
* ForkedChain become ForkedChainRef

It will be shared between engine API, RPC, and txPool

* Fix ForkedChainRef constructor
2024-06-27 12:54:52 +07:00
andri lim 27339e9520
Simplify txpool baseFeeGet (#2416)
* Simplify txpool baseFeeGet

- Avoid using toEVMFork because we are not in EVM
- Rename `isLondon` to `isLondonOrLater`

* Remove timestamp from isLondonOrLater
2024-06-27 12:54:36 +07:00
Jacek Sieka c8cdffa775
Small cleanups (#2414)
* remove unnecessary / expensive error checking
* avoid some trivial memory allocs
* work around table move bug
2024-06-26 09:25:09 +02:00
andri lim cd21c4fbec
ForkedChain implementation (#2405)
* ForkedChain implementation

- revamp test_blockchain_json using ForkedChain
- re-enable previously failing test cases.

* Remove excess error handling

* Avoid reloading parent header

* Do not force base update

* Write baggage to database

* Add findActiveChain to finalizedSegment

* Create new stagingTx in addBlock

* Check last stateRoot existence in test_blockchain_json

* Resolve rebase conflict

* More precise nomenclature for block import cursor

* Ensure bad block not imported and good block not rejected

* finalizeSegment becomes forkChoice and aligns with the engine API forkChoice spec

* Display reason when good block rejected

* Fix comments

* Put BaseDistance into CalculateNewBase equation

* Separate finalizedHash from baseHash

* Add more doAssert constraints

* Add push raises: []
2024-06-26 07:27:48 +07:00
Jacek Sieka 3e001e322c
Fix memory usage spikes during sync, give memory to rocksdb (#2413)
* creating a seq from a table that holds lots of changes means copying
all the data out of the table into the seq - this can be several GB of data
while syncing blocks
* nim fails to optimize the moving of the `WidthFirstForest` - the real
solution is to not construct a `wff` to begin with, but this PR provides
relief while that is being worked on

This spike fix allows us to bump the rocksdb cache by another 2 GB and
still have a significantly lower peak memory usage during sync.
2024-06-25 13:39:53 +02:00
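The copying problem described above, in a small hedged Nim sketch (assumed types; `WidthFirstForest` itself is not shown): materialising the table into a seq duplicates every pending change, while iterating the table in place does not.

```nim
import std/[tables, sequtils]

type
  VertexId = uint64
  Change = array[64, byte]          # stand-in for a chunky per-vertex record

proc persistCopying(changes: Table[VertexId, Change]) =
  # toSeq builds a complete second copy of every (key, value) pair before
  # anything is written - with millions of pending changes during sync this
  # is where the multi-GB spike comes from
  for (id, change) in toSeq(changes.pairs):
    discard (id, change)            # write to the backend here

proc persistInPlace(changes: var Table[VertexId, Change]) =
  # iterate the table directly and clear it afterwards - no intermediate
  # seq and no duplicate of the data
  for id, change in changes:
    discard (id, change)            # write to the backend here
  changes.clear()
```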
Jacek Sieka f294d1e086
Clear account cache after each block (#2411)
When processing long ranges of blocks, the account cache grows unbounded,
which causes huge memory spikes.

Here, we move the cache to a second-level cache after each block - the
second-level cache is cleared on the next block after that, which creates
a simple LRU effect.

There's a small performance cost of course, though overall the freed-up
memory can now be reassigned to the rocksdb row cache which not only
makes up for the loss but overall leads to a performance increase.

The bump to 2 GB of rocksdb row cache here needs more testing but is
slightly less than, and loosely based on, the savings from this PR and the
circular ref fix in #2408 - another way to phrase this is that it's
better to give rocksdb more breathing room than to let the memory sit
unused until circular ref collection happens ;)
2024-06-25 07:30:32 +02:00
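A minimal Nim sketch of the two-level scheme (hypothetical `TwoLevelCache` type, not the actual ledger cache): entries touched during the current block live in the first level, everything else is demoted and dropped one block later.

```nim
import std/[tables, options]

type
  TwoLevelCache[K, V] = object
    primary: Table[K, V]     # entries touched since the last block boundary
    secondary: Table[K, V]   # previous block's entries, dropped next rotation

proc get[K, V](c: var TwoLevelCache[K, V], key: K): Option[V] =
  if key in c.primary:
    return some(c.primary[key])
  if key in c.secondary:
    # promote back to the first level so it survives the next rotation
    let v = c.secondary[key]
    c.primary[key] = v
    return some(v)
  none(V)

proc put[K, V](c: var TwoLevelCache[K, V], key: K, val: V) =
  c.primary[key] = val

proc rotate[K, V](c: var TwoLevelCache[K, V]) =
  ## Called once per block: anything not touched during the last block
  ## falls out of the cache here, giving a coarse LRU effect.
  c.secondary = move(c.primary)

when isMainModule:
  var cache: TwoLevelCache[string, int]
  cache.put("alice", 1)
  cache.rotate()                         # demoted to the second level
  doAssert cache.get("alice").isSome     # touched, so promoted back
  cache.rotate()
  cache.rotate()
  doAssert cache.get("alice").isNone     # untouched for a full block: evicted
```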
andri lim c79b0b8a47
Avoid loading parent header from db in gaslimit validation (#2410) 2024-06-24 08:40:22 +02:00
andri lim 6a10dfd0fe
Remove pre and post opcode handlers from EVM (#2409) 2024-06-24 07:58:15 +02:00
Jacek Sieka 9521582005
avoid closure environment for mpt methods (#2408)
An instance of `CoreDbMptRef` is created for and stored in every account
- when we are processing blocks and have many accounts in memory, this
closure environment takes up hundreds of MB of memory (around block 5M,
it is the 4th largest memory consumer!) - incidentally, this also
removes a circular reference in the setup that causes the
`AristoCodeDbMptRef` to linger in memory much longer than it
has to, which is the core reason why it takes up so much.

The real solution here is to remove the methods indirection entirely,
but this PR provides relief until that has been done.

Similar treatment is given to some of the other core api functions to
avoid circulars there too.
2024-06-24 07:56:41 +02:00
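A hedged Nim sketch of the difference (illustrative types only, not the actual `CoreDbMptRef` setup): the closure-based method table allocates a hidden environment per instance and can capture references that form cycles, while a plain object with free procs carries just the state it needs.

```nim
type
  Db = ref object
    data: seq[string]

  # method-table style: every instance carries closures, and each closure
  # gets its own heap-allocated environment capturing `db`
  ClosureMpt = ref object
    fetchFn: proc(i: int): string {.closure.}

  # plain object plus free procs: no hidden environment, no extra allocs,
  # and no chance of a closure keeping the whole setup alive
  PlainMpt = object
    db: Db

proc newClosureMpt(db: Db): ClosureMpt =
  ClosureMpt(fetchFn: proc(i: int): string = db.data[i])

proc fetch(m: PlainMpt, i: int): string =
  m.db.data[i]

when isMainModule:
  let db = Db(data: @["a", "b"])
  let viaClosure = newClosureMpt(db)   # one closure environment per instance
  doAssert viaClosure.fetchFn(1) == "b"
  let viaPlain = PlainMpt(db: db)      # just a pointer-sized field
  doAssert viaPlain.fetch(0) == "a"
```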
andri lim 99ff8dc876
Fix t8n: blobGasUsed exceeds allowance issue (#2407)
* Fix t8n: blobGasUsed exceeds allowance issue

* Put blobGasUsed validation into transaction processing pipeline
2024-06-24 07:56:24 +02:00
Jacek Sieka 6b68ff92d3
Allocation-free nibbles buffer (#2406)
This buffer eliminates a large part of the allocations during MPT traversal,
reducing overall memory usage and GC pressure.

Ideally, we would use it throughout the API instead of
`openArray[byte]` since the built-in length limit appropriately exposes
the natural 64-nibble depth constraint that `openArray` fails to
capture.
2024-06-22 22:33:37 +02:00
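A hedged sketch of what an allocation-free nibble path can look like in Nim (hypothetical layout, not necessarily the fields used in this PR): the whole 64-nibble path fits in a fixed-size array, so no seq is ever allocated while traversing.

```nim
type
  NibblesBuf = object
    ## Fixed-capacity path buffer: a 32-byte MPT key has at most 64 nibbles,
    ## so the whole path fits in a stack-allocated array - no seq, no GC.
    bytes: array[32, byte]
    ibegin, iend: int8           # nibble indices into `bytes`

func len(n: NibblesBuf): int =
  int(n.iend - n.ibegin)

func `[]`(n: NibblesBuf, i: int): byte =
  ## Nibble `i` of the path: high nibble for even positions, low for odd.
  let pos = int(n.ibegin) + i
  if (pos and 1) == 0:
    n.bytes[pos shr 1] shr 4
  else:
    n.bytes[pos shr 1] and 0x0f

func fromBytes(T: type NibblesBuf, data: openArray[byte]): T =
  ## At most 32 bytes / 64 nibbles can ever be stored - the length limit
  ## that a plain openArray[byte] does not express.
  doAssert data.len <= 32
  for i, b in data:
    result.bytes[i] = b
  result.iend = int8(2 * data.len)

when isMainModule:
  let path = NibblesBuf.fromBytes([0xab'u8, 0xcd])
  doAssert path.len == 4
  doAssert path[0] == 0x0a and path[3] == 0x0d
```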
Jacek Sieka 768307d91d
Cache code and invalid jump destination tables (fixes #2268) (#2404)
It is common for many accounts to share the same code - at the database
level, code is stored by hash meaning only one copy exists per unique
program but when loaded in memory, a copy is made for each account.

Further, every time we execute the code, it must be scanned for invalid
jump destinations, which slows down EVM execution.

Finally, the extcodesize call causes code to be loaded even if only the
size is needed.

This PR improves on all these points by introducing a shared
CodeBytesRef type whose code section is immutable and that can be shared
between accounts. Further, a dedicated `len` API call is added so that
the EXTCODESIZE opcode can operate without polluting the GC and code
cache, for cases where only the size is requested - rocksdb will in this
case cache the code itself in the row cache meaning that lookup of the
code itself remains fast when length is asked for first.

With 16k code entries, there's a 90% hit rate which goes up to 99%
during the 2.3M attack - the cache significantly lowers memory
consumption and execution time not only during this event but across the
board.
2024-06-21 09:44:10 +02:00
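A rough Nim sketch of the sharing scheme (hypothetical cache types and a simplified jump-destination scan, not the actual CodeBytesRef implementation): one immutable copy per code hash, an analysis table computed once, and a size query that leaves the cache untouched on a miss.

```nim
import std/tables

type
  CodeBytesRef = ref object
    ## One immutable copy of a program, shared by every account with the
    ## same code hash; the jump-destination table is computed only once.
    code: seq[byte]
    invalidJumpDest: seq[bool]

  CodeCache = object
    entries: Table[array[32, byte], CodeBytesRef]   # keyed by code hash

proc invalidJumpDests(code: seq[byte]): seq[bool] =
  ## Every position starts out invalid; only a JUMPDEST (0x5b) that sits at
  ## an opcode boundary (not inside PUSH immediate data) becomes valid.
  result = newSeq[bool](code.len)
  for i in 0 ..< code.len:
    result[i] = true
  var i = 0
  while i < code.len:
    let op = code[i]
    if op == 0x5b'u8:
      result[i] = false                          # valid JUMPDEST
    elif op >= 0x60'u8 and op <= 0x7f'u8:
      i += int(op - 0x60'u8) + 1                 # skip PUSH1..PUSH32 data
    inc i

proc getCode(cache: var CodeCache, codeHash: array[32, byte],
             loadFromDb: proc(): seq[byte]): CodeBytesRef =
  ## Hit: hand out the shared instance. Miss: load, analyse once, cache.
  if codeHash in cache.entries:
    return cache.entries[codeHash]
  let code = loadFromDb()
  result = CodeBytesRef(code: code, invalidJumpDest: invalidJumpDests(code))
  cache.entries[codeHash] = result

proc codeSize(cache: CodeCache, codeHash: array[32, byte],
              loadLenFromDb: proc(): int): int =
  ## EXTCODESIZE path: answer from the cache when possible, otherwise ask
  ## the database for the length only and leave the cache untouched.
  if codeHash in cache.entries:
    cache.entries[codeHash].code.len
  else:
    loadLenFromDb()
```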
Jacek Sieka 83b3eeeb18
metrics: enable during import (#2401)
This allows monitoring the import process using prometheus/grafana/etc
2024-06-20 19:06:58 +02:00
Jordan Hrycaj 081cb15493
Coredb maintenance (#2398)
* CoreDb: remove PHK tries

why:
  There is no general use anymore for an MPT with a pre-hashed key. It
  was used to resemble the `SecureHexaryTrie` logic from the legacy DB.

  The only place where this is needed is the `Ledger`, which uses a
  distinct MPT version anyway (see `distinct_ledgers.nim`).

* Rename `CoreDx*` -> `CoreDb*`

why:
  The naming `CoreDx*` was used to differentiate the new CoreDb API from
  the legacy API which had descriptors named `CoreDb*`.
2024-06-19 14:13:12 +00:00
Jordan Hrycaj e7be0d185c
Aristo uses pre classified tree types cont2 (#2397)
* Provide dedicated functions for fetching accounts and storage trees

why:
  Different prototypes for each class `account`, `generic` and
  `storage`.

* Remove `fetchPayload()` and other cruft from API, `aristo_fetch`, etc.

* Fix typos, debugging left overs, comments
2024-06-19 12:40:00 +00:00
andri lim 035ef696a6
EVMC refundGas not breaching host/evm separation anymore (#2395) 2024-06-19 14:15:23 +02:00
andri lim 0e5fd3ffc9
LedgerRef: stateOrVoid become stateEmptyOrVoid (#2394) 2024-06-19 14:14:36 +02:00
andri lim 5a39fc0d69
Remove unused dbkey (#2396) 2024-06-19 14:11:14 +02:00
Jacek Sieka 41cf81f80b
Fix dboptions init (#2391)
For the block cache to be shared between column families, the options
instance must be shared between the various column families being
created. This also ensures that there is only one source of truth for
configuration options instead of having two different sets depending on
how the tables were initialized.

This PR also removes the re-opening mechanism which can double startup
time - every time the database is opened, the log is replayed - a large
log file will take a long time to open.

Finally, several options got correctly implemented as column family
options, including one that puts a hash index in the SST files.
2024-06-19 10:55:57 +02:00
andri lim 83f6f89869
Add t8n debugging tool and fix EVM regression (#2386)
- fix blockNumber overflow in blockHash op code
- reenable 3 test cases of test_blockchain_json
- fix t8n crash when creating invalid tracer stream
2024-06-19 08:58:08 +07:00
Kim De Mey 4fd2ecddec
Bump nim-eth/web3/kzg4844/nimbus-eth2 and related fixes (#2392)
Bump nim-eth, which requires nimbus-eth2 bump, which requires
bumps of web3 and kzg4844 + related fixes to all those bumps.
2024-06-19 08:57:45 +07:00
Jacek Sieka 1a96b4a97c
evm: generate more specialized functions (#2390)
Nicer name in profiler and avoids a few range checks
2024-06-19 08:57:29 +07:00
Miran ea0d18424a
use Nim 2.0.6 (#2384)
* use Nim 2.0.6

* Fixes for nim 2.0.6

* Workaround nim 2.0 array indexing issue

* Remove excess gcsafe pragma

* Oops, fix recursive template

* Fix imports

* Fluffy nph linting

---------

Co-authored-by: jangko <jangko128@gmail.com>
Co-authored-by: tersec <tersec@users.noreply.github.com>
2024-06-19 01:27:54 +00:00
Jordan Hrycaj 8727307ef4
Aristo uses pre classified tree types cont1 (#2389)
* Provide dedicated functions for deleting accounts and storage trees

why:
  Storage trees are always linked to an account, so there is no need
  for an application to fiddle about (e.g. re-cycling, unlinking)
  storage tree vertex IDs.

* Remove `delete()` and other cruft from API, `aristo_delete`, etc.

* clean up delete functions

details:
  The delete implementations `deleteImpl()` and `delTreeImpl()` do not
  need to be super generic anymore as all the edge cases are covered by
  the specialised `deleteAccountPayload()`, `deleteGenericData()`, etc.

* Avoid unnecessary re-calculations of account keys

why:
  The function `registerAccountForUpdate()` did extract the storage ID
  (if any) and automatically marked the Merkle keys along the account
  path for re-hashing.

  This would also apply if it was later detected that the account
  or the storage tree did not need to be updated.

  So the `registerAccountForUpdate()` function was split into a part
  which retrieved the storage ID, and another one which marked the
  Merkle keys for re-calculation to be applied only when needed.
2024-06-18 19:30:01 +00:00
Jordan Hrycaj 51f02090b8
Aristo uses pre classified tree types (#2385)
* Remove unused `merge*()` functions (for production)

details:
  Some functionality moved to test suite

* Make sure that only the `AccountData` leaf type is ever used on VertexID(1)

* clean up payload type

* Provide dedicated functions for merging accounts and storage trees

why:
  Storage trees are always linked to an account, so there is no need
  for an application to fiddle about (e.g. creating, re-cycling) with
  storage tree vertex IDs.

* CoreDb: Disable tracer functionality

why:
  Must be updated to accommodate new/changed `Aristo` functions.

* CoreDb: Use new `mergeXXX()` functions

why:
  Makes explicit vertex ID management obsolete for creating new
  storage trees.

* Remove `mergePayload()` and other cruft from API, `aristo_merge`, etc.

* clean up merge functions

details:
  The merge implementation `mergePayloadImpl()` does not need to be super
  generic anymore as all the edge cases are covered by the specialised
  functions `mergeAccountPayload()`, `mergeGenericData()`, and
  `mergeStorageData()`.

* No tracer available at the moment, so disable offending tests
2024-06-18 11:14:02 +00:00
Jacek Sieka 8926da02b6
Fix lowest-hanging fruit in VM (#2382)
* replace set with bitseq for code validity test
* remove unused code from CodeStream
* avoid unnecessary byte-by-byte copies
2024-06-18 07:55:35 +07:00
Jacek Sieka 135ef222a2
avoid intermediate const in opcodes (#2381)
The extra layer of `const` makes the function name harder to see in a
debugger / profiler
2024-06-17 18:13:38 +02:00
Jacek Sieka 9c6fd46a51
avoid computing state root just to know if storage is empty (#2380)
The state root computation here is one of the major hotspots in block
processing - in the cases where the code only needs to know whether the
storage is empty or not, this can be done a lot faster.

Adding a separate function for this looks fragile and should probably be
revisited.
2024-06-17 15:29:07 +02:00
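A small hedged sketch of the shortcut (toy trie representation, not Aristo internals): answering "is the storage empty?" only needs an existence check, whereas answering "what is the storage root hash?" forces the expensive Merkle computation.

```nim
import std/tables

type
  VertexId = uint64
  AccountId = uint64

  Trie = object
    # account -> root vertex of its storage sub-tree (absent when empty)
    storageRoot: Table[AccountId, VertexId]

proc computeStorageRootHash(t: Trie, acc: AccountId): string =
  ## The expensive path: would hash the whole (possibly dirty) sub-tree.
  "<root hash of the whole sub-tree>"

proc slotStateEmpty(t: Trie, acc: AccountId): bool =
  ## The cheap path: emptiness is a plain existence check, no hashing.
  acc notin t.storageRoot
```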
Jacek Sieka 9cf7e6aea3
Avoid creating database transaction for every block (#2379)
Broadly, when importing blocks we don't need a transaction / frame per
block because we can simply abort the whole update and try again with a
smaller range if we find a faulty block.

Of course, this applies mainly to semi-trusted blocks where we're not
expected to fail in applying them - this could be blocks either from
files or header-verified blocks as given by consensus.
2024-06-17 15:28:44 +02:00
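The commit message above describes the control flow rather than the code; here is a hedged Nim sketch of that flow with hypothetical `Db`/`importBlock` stand-ins (not the nimbus API): one transaction per chunk, rolled back and retried on smaller ranges when a faulty block is hit.

```nim
type
  Block = object
    number: uint64

  Db = ref object
    inTx: bool

  ImportError = object of CatchableError

# hypothetical backend hooks - stand-ins for the real CoreDb / chain calls
proc beginTx(db: Db) = db.inTx = true
proc commit(db: Db) = db.inTx = false
proc rollback(db: Db) = db.inTx = false

proc importBlock(db: Db, blk: Block) =
  if blk.number == 666:                # pretend this block is faulty
    raise newException(ImportError, "bad block")

proc importChunk(db: Db, blocks: openArray[Block]) =
  ## One frame for the whole chunk instead of one per block.
  db.beginTx()
  try:
    for blk in blocks:
      db.importBlock(blk)
    db.commit()
  except ImportError:
    # abort the whole update and retry with smaller ranges so the faulty
    # block can be pinpointed without paying for per-block transactions
    db.rollback()
    if blocks.len > 1:
      let mid = blocks.len div 2
      db.importChunk(blocks.toOpenArray(0, mid - 1))
      db.importChunk(blocks.toOpenArray(mid, blocks.len - 1))
    else:
      raise

when isMainModule:
  let db = Db()
  let blocks = @[Block(number: 1), Block(number: 2), Block(number: 3)]
  db.importChunk(blocks)   # no faulty block here, so one commit for all three
```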
Jacek Sieka 1fb658ff03
Remove hashify calls when forking (#2377)
This appears to no longer be needed and we want to delay hashing as much
as possible.
2024-06-17 14:18:50 +02:00
andri lim 61a809cf4d
Remove EVM indirect imports and unused EVM errors (#2370)
Those indirect imports were used when there were two EVMs.
2024-06-17 09:56:39 +02:00
tersec e1bb65fdfa
rm PoW hash function and validation support (#2372) 2024-06-16 10:22:06 +07:00
andri lim 69044dda60
Remove AccountStateDB (#2368)
* Remove AccountStateDB

AccountStateDB should no longer be used.
Its usage has been reduced to read-only operations.
Replace it with LedgerRef to reduce the maintenance burden.

* remove extra spaces

Co-authored-by: tersec <tersec@users.noreply.github.com>

---------

Co-authored-by: tersec <tersec@users.noreply.github.com>
2024-06-16 10:21:02 +07:00
Jacek Sieka af34f90fe4
fix `max_total_wal_size` which should be set on the DB (#2363) 2024-06-16 02:11:30 +00:00
andri lim c5508b8dac
Bump nim-blscurve for gcc-14 compatibility (#2365)
* Bump nim-blscurve for gcc-14 compatibility

* Fix evm/blscurve.nim so that pointer usage works when using blscurve_abi
2024-06-15 17:34:07 +00:00
andri lim 27d710294b
Vm2Ctx -> VmCtx, Vm2Op -> VmOp (#2369)
The legacy EVM has been removed, so there is no longer a need to keep the Vm2 prefix.
2024-06-15 23:18:53 +07:00
Jacek Sieka 242bbf03fc
Light verification and storage mode for import (#2367)
When performing block import, we can batch state root verifications and
header checks, doing them only once per chunk of blocks, assuming that
the other blocks in the batch are valid by extension.

When we're not generating receipts, we can also skip per-transaction
state root computation pre-byzantium, which is what provides a ~20%
speedup in this PR, at least on those early blocks :)

We also stop storing transactions, receipts and uncles redundantly when
importing from era1 - there is no need to waste database storage on this
when we can load it from the era1 file (eventually).
2024-06-15 11:22:37 +02:00
andri lim 4c458190e9
Bump nim-kzg4844 and nimbus-eth2 for gcc-14 compatibility (#2357) 2024-06-14 21:41:59 +07:00
Jacek Sieka 68f462e3e4
avoid state root lookup when computing linear history (#2362)
State lookups potentially trigger expensive re-hashings - this is the
first of several steps to remove the unnecessary ones from the general
flow of block processing

* avoid re-reading parent block header from database when it's already
in memory
2024-06-14 15:56:56 +02:00
Jordan Hrycaj debba5a620
Coredb related clean up and maint fixes (#2360)
* Fix initialiser

why:
  Possible crash (app profiling, tracer etc.)

* Update column family options processing

why:
  Same for kvt as for aristo

* Move `AristoDbDualRocks` backend type to the test suite

why:
  So it is not available for production

* Fix typos in API jump table

why:
  Used for tracing and app profiling only. Needed some update

* Purged CoreDb legacy API

why:
  Not needed anymore; it was transitional and disabled.

* Rename `flush` argument to `eradicate` in a DB close context

why:
  The word `eradicate` leaves no doubt what is meant

* Rename `stoFlush()` -> `stoDelete()`

* Rename `core_apps_newapi` -> `core_apps` (not so new anymore)
2024-06-14 11:19:48 +00:00
andri lim 5784945650
Fix beacon_sync mapBodiesToHeader bug (#2355) 2024-06-14 17:41:58 +07:00
Jacek Sieka 0f53d11b52
Add startup banner (#2354)
Same as all other nimbuses
2024-06-14 10:08:44 +02:00
andri lim 5a18537450
Bump nim-eth, nim-web3, nimbus-eth2 (#2344)
* Bump nim-eth, nim-web3, nimbus-eth2

- Replace std.Option with results.Opt
- Field name changes

* More fixes

* Fix Portal stream async raises and portal testnet Opt usage

* Bump eth + nimbus-eth2 + more fixes related to eth_types changes

* Fix in utp test app and nimbus-eth2 bump

* Fix test_blockchain_json rebase conflict

* Fix EVMC block_timestamp conversion plus commentary

---------

Co-authored-by: kdeme <kim.demey@gmail.com>
2024-06-14 14:31:08 +07:00
andri lim 329a8f05bb
Add Cancun timestamp to MainNet preset (#2342)
* Add Cancun timestamp to MainNet preset

* Fix forkid test: add Cancun forkid
2024-06-14 05:29:09 +00:00
Jacek Sieka 189a20bbae
Avoid recomputing hashes when persisting data (#2350) 2024-06-14 07:10:00 +02:00
Jordan Hrycaj 5a5cc6295e
Triggered write event for kvt (#2351)
* bump rocksdb

* Rename `KVT` objects related to filters according to `Aristo` naming

details:
  filter* => delta*
  roFilter => balancer

* Compulsory error handling if `persistent()` fails

* Add return code to `reCentre()`

why:
  Might eventually fail if re-centring is blocked. Some logic will be
  added in subsequent patch sets.

* Add column families from earlier session to rocksdb in opening procedure

why:
  All previously used CFs must be declared when re-opening an existing
  database.

* Update `init()` and add rocksdb `reinit()` methods for changing parameters

why:
  Opening a set of column families (with different open options) must span
  at least the ones that are already on disk.

* Provide write-trigger-event interface into `Aristo` backend

why:
  This allows data from a guest application (think `KVT`) to get synced
  with the write cycle so that the guest and `Aristo` save everything
  atomically.

* Use `KVT` with new column family interface from `Aristo`

* Remove obsolete guest interface

* Implement `KVT` piggyback on `Aristo` backend

* CoreDb: Add separate `KVT`/`Aristo` backend mode for debugging

* Remove `rocks_db` import from `persist()` function

why:
  Some systems (in particular `fluffy` and friends) use the `Aristo` memory
  backend emulation and do not link against rocksdb when building the
  application. So this should fix that problem.
2024-06-13 18:15:11 +00:00