mirror of https://github.com/status-im/nimbus-eth1.git synced 2025-02-10 19:17:16 +00:00

History

* Imported/rebase from `no-ext`, PR #2485

  Store extension nodes together with the branch

  Extension nodes must be followed by a branch - as such, it makes sense
  to store the two together both in the database and in memory:

  * fewer reads, writes and updates to traverse the tree
  * simpler logic for maintaining the node structure
  * less space used, both memory and storage, because there are fewer
    nodes overall

  There is also a downside: hashes can no longer be cached for an
  extension - instead, only the extension+branch hash can be cached - this
  seems like a fine tradeoff since computing it should be fast.

  TODO: fix commented code

* Fix merge functions and `toNode()`

* Update `merkleSignCommit()` prototype

why:
  Result is always a 32bit hash

* Update short Merkle hash key generation

details:
  Ethereum reference MPTs use Keccak hashes as node links if the size of
  an RLP encoded node is at least 32 bytes. Otherwise, the RLP encoded
  node value is used as a pseudo node link (rather than a hash.) This is
  specified in the yellow paper, appendix D.

  Different to the `Aristo` implementation, the reference MPT would not
  store such a node on the key-value database. Rather the RLP encoded node value is stored instead of a node link in a parent node
  is stored as a node link on the parent database.

  Only for the root hash, the top level node is always referred to by the
  hash.

* Fix/update `Extension` sections

why:
  Were commented out after removal of a dedicated `Extension` type which
  left the system disfunctional.

* Clean up unused error codes

* Update unit tests

* Update docu

---------

Co-authored-by: Jacek Sieka <jacek@status.im>

2024-07-16 19:47:59 +00:00

aristo

No ext update (#2494 )

2024-07-16 19:47:59 +00:00

core_db

No ext update (#2494 )

2024-07-16 19:47:59 +00:00

era1_db

Consolidate block type for block processing (#2325 )

2024-06-09 16:32:20 +02:00

kvt

Separate config for core db and ledger (#2479 )

2024-07-12 13:12:25 +00:00

ledger

avoid re-writing code (#2490 )

2024-07-15 15:02:23 +02:00

.gitignore

Database architecture diagram & module overview (#2065 )

2024-03-08 18:42:46 +00:00

access_list.nim

avoid initTable (#2328 )

2024-06-10 11:05:30 +02:00

aristo.nim

Core db reorg (#2444 )

2024-07-03 15:50:27 +00:00

core_db.nim

Cleanup unused raises in evm/state and other obsolete informations (#2243 )

2024-05-30 09:03:54 +00:00

era1_db.nim

era: simplify, instant startup (#2218 )

2024-05-26 08:24:13 +02:00

kvstore_rocksdb.nim

Bump RocksDb version and enable autoClose on opt types to prevent memory leaks (#2427 )

2024-07-02 13:44:09 +08:00

kvt.nim

Core db reorg (#2444 )

2024-07-03 15:50:27 +00:00

ledger.nim

Separate constructor helpers for core db and ledger (#2480 )

2024-07-12 19:32:31 +00:00

opts.nim

Fix memory usage spikes during sync, give memory to rocksdb (#2413 )

2024-06-25 13:39:53 +02:00

README.md

Aristo resume off line syncing on pre loaded database (#2203 )

2024-05-22 13:41:14 +00:00

storage_types.nim

don't rewrite hash->slot map (#2463 )

2024-07-09 17:25:43 +02:00

transient_storage.nim

avoid initTable (#2328 )

2024-06-10 11:05:30 +02:00

README.md

Nimbus-eth1 -- Ethereum execution layer database architecture

Last update: 2024-03-08

The following diagram gives a simplified view how components relate with regards to the data storage management.

An arrow between components a and b (as in a->b) is meant to be read as a relies directly on b, or a is served by b. For classifying the functional type of a component in the below diagram, the abstraction type is enclosed in brackets after the name of a component.

(application)
This is a group of software modules at the top level of the hierarchy. In the diagram below, the EVM is used as an example. Another application might be the RPC service.
(API)
The API classification is used for a thin software layer hiding a set of different drivers where only one driver is active for the same API instance. It servers as sort of a logical switch.
(concentrator)
The concentrator merges several sub-module instances and provides their collected services as a single unified instance. There is not much additional logic implemented besides what the sub-modules provide.
(driver)
The driver instances are sort of the lower layer workhorses. The implement logic for solving a particular problem, providing a typically well defined service, etc.

(engine)
This is a bottom level driver in the below diagram.

                       +-------------------+
                       | EVM (application) |
                       +-------------------+
                               |     |
                               v     |
   +-----------------------------+   |
   |   State DB (concentrator)   |   |
   +-----------------------------+   |
       |                       |     |
       v                       |     |
   +------------------------+  |     |
   |      Ledger (API)      |  |     |
   +------------------------+  |     |
       |              |        |     |
       v              |        |     |
   +--------------+   |        |     |
   | ledger cache |   |        |     |
   |   (driver)   |   |        |     |
   +--------------+   |        |     |
       |              v        |     |
       |   +----------------+  |     |
       |   |   Common       |  |     |
       |   | (concentrator) |  |     |
       |   +----------------+  |     |
       |             |         |     |
       v             v         v     v
   +---------------------------------------+
   |               Core DB (API)           |
   +---------------------------------------+
                     |
                     v
   +---------------------------------------+
   |    Aristo DB (driver,concentrator)    |
   +---------------------------------------+
             |             |
             v             v
   +--------------+  +---------------------+
   | Kvt (driver) |  | Aristo MPT (driver) |
   +--------------+  +---------------------+
             |             |
             v             v
   +---------------------------------------+
   |         Rocks DB (engine)             |
   +---------------------------------------+

Here is a list of path references for the components with some explanation. The sources for the components are not always complete but indicate the main locations where to start looking at.

Aristo DB (driver)
- Sources:
  ./nimbus/db/core_db/backend/aristo_*
- Synopsis:
  Combines both, the Kvt and the Aristo driver sub-modules providing an interface similar to the legacy DB (concentrator) module.
Aristo MPT (driver)
- Sources:
  ./nimbus/db/aristo*
- Synopsis:
  Revamped implementation of a hexary Merkle Patricia Tree.
Common (concentrator)
- Sources:
  ./nimbus/common*
- Synopsis:
  Collected information for running block chain execution layer applications.
Core DB (API)
- Sources:
  ./nimbus/db/core_db*
- Synopsis:
  Database abstraction layer. Unless for legacy applications, there should be no need to reach out to the layers below.
EVM (application)
- Sources:
  ./nimbus/core/executor/* ./nimbus/evm/*
- Synopsis:
  An implementation of the Ethereum Virtual Machine.
Hexary DB (driver)
- Sources:
  ./vendor/nim-eth/eth/trie/hexary.nim
- Synopsis:
  Implementation of an MPT, see compact Merkle Patricia Tree.
Key-value table (driver)
- Sources:
  ./vendor/nim-eth/eth/trie/db.nim
- Synopsis:
  Key value table interface to be used directly for key-value storage or by the Hexary DB (driver) module for storage. Some magic is applied in order to treat hexary data accordingly (based on key length.)
Kvt (driver)
- Sources:
  ./nimbus/db/kvt*
- Synopsis:
  Key value table interface for the Aristo DB (driver) module. Contrary to the Key-value table (driver), it is not used for MPT data.
Ledger (API)
- Sources:
  ./nimbus/db/ledger*
- Synopsis:
  Abstraction layer for either the legacy cache (driver) accounts cache (which works with the legacy DB (driver) backend only) or the ledger cache (driver) re-write which is supposed to work with all Core DB (API) backends.
ledger cache (driver)
- Sources:
  ./nimbus/db/ledger/accounts_ledger.nim
  ./nimbus/db/ledger/backend/accounts_ledger*
  ./nimbus/db/ledger/distinct_ledgers.nim
- Synopsis:
  Management of accounts and storage data. This is a re-write of the legacy DB (driver) which is supposed to work with all Core DB (API) backends.
legacy DB (concentrator)
- Sources:
  ./nimbus/db/core_db/backend/legacy_*
- Synopsis:
  Legacy database abstraction. It mostly forwards requests directly to the to the Key-value table (driver) and/or the hexary DB (driver).
Rocks DB (engine)
- Sources:
  ./vendor/nim-rocksdb/*
- Synopsis:
  Persistent storage engine.
State DB (concentrator)
- Sources:
  ./nimbus/evm/state.nim
  ./nimbus/evm/types.nim
- Synopsis:
  Integrated collection of modules and methods relevant for the EVM.