nimbus-eth2

Commit Graph

Author	SHA1	Message	Date
Etan Kissling	aff53e962f	merge LC db into main BN db (#3832 ) * merge LC db into main BN db To treat derived LC data similar to derived state caches, merge it into the main beacon node DB. * shorten table names, group with lc prefix	2022-07-04 23:46:32 +03:00
Etan Kissling	499abd927f	persist LC data across restarts (#3823 ) * persist LC data across restarts With the Altair spec `LightClientUpdate` structure taking its final form it is finally possible to persist LC data across restarts without having to worry about data migration due to spec changes. A separate `lcdataV1` database is created in the `caches` subdirectory to hold known LC data. A full database with default settings (129 periods) uses <15 MB disk. * extend LC data DB rationale * wording * add `isSupportedBySQLite` helper and explicit return * remove redundant `return`	2022-06-30 13:04:39 +00:00
Jacek Sieka	c145916414	cleanups (#3819 ) * avoid circular panda imports * move deposit merkleization helpers to spec/ * normalize validator signature helpers to spec names / params * remove redundant functions for remote signing	2022-06-29 18:53:59 +02:00
Etan Kissling	bc1cc8f643	encapsulate LC config into one type (#3817 ) Separate LC initialization options from the main ChainDAGRef options to allow ChainDAGRef to treat them as opaque and reduce risk for conflicts when extending those options in the future.	2022-06-28 22:52:29 +02:00
Etan Kissling	e8e9ce1aab	introduce types for LC merkle proofs (#3808 ) Merkle proofs tend to have long underlying type definitions, e.g., `array[log2trunc(NEXT_SYNC_COMMITTEE_INDEX), Eth2Digest]`. For the ones used in the LC sync protocol, dedicated types are introduced to improve readability. Furthermore, the `CachedLightClientBootstrap` wrapper that solely wrapped a merkle branch is eliminated.	2022-06-28 07:52:23 +02:00
Etan Kissling	91d543440a	add option to configure max historic LC data periods (#3799 ) Adds a `--light-client-data-max-periods` option to override the number of sync committee periods to retain light client data. Raising it above the default enables archive nodes to serve full data. Lowering below the default speeds up import times (still no persistence)	2022-06-27 13:24:38 +02:00
Etan Kissling	aa1b8e4a17	bump nim-ssz-serialization to `3db6cc0f282708aca6c290914488edd832971d61` (#3119 ) This updates `nim-ssz-serialization` to `3db6cc0f282708aca6c290914488edd832971d61`. Notable changes: - Use `uint64` for `GeneralizedIndex` - Add support for building merkle multiproofs	2022-06-26 19:33:06 +02:00
Etan Kissling	2e98c7722f	encapsulate LC data variables into single structure (#3777 ) Combines the LC data configuration options (serve / importMode), the callbacks (finality / optimistic LC update) as well as the cache storing light client data, into a new `LightClientDataStore` structure. Also moves the structure into a light client specific file.	2022-06-24 16:57:50 +02:00
Etan Kissling	afcc5c2ea0	track LC data section that imported without errors (#3753 ) If database access errors are encountered while proccessing LC data, track the section which was accessed without errors so that the rest may be attempted to be re-indexed later.	2022-06-19 08:53:10 +03:00
Etan Kissling	ac7393b8ac	remove unused `withStateVars` template (#3738 ) Removes the `withStateVars` template that was not used meaningfully.	2022-06-16 11:46:35 +02:00
Etan Kissling	20e646a47f	avoid casting types in LC data code (#3743 ) Use `asSigned()` for type safety over `isomorphicCast` in LC data code.	2022-06-14 23:33:18 +02:00
Etan Kissling	81ff20b3f0	use block ID vs full block in LC data caching (#3741 ) `cacheLightClientData` does not need full block data, pass just ID.	2022-06-14 22:13:00 +02:00
Etan Kissling	0c00b85782	cleanup LC data helpers (#3746 ) Use more general `lowSlot` in LC data helpers, and avoid using `earliestSlot` variable name as that one has a different meaning.	2022-06-14 22:02:03 +02:00
Etan Kissling	cba041ddfa	fix LC data import for Altair fork period (#3744 ) The initial sync committee period follows a different finality rule than the other ones. Instead of next sync committee finalizing as soon as the `finalizedHead.slot >= period.start_slot` have to use Altair start slot.	2022-06-14 17:31:10 +02:00
Etan Kissling	52ba4f7999	rename light client config parameters (#3740 ) For consistency with other options, use a common prefix for light client data configuration options. * `--serve-light-client-data` --> `--light-client-data-serve` * `--import-light-client-data` --> `--light-client-data-import-mode` No deprecation of the old identifiers as they were only sparingly used and all usage can be easily updated without interferance.	2022-06-14 12:03:39 +03:00
Etan Kissling	e3f0d2ecbc	remove unused `getExistingForkedBlock` overload (#3742 ) Removes an unused overload of a local LC data function.	2022-06-14 08:19:11 +00:00
Etan Kissling	72a46bd520	integrate light client into beacon node (#3557 ) Adds a `LightClient` instance to the beacon node as preparation to accelerate syncing in the future (optimistic sync). - `--light-client-enable` turns on the feature - `--light-client-trusted-block-root` configures block to start from If no block root is configured, light client tracks DAG `finalizedHead`.	2022-06-07 19:01:11 +02:00
Etan Kissling	c808f17a37	update to latest light client libp2p protocol (#3623 ) Incorporates the latest changes to the light client sync protocol based on Devconnect AMS feedback. Note that this breaks compatibility with the previous prototype, due to changes to data structures and endpoints. See https://github.com/ethereum/consensus-specs/pull/2802	2022-05-23 14:02:54 +02:00
Jacek Sieka	f70ff38b53	enable `styleCheck:usages` (#3573 ) Some upstream repos still need fixes, but this gets us close enough that style hints can be enabled by default. In general, "canonical" spellings are preferred even if they violate nep-1 - this applies in particular to spec-related stuff like `genesis_validators_root` which appears throughout the codebase.	2022-04-08 16:22:49 +00:00
Etan Kissling	fd1ffd62dd	update light client server for DAG failure modes (#3514 ) Gracefully handles the new failure modes recently introduced to the DAG as part of https://github.com/status-im/nimbus-eth2/pull/3513 Data that is deemed to exist but fails to load leads to an error log to avoid suppressing logic errors accidentally. In `verifyFinalization` mode, the assertions remain active.	2022-03-20 11:58:59 +01:00
Etan Kissling	637f1e2be6	simplify `computeEarliestLightClientSlot` (#3524 ) Combine DAG and LC import tails in `computeEarliestLightClientSlot`.	2022-03-19 09:58:55 +01:00
Etan Kissling	18bd6df1b4	fix light client data collection for checkpoint sync (#3498 ) When doing checkpoint sync, collecting light client data of known blocks and states incorrectly assumes that `finalized_checkpoint` information is also known. Hardens collection to only collect finalized checkpoint data after `dag.computeEarliestLightClientSlot`.	2022-03-18 15:47:53 +01:00
Jacek Sieka	05ffe7b2bf	Prune `BlockRef` on finalization (#3513 ) Up til now, the block dag has been using `BlockRef`, a structure adapted for a full DAG, to represent all of chain history. This is a correct and simple design, but does not exploit the linearity of the chain once parts of it finalize. By pruning the in-memory `BlockRef` structure at finalization, we save, at the time of writing, a cool ~250mb (or 25%:ish) chunk of memory landing us at a steady state of ~750mb normal memory usage for a validating node. Above all though, we prevent memory usage from growing proportionally with the length of the chain, something that would not be sustainable over time - instead, the steady state memory usage is roughly determined by the validator set size which grows much more slowly. With these changes, the core should remain sustainable memory-wise post-merge all the way to withdrawals (when the validator set is expected to grow). In-memory indices are still used for the "hot" unfinalized portion of the chain - this ensure that consensus performance remains unchanged. What changes is that for historical access, we use a db-based linear slot index which is cache-and-disk-friendly, keeping the cost for accessing historical data at a similar level as before, achieving the savings at no percievable cost to functionality or performance. A nice collateral benefit is the almost-instant startup since we no longer load any large indicies at dag init. The cost of this functionality instead can be found in the complexity of having to deal with two ways of traversing the chain - by `BlockRef` and by slot. * use `BlockId` instead of `BlockRef` where finalized / historical data may be required * simplify clearance pre-advancement * remove dag.finalizedBlocks (~50:ish mb) * remove `getBlockAtSlot` - use `getBlockIdAtSlot` instead * `parent` and `atSlot` for `BlockId` now require a `ChainDAGRef` instance, unlike `BlockRef` traversal * prune `BlockRef` parents on finality (~200:ish mb) * speed up ChainDAG init by not loading finalized history index * mess up light client server error handling - this need revisiting :)	2022-03-17 17:42:56 +00:00
Jacek Sieka	c64bf045f3	remove StateData (#3507 ) One more step on the journey to reduce `BlockRef` usage across the codebase - this one gets rid of `StateData` whose job was to keep track of which block was last assigned to a state - these duties have now been taken over by `latest_block_root`, a fairly recent addition that computes this block root from state data (at a small cost that should be insignificant) 99% mechanical change.	2022-03-16 08:20:40 +01:00
Jacek Sieka	a3bd01b58d	move dependent root computations to `BeaconState` / `EpochRef` (#3478 ) * fewer deps on `BlockRef` traversal in anticipation of pruning * allows identifying EpochRef:s by their shuffling as a first step of * tighten error handling around missing blocks using the zero hash for signalling "missing block" is fragile and easy to miss - with checkpoint sync now, and pruning in the future, missing blocks become "normal".	2022-03-15 09:24:55 +01:00
Etan Kissling	ae408c279a	add option to collect light client data (#3474 ) Light clients require full nodes to serve additional data so that they can stay in sync with the network. This patch adds a new launch option `--import-light-client-data` to configure what data to make available. For now, data is only kept in memory; it is not persisted at this time. Note that data is only locally collected, a separate patch is needed to actually make it availble over the network. `--serve-light-client-data` will be used for serving data, but is not functional yet outside tests.	2022-03-11 21:28:10 +01:00

26 Commits