nimbus-eth2

Commit Graph

Author	SHA1	Message	Date
tersec	1819d79e07	avoid potential database inconsistency after fcU `INVALID`+crash (#4192 ) * avoid database race-condition inconsistency after fcU `INVALID` then crash * ensure head doesn't fall behind finalized; add more tests for head movement/reloading DAG	2022-09-28 21:07:31 +00:00
Jacek Sieka	b1bc830a92	Harden EpochRef loading against bogus block root at tail (#4178 ) * add more error information when things go wrong with database * lower log level when reloading attestations from no-block epoch start slot	2022-09-27 18:56:08 +02:00
tersec	0f6d19b4b3	implement v1.2.0 optimistic sync tests (#4174 ) * implement v1.2.0 optimistic sync tests * Update beacon_chain/consensus_object_pools/blockchain_dag.nim Co-authored-by: Etan Kissling <etan@status.im> * `lvh` -> `latestValidHash` and only invalidate one specific block" * `getEarliestInvalidRoot` -> `getEarliestInvalidBlockRoot`; `defaultEarliestInvalidRoot` -> `defaultEarliestInvalidBlockRoot` Co-authored-by: Etan Kissling <etan@status.im>	2022-09-27 15:11:47 +03:00
tersec	9750cd3a38	update state diffs to Bellatrix (#4177 )	2022-09-26 19:13:50 +00:00
tersec	3c03ba86c1	update consensus spec ref URLs to v1.2.0 (#4164 )	2022-09-23 07:56:06 +00:00
Etan Kissling	6069003a1f	fix check for attaching to pre-finalized parent (#4161 ) When the BN's head is reorged while shut down, reloading the BN will not assign `BlockRef` to alternate branches. However, blocks from other branches are still present in the database, leading to their descendants incorrectly marked as `UnviableFork`. By restricting the check to blocks that have been finalized, they should be reported as `MissingParent` instead, eventually re-assigning a `BlockRef` to them.	2022-09-22 18:33:26 +00:00
Jacek Sieka	f9a2860a61	log attestation/block when slashing protection is activated (#4148 )	2022-09-19 19:50:19 +00:00
tersec	e3750e96e8	fix order of current/previous dependent root in REST SSE (#4141 )	2022-09-19 13:28:52 +03:00
Jacek Sieka	ef8bab58eb	load suggested fee recipient file also when keymanager is disabled (#4078 ) Since these files may have been created in a previous run or manually, we want to keep loading them even on nodes that don't enable the keystore API (for example static setups) Other changes: * log keystore loading progressively (#3699) * print initial fee recipient when loading validators * log dynamic fee recipient updates	2022-09-17 08:30:07 +03:00
tersec	0410aec9d8	remove rest of `withState.state` usage (#4120 ) * remove rest of `withState.state` usage * remove scaffolding	2022-09-16 15:35:00 +02:00
tersec	80f44f4491	update consensus layer spec ref URLs to v1.2.0-rc.3 (#4117 )	2022-09-13 17:30:11 +00:00
tersec	8be964a152	update consensus layer spec ref URLs to v1.2.0-rc.3 (#4109 )	2022-09-10 17:16:38 +00:00
tersec	19bf460a3b	more `withState` `state` -> `forkyState` (#4104 )	2022-09-10 08:12:07 +02:00
tersec	1d620f0123	consensus spec URL updates to v1.2.0-rc.3 (#4105 )	2022-09-09 21:56:06 +00:00
tersec	eb791cfac8	avoid rewinds during syncing (#4093 )	2022-09-08 00:31:24 +00:00
tersec	cd46af17e9	handle INVALIDATED forkchoiceUpdated better (#4081 )	2022-09-07 22:54:37 +02:00
tersec	bf3a014287	more efficient forkchoiceUpdated usage (#4055 ) * more efficient forkchoiceUpdated usage * await rather than asyncSpawn; ensure head update before dag.updateHead * use action tracker rather than attached validators to check for next slot proposal; use wall slot + 1 rather than state slot + 1 to correctly check when missing blocks * re-add two-fcU case for when newPayload not VALID * check dynamicFeeRecipientsStore for potential proposal * remove duplicate checks for whether next proposer	2022-09-07 20:34:52 +02:00
tersec	776f09215c	only mark post-finalized blocks invalid (#4072 )	2022-09-06 11:43:19 +00:00
tersec	e183dccc7f	blockchain DAG and fork choice comment cleanup (#4070 )	2022-09-05 23:25:28 +00:00
tersec	301e5a919d	remove some Bellatrix-specific references (#4019 ) * remove some Bellatrix-specific references * remove more bellatrixData-dependencies	2022-09-03 20:56:20 +00:00
tersec	ad0d30093f	state/forkyState cleanup; spec URL updates; rm unused imports (#4052 )	2022-08-31 13:29:34 +02:00
tersec	9ae796daed	Cache and resend, rather than recreate, builder API registrations (#4040 )	2022-08-31 03:29:03 +03:00
Jacek Sieka	59092e5b3b	add some log data for fishy trusted attestations (#4049 )	2022-08-30 02:59:42 +00:00
Etan Kissling	574b84f96f	add REST endpoint for fork choice context (#4042 ) Implements a proposed REST endpoint for analyzing fork choice behaviour. See https://github.com/ethereum/beacon-APIs/pull/232	2022-08-29 22:02:29 +00:00
Etan Kissling	613f4a9a50	accelerate EL sync with LC with `--sync-light-client` (#4041 ) When the BN-embedded LC makes sync progress, pass the corresponding execution block hash to the EL via `engine_forkchoiceUpdatedV1`. This allows the EL to sync to wall slot while the chain DAG is behind. Renamed `--light-client` to `--sync-light-client` for clarity, and `--light-client-trusted-block-root` to `--trusted-block-root` for consistency with `nimbus_light_client`. Note that this does not work well in practice at this time: - Geth sticks to the optimistic sync: "Ignoring payload while snap syncing" (when passing the LC head) "Forkchoice requested unknown head" (when updating to LC head) - Nethermind syncs to LC head but does not report ancestors as VALID, so the main forward sync is still stuck in optimistic mode: "Pre-pivot block, ignored and returned Syncing" To aid EL client teams in fixing those issues, having this available as a hidden option is still useful.	2022-08-29 12:16:35 +00:00
Etan Kissling	994339c7ee	adjust checkpoint tracking for devnets (#4039 ) Track checkpoints more defensively on devnets with low participation.	2022-08-29 09:26:01 +02:00
tersec	b60456fdf3	`withState`: `state` -> `forkyState` (#4038 )	2022-08-26 22:47:40 +00:00
tersec	66a5e88203	allow accessing withState forky state via `forkyState` (#4026 )	2022-08-26 17:14:18 +03:00
Etan Kissling	64972e3c8a	set `safe_block_hash` to fork choice justified (#4010 ) Implements the fork choice safe block spec, where `safe_block_hash` in `forkChoiceUpdated` is set to justified (used to be `ZERO_HASH`). https://github.com/ethereum/consensus-specs/blob/v1.2.0-rc.3/fork_choice/safe-block.md#get_safe_execution_payload_hash	2022-08-25 23:34:02 +00:00
Etan Kissling	9180f09641	reduce LC optsync latency (#4002 ) The optimistic sync spec was updated since the LC based optsync module was introduced. It is no longer necessary to wait for the justified checkpoint to have execution enabled; instead, any block is okay to be optimistically imported to the EL client, as long as its parent block has execution enabled. Complex syncing logic has been removed, and the LC optsync module will now follow gossip directly, reducing the latency when using this module. Note that because this is now based on gossip instead of using sync manager / request manager, that individual blocks may be missed. However, EL clients should recover from this by fetching missing blocks themselves.	2022-08-25 03:53:59 +00:00
tersec	1d55743ebb	allow execution clients several seconds to construct blocks (#4012 )	2022-08-23 19:19:52 +03:00
Jacek Sieka	9e9db216c5	Harden block proposal against expired slashings/exits (#4013 ) * Harden block proposal against expired slashings/exits When a message is signed in a phase0 domain, it can no longer be validated under bellatrix due to the correct fork no longer being available in the `BeaconState`. To ensure that all slashing/exits are still valid, in this PR we re-run the checks in the state that we're proposing for, thus hardening against both signatures and other changes in the state that might have invalidated the message. * fix same message added multiple times in case of attestation slashing of multiple validators in one go	2022-08-23 18:30:46 +03:00
Etan Kissling	74dc388ad9	do not prune LC data by default (#4008 ) Aligns the default retention policy for LC data with the one for blocks. Minimum spec requirement for both blocks and LC data is ~5 months. Additional use cases are better supported by retaining data for longer.	2022-08-21 11:24:59 +02:00
tersec	c65eaca1bf	update spec ref URLs (#4005 )	2022-08-20 16:03:32 +00:00
Jacek Sieka	0d9fd54857	cache shuffling separately from other EpochRef data (fixes #2677 ) (#3990 ) In order to avoid full replays when validating attestations hailing from untaken forks, it's better to keep shufflings separate from `EpochRef` and perform a lookahead on the shuffling when processing the block that determines them. This also helps performance in the case where REST clients are trying to perform lookahead on attestation duties and decreases memory usage by sharing shufflings between EpochRef instances of the same dependent root.	2022-08-18 21:07:01 +03:00
tersec	3ad1d251ef	make newPayload/forkchoiceUpdated failures errors (#3989 )	2022-08-18 12:57:32 +00:00
tersec	c0f673dc09	spec ref URL updates: v1.2.0-rc.{1,2} for phase0/fork-choice altair/beacon-chain (#3986 )	2022-08-18 07:25:33 +00:00
Etan Kissling	5c8e58ea23	update LC spec references for v1.2.0-rc.2 (#3982 ) Updates light client spec references for latest spec (no more `vFuture`)	2022-08-17 19:47:06 +00:00
tersec	8274d5373b	update spec ref URLs (#3979 )	2022-08-17 11:33:19 +00:00
zah	dc50abbc90	Implement a missing ingnore rule for sync committee contributions (#3941 )	2022-08-09 12:52:11 +03:00
Etan Kissling	9c6a4316aa	document LC data serving options (#3922 ) Adds a documentation page for configuring LC data serving.	2022-08-02 12:23:03 +00:00
Miran	dfd4afc9f2	compatibility with Nim 1.4+ (#3888 )	2022-07-29 10:53:42 +00:00
Zahary Karadjov	9b081e524c	Merge branch 'stable' into unstable	2022-07-29 11:28:43 +03:00
Zahary Karadjov	64e791be66	Revert "avoid packing attestations from other forks (#3893 )" This reverts commit `5dcfb0c4e7`.	2022-07-27 20:14:40 +03:00
tersec	2f77f05a1a	optimistic block gossip validation (#3876 )	2022-07-21 21:39:43 +03:00
tersec	f4208cfb23	opportunistically even less async optimistic sync (#3880 )	2022-07-21 21:26:36 +03:00
Eugene Kabanov	c3d3397843	VC: doppelganger protection (#3877 ) * Improve fallback_service. * Improve logging in fallback_service. * Apply signal handling for all stages. * Fix some logging statements. * Add doppelganger REST api endpoint. Add some structures to VC. * Add client API call implementation. * Initial fix & refactor onceToAll() Add doppelganger service. Add doppelganger helpers. * Add doppelganger checks. * Move doppelganger log messages to higher levels. * Fix firstSuccess(). * Bump chronos. * Post rebase fixes. * Proper chronos bump. * Address review comments. * Attempt to fix finalization test issue. * Fix nimbus_signing_node. * Mark validators which are added at GENESIS_SLOT in GENESIS_EPOCH as passed doppelganger validation. * Do not send empty requests to server. * Fix log statement. * Address review comments and re-raise cancellations. Co-authored-by: zah <zahary@gmail.com>	2022-07-21 19:54:07 +03:00
Etan Kissling	5dcfb0c4e7	avoid packing attestations from other forks (#3893 ) When there is heavy forking, proposals may get missed due to including attestations from different forks that later fail verification. Checking attestation signatures when building blocks should fix this.	2022-07-21 14:04:56 +03:00
Etan Kissling	a6deacd878	allow driving EL with LC (#3865 ) Adds the `--web3-url` launch argument to `nimbus_light_client` to enable driving the EL with the optimistic head obtained from LC sync protocol. This will keep issuing `newPayload` / `forkChoiceUpdated` requests for new blocks, marking them as optimistic. `ZERO_HASH` is reported as the finalized block for now.	2022-07-14 04:07:40 +00:00
tersec	06c8e10ae2	move consensus_manager to consensus_object_pools (#3852 )	2022-07-13 14:13:54 +00:00
tersec	ce6cbd84e2	rename verifyFinalization internal flag to strictVerification (#3866 ) * rename verifyFinalization internal flag to strictVerification * Update beacon_chain/extras.nim Co-authored-by: Etan Kissling <etan@status.im> Co-authored-by: Etan Kissling <etan@status.im>	2022-07-13 13:48:09 +00:00
Jacek Sieka	b00eac7a50	dag: protect against finalized epoch moving before BlockRef cutoff (#3847 )	2022-07-07 14:24:31 +00:00
Jacek Sieka	e1830519a4	Introduce message router (#3829 ) Whether new blocks/attestations/etc are produced internally or received via REST, their journey through the node is the same - to ensure that they get the same treatment (logging, metrics, processing), this PR moves the routing to a dedicated module and fixes several small differences that existed before. * `xxxValidator` -> `processMessageName` - the processor also was adding messages to pools, so we want the name to reflect that action * add missing "sent" metrics for some messages * document ignore policy better - already-seen messages are not actaully rebroadcast by libp2p * skip redundant signature checks for internal validators consistently	2022-07-06 16:11:44 +00:00
Etan Kissling	2a2bcea70d	group justified and finalized `Checkpoint` (#3841 ) The justified and finalized `Checkpoint` are frequently passed around together. This introduces a new `FinalityCheckpoint` data structure that combines them into one. Due to the large usage of this structure in fork choice, also took this opportunity to update fork choice tests to the latest v1.2.0-rc.1 spec. Many additional tests enabled, some need more work, e.g. EL mock blocks. Also implemented `discard_equivocations` which was skipped in #3661, and improved code reuse across fork choice logic while at it.	2022-07-06 13:33:02 +03:00
Etan Kissling	aff53e962f	merge LC db into main BN db (#3832 ) * merge LC db into main BN db To treat derived LC data similar to derived state caches, merge it into the main beacon node DB. * shorten table names, group with lc prefix	2022-07-04 23:46:32 +03:00
tersec	1221bb66e8	optimistic sync (#3793 ) * optimistic sync * flag that initially loaded blocks from database might need execution block root filled in * return optimistic status in REST calls * refactor blockslot pruning * ensure beacon_blocks_by_{root,range} do not provide optimistic blocks * handle forkchoice head being pre-merge with block being postmerge * re-enable blocking head updates on validator duties * fix is_optimistic_candidate_block per spec; don't crash with nil future * fix is_optimistic_candidate_block per spec; don't crash with nil future * mark blocks sans execution payloads valid during head update	2022-07-04 23:35:33 +03:00
tersec	ba4d4c14db	fix Nim 1.6 deprecation and unused import warnings (#3834 )	2022-07-01 21:52:23 +00:00
Etan Kissling	499abd927f	persist LC data across restarts (#3823 ) * persist LC data across restarts With the Altair spec `LightClientUpdate` structure taking its final form it is finally possible to persist LC data across restarts without having to worry about data migration due to spec changes. A separate `lcdataV1` database is created in the `caches` subdirectory to hold known LC data. A full database with default settings (129 periods) uses <15 MB disk. * extend LC data DB rationale * wording * add `isSupportedBySQLite` helper and explicit return * remove redundant `return`	2022-06-30 13:04:39 +00:00
Jacek Sieka	c145916414	cleanups (#3819 ) * avoid circular panda imports * move deposit merkleization helpers to spec/ * normalize validator signature helpers to spec names / params * remove redundant functions for remote signing	2022-06-29 18:53:59 +02:00
Etan Kissling	bc1cc8f643	encapsulate LC config into one type (#3817 ) Separate LC initialization options from the main ChainDAGRef options to allow ChainDAGRef to treat them as opaque and reduce risk for conflicts when extending those options in the future.	2022-06-28 22:52:29 +02:00
Eugene Kabanov	d1581a2d8c	Fix proper timing check for bellatrix epoch. (#3807 )	2022-06-28 10:21:16 +00:00
Etan Kissling	e8e9ce1aab	introduce types for LC merkle proofs (#3808 ) Merkle proofs tend to have long underlying type definitions, e.g., `array[log2trunc(NEXT_SYNC_COMMITTEE_INDEX), Eth2Digest]`. For the ones used in the LC sync protocol, dedicated types are introduced to improve readability. Furthermore, the `CachedLightClientBootstrap` wrapper that solely wrapped a merkle branch is eliminated.	2022-06-28 07:52:23 +02:00
Etan Kissling	91d543440a	add option to configure max historic LC data periods (#3799 ) Adds a `--light-client-data-max-periods` option to override the number of sync committee periods to retain light client data. Raising it above the default enables archive nodes to serve full data. Lowering below the default speeds up import times (still no persistence)	2022-06-27 13:24:38 +02:00
Etan Kissling	aa1b8e4a17	bump nim-ssz-serialization to `3db6cc0f282708aca6c290914488edd832971d61` (#3119 ) This updates `nim-ssz-serialization` to `3db6cc0f282708aca6c290914488edd832971d61`. Notable changes: - Use `uint64` for `GeneralizedIndex` - Add support for building merkle multiproofs	2022-06-26 19:33:06 +02:00
Etan Kissling	2e98c7722f	encapsulate LC data variables into single structure (#3777 ) Combines the LC data configuration options (serve / importMode), the callbacks (finality / optimistic LC update) as well as the cache storing light client data, into a new `LightClientDataStore` structure. Also moves the structure into a light client specific file.	2022-06-24 16:57:50 +02:00
Jacek Sieka	347a485b5b	bearssl: split abi (#3755 )	2022-06-21 10:29:16 +02:00
Eugene Kabanov	eb6b7affee	Add the `execution_optimistic` flag to REST API responses. (#3780 ) * Initial commit * Make `events` API spec compliant. * Add `Eth-Consensus-Version` in responses. * Bump chronos to get redirect with headers working. * Add `is_optimistic` field and handling to syncing RestSyncInfo.	2022-06-20 08:53:39 +03:00
Etan Kissling	afcc5c2ea0	track LC data section that imported without errors (#3753 ) If database access errors are encountered while proccessing LC data, track the section which was accessed without errors so that the rest may be attempted to be re-indexed later.	2022-06-19 08:53:10 +03:00
tersec	8eb5d5de09	use ZERO_HASH for default(Eth2Digest)/Eth2Digest() in func calls (#3770 )	2022-06-18 04:57:37 +00:00
Etan Kissling	ac7393b8ac	remove unused `withStateVars` template (#3738 ) Removes the `withStateVars` template that was not used meaningfully.	2022-06-16 11:46:35 +02:00
Etan Kissling	20e646a47f	avoid casting types in LC data code (#3743 ) Use `asSigned()` for type safety over `isomorphicCast` in LC data code.	2022-06-14 23:33:18 +02:00
Etan Kissling	81ff20b3f0	use block ID vs full block in LC data caching (#3741 ) `cacheLightClientData` does not need full block data, pass just ID.	2022-06-14 22:13:00 +02:00
Etan Kissling	0c00b85782	cleanup LC data helpers (#3746 ) Use more general `lowSlot` in LC data helpers, and avoid using `earliestSlot` variable name as that one has a different meaning.	2022-06-14 22:02:03 +02:00
Etan Kissling	cba041ddfa	fix LC data import for Altair fork period (#3744 ) The initial sync committee period follows a different finality rule than the other ones. Instead of next sync committee finalizing as soon as the `finalizedHead.slot >= period.start_slot` have to use Altair start slot.	2022-06-14 17:31:10 +02:00
Etan Kissling	52ba4f7999	rename light client config parameters (#3740 ) For consistency with other options, use a common prefix for light client data configuration options. * `--serve-light-client-data` --> `--light-client-data-serve` * `--import-light-client-data` --> `--light-client-data-import-mode` No deprecation of the old identifiers as they were only sparingly used and all usage can be easily updated without interferance.	2022-06-14 12:03:39 +03:00
Etan Kissling	e3f0d2ecbc	remove unused `getExistingForkedBlock` overload (#3742 ) Removes an unused overload of a local LC data function.	2022-06-14 08:19:11 +00:00
tersec	aa4f105c0c	improve panda display (#3732 )	2022-06-11 00:48:04 +00:00
Etan Kissling	15967c4076	keep track of latest blocks for optimistic sync (#3715 ) When launched with `--light-client-enable` the latest blocks are fetched and optimistic candidate blocks are passed to a callback (log for now). This helps accelerate syncing in the future (optimistic sync).	2022-06-10 14:16:37 +00:00
tersec	65cecc50ca	cleanups: unused and duplicate imports, inconsistent naming conventions, URL updates (#3724 )	2022-06-09 14:30:13 +00:00
tersec	83793c3599	fix Nim 1.6 build deprecation warnings (#3712 )	2022-06-09 12:09:38 +03:00
Etan Kissling	72a46bd520	integrate light client into beacon node (#3557 ) Adds a `LightClient` instance to the beacon node as preparation to accelerate syncing in the future (optimistic sync). - `--light-client-enable` turns on the feature - `--light-client-trusted-block-root` configures block to start from If no block root is configured, light client tracks DAG `finalizedHead`.	2022-06-07 19:01:11 +02:00
tersec	38737549ac	refactor fork consistency checking and gate compilation on it (#3704 )	2022-06-04 19:15:15 +00:00
Dustin Brody	21200f4a64	fix false-positive in overlap between default {CAPELLA,SHARDING}_FORK_VERIONs	2022-06-04 14:52:03 +00:00
tersec	faf4d4a001	initial Capella support in RuntimeConfig (#3698 )	2022-06-03 14:42:40 +00:00
tersec	ea113fc420	disallow non-(genesis, far-future) equal transition epochs (#3691 )	2022-06-03 09:37:03 +00:00
tersec	ce143a1078	update CL spec URLs (#3690 )	2022-06-01 15:52:45 +00:00
tersec	f929980bf3	update 20 CL spec ref URLs (#3677 )	2022-05-31 11:15:31 +00:00
Etan Kissling	01efa93cf6	add light client (standalone) (#3653 ) Introduces a new library for syncing using libp2p based light client sync protocol, and adds a new `nimbus_light_client` executable that uses this library for syncing. The new executable emits log messages when new beacon block headers are received, and is integrated into testing.	2022-05-31 12:45:37 +02:00
Jacek Sieka	f31f52e24a	fix missing frontfill index (fixes #3658 ) (#3675 ) * fix key load duration log * log broken frontfill block root	2022-05-31 10:09:01 +02:00
Jacek Sieka	48f01186d6	fix unnecessary HashList/HashArray cache invalidation (#3660 ) * SSZ `[]` -> `mitem` * `[]` -> `item` immutable access via mutable instance cannot rely on template overloading, and `[]` cannot be a `func` because of special seq handling in compiler.	2022-05-30 13:30:42 +00:00
tersec	01534b0431	🐼 (#3670 ) * 🐼 * rm panda refs outside core module; preprocess text/ANSI artwork sources * credit artwork to beatscribe	2022-05-30 08:25:27 +00:00
tersec	b3d603f364	more CL spec URL updates to v1.2.0-rc.1 (#3657 )	2022-05-24 08:26:35 +00:00
tersec	c73239f60b	CL spec URL updates to v1.2.0-rc.1 (#3655 )	2022-05-23 19:30:24 +00:00
Etan Kissling	c808f17a37	update to latest light client libp2p protocol (#3623 ) Incorporates the latest changes to the light client sync protocol based on Devconnect AMS feedback. Note that this breaks compatibility with the previous prototype, due to changes to data structures and endpoints. See https://github.com/ethereum/consensus-specs/pull/2802	2022-05-23 14:02:54 +02:00
zah	a2ba34f686	Implement all sync committee duties in the validator client (#3583 ) Other changes: * logtrace can now verify sync committee messages and contributions * Many unnecessary use of pairs() have been removed for consistency * Map 40x BN response codes to BeaconNodeStatus.Incompatible in the VC	2022-05-10 10:03:40 +00:00
Jacek Sieka	011e0ca02f	era file verification (#3605 ) * era file verification Implement and document era file verification * era file states now come with block applied for easier verification * clarify conflicting version handling * document verification requirements * remove count from name, use start-era, end-root to discover range * remove obsolete todo * abstract out block root loading	2022-05-10 03:28:46 +03:00
tersec	61ba308e13	stylecheck fixes (#3593 )	2022-04-14 17:39:37 +02:00
tersec	ff6c581273	keep proposer boosting permanently enabled (#3565 )	2022-04-12 12:06:30 +02:00
Zahary Karadjov	def69e2a06	Revert "More sparse state snapshots in the Gnosis network" This reverts commit `557717b517`.	2022-04-11 13:56:42 +03:00
Zahary Karadjov	ac4e7723ea	Fix the build	2022-04-10 23:10:40 +03:00
Zahary Karadjov	557717b517	More sparse state snapshots in the Gnosis network	2022-04-09 18:07:36 +03:00
Jacek Sieka	f70ff38b53	enable `styleCheck:usages` (#3573 ) Some upstream repos still need fixes, but this gets us close enough that style hints can be enabled by default. In general, "canonical" spellings are preferred even if they violate nep-1 - this applies in particular to spec-related stuff like `genesis_validators_root` which appears throughout the codebase.	2022-04-08 16:22:49 +00:00
Jacek Sieka	5092fc41c7	use snappy-framed format for compressing bellatrix+ database entries (#3551 ) `.era` files and Req/Resp protocols use framed formats - aligning the database with these makes for less recompression work overall as gossip is sent only once while req/resp repeats (potentially) - this also allows efficient pruning-to-era where snappy-recompression is the major cycle thief.	2022-03-29 11:33:06 +00:00
tersec	9b43a76f2f	kiln beacon node (#3540 ) * kiln bn * use version of beacon_chain_db * have Eth1Monitor abstract more tightly over web3provider	2022-03-25 11:40:10 +00:00
Jacek Sieka	e009728858	work around Nim assignment bug that breaks state pruning (#3545 ) See https://github.com/nim-lang/Nim/issues/19613	2022-03-24 14:37:37 +00:00
Jacek Sieka	bc80ac3be1	harden REST API `atSlot` against non-finalized blocks (#3538 ) * harden validator API against pre-finalized slot requests * check `syncHorizon` when responding to validator api requests too far from `head` * limit state-id based requests to one epoch ahead of `head` * put historic data bounds on block/attestation/etc validator production API, preventing them from being used with already-finalized slots * add validator block smoke tests * make rest test create a new genesis with the tests running roughly in the first epoch to allow testing a few more boundary conditions	2022-03-23 12:42:16 +01:00
Jacek Sieka	4207b127f9	era: load blocks and states (#3394 ) * era: load blocks and states Era files contain finalized history and can be thought of as an alternative source for block and state data that allows clients to avoid syncing this information from the P2P network - the P2P network is then used to "top up" the client with the most recent data. They can be freely shared in the community via whatever means (http, torrent, etc) and serve as a permanent cold store of consensus data (and, after the merge, execution data) for history buffs and bean counters alike. This PR gently introduces support for loading blocks and states in two cases: block requests from rest/p2p and frontfilling when doing checkpoint sync. The era files are used as a secondary source if the information is not found in the database - compared to the database, there are a few key differences: * the database stores the block indexed by block root while the era file indexes by slot - the former is used only in rest, while the latter is used both by p2p and rest. * when loading blocks from era files, the root is no longer trivially available - if it is needed, it must either be computed (slow) or cached (messy) - the good news is that for p2p requests, it is not needed * in era files, "framed" snappy encoding is used while in the database we store unframed snappy - for p2p2 requests, the latter requires recompression while the former could avoid it * front-filling is the process of using era files to replace backfilling - in theory this front-filling could happen from any block and front-fills with gaps could also be entertained, but our backfilling algorithm cannot take advantage of this because there's no (simple) way to tell it to "skip" a range. * front-filling, as implemented, is a bit slow (10s to load mainnet): we load the full BeaconState for every era to grab the roots of the blocks - it would be better to partially load the state - as such, it would also be good to be able to partially decompress snappy blobs * lookups from REST via root are served by first looking up a block summary in the database, then using the slot to load the block data from the era file - however, there needs to be an option to create the summary table from era files to fully support historical queries To test this, `ncli_db` has an era file exporter: the files it creates should be placed in an `era` folder next to `db` in the data directory. What's interesting in particular about this setup is that `db` remains as the source of truth for security purposes - it stores the latest synced head root which in turn determines where a node "starts" its consensus participation - the era directory however can be freely shared between nodes / people without any (significant) security implications, assuming the era files are consistent / not broken. There's lots of future improvements to be had: * we can drop the in-memory `BlockRef` index almost entirely - at this point, resident memory usage of Nimbus should drop to a cool 500-600 mb * we could serve era files via REST trivially: this would drop backfill times to whatever time it takes to download the files - unlike the current implementation that downloads block by block, downloading an era at a time almost entirely cuts out request overhead * we can "reasonably" recreate detailed state history from almost any point in time, turning an O(slot) process into O(1) effectively - we'll still need caches and indices to do this with sufficient efficiency for the rest api, but at least it cuts the whole process down to minutes instead of hours, for arbitrary points in time * CI: ignore failures with Nim-1.6 (temporary) * test fixes Co-authored-by: Ștefan Talpalaru <stefantalpalaru@yahoo.com>	2022-03-23 09:58:17 +01:00
Jacek Sieka	361596c719	harden head update against missing parent (#3529 ) in case BlockRef ends up in some lifetime leak * fix duplicate head logging	2022-03-21 15:18:05 +01:00
Jacek Sieka	13fafe3a40	simplify unviable head pruning (#3528 ) Also note bug that exists that potentially prevents states from being pruned correctly	2022-03-21 09:20:26 +00:00
Etan Kissling	fd1ffd62dd	update light client server for DAG failure modes (#3514 ) Gracefully handles the new failure modes recently introduced to the DAG as part of https://github.com/status-im/nimbus-eth2/pull/3513 Data that is deemed to exist but fails to load leads to an error log to avoid suppressing logic errors accidentally. In `verifyFinalization` mode, the assertions remain active.	2022-03-20 11:58:59 +01:00
Etan Kissling	04b851f775	fix light client data pruning (#3523 ) When eliminating orphaned forks, light client data about blocks was also deleted when the orphaned fork was referring to a state several slots after the block. Linking light client data pruning with block deletion instead of state deletion fixes this problem. Light client data always refers to blocks and their immediate post-state.	2022-03-20 10:09:43 +01:00
Jacek Sieka	ea1acd7397	fix loading when finalized checkpoint slot is missing block (#3525 ) ref loop would stop one block early in this case - trying to load everything in one loop ends up being pretty confusing.. * simplify finalizedBlocks topup by splitting it from the head loop / query	2022-03-19 11:02:17 +00:00
Etan Kissling	637f1e2be6	simplify `computeEarliestLightClientSlot` (#3524 ) Combine DAG and LC import tails in `computeEarliestLightClientSlot`.	2022-03-19 09:58:55 +01:00
Etan Kissling	18bd6df1b4	fix light client data collection for checkpoint sync (#3498 ) When doing checkpoint sync, collecting light client data of known blocks and states incorrectly assumes that `finalized_checkpoint` information is also known. Hardens collection to only collect finalized checkpoint data after `dag.computeEarliestLightClientSlot`.	2022-03-18 15:47:53 +01:00
Jacek Sieka	d0223d1f28	fix finalized epoch ref loading on checkpoint start (#3517 ) regression from #3513 that did not take tail into consideration when loading epoch ancestor	2022-03-18 13:13:57 +01:00
Jacek Sieka	b3d80827fb	tns: checkpoint wal periodically while backfilling (#3516 ) Witout this, we end up with a massive .wal file that needs to be checkpointed on first startup (which takes a few minutes) - it's much more efficient to do smaller checkpoints, it turns out.	2022-03-18 12:32:20 +01:00
Jacek Sieka	05ffe7b2bf	Prune `BlockRef` on finalization (#3513 ) Up til now, the block dag has been using `BlockRef`, a structure adapted for a full DAG, to represent all of chain history. This is a correct and simple design, but does not exploit the linearity of the chain once parts of it finalize. By pruning the in-memory `BlockRef` structure at finalization, we save, at the time of writing, a cool ~250mb (or 25%:ish) chunk of memory landing us at a steady state of ~750mb normal memory usage for a validating node. Above all though, we prevent memory usage from growing proportionally with the length of the chain, something that would not be sustainable over time - instead, the steady state memory usage is roughly determined by the validator set size which grows much more slowly. With these changes, the core should remain sustainable memory-wise post-merge all the way to withdrawals (when the validator set is expected to grow). In-memory indices are still used for the "hot" unfinalized portion of the chain - this ensure that consensus performance remains unchanged. What changes is that for historical access, we use a db-based linear slot index which is cache-and-disk-friendly, keeping the cost for accessing historical data at a similar level as before, achieving the savings at no percievable cost to functionality or performance. A nice collateral benefit is the almost-instant startup since we no longer load any large indicies at dag init. The cost of this functionality instead can be found in the complexity of having to deal with two ways of traversing the chain - by `BlockRef` and by slot. * use `BlockId` instead of `BlockRef` where finalized / historical data may be required * simplify clearance pre-advancement * remove dag.finalizedBlocks (~50:ish mb) * remove `getBlockAtSlot` - use `getBlockIdAtSlot` instead * `parent` and `atSlot` for `BlockId` now require a `ChainDAGRef` instance, unlike `BlockRef` traversal * prune `BlockRef` parents on finality (~200:ish mb) * speed up ChainDAG init by not loading finalized history index * mess up light client server error handling - this need revisiting :)	2022-03-17 17:42:56 +00:00
Jacek Sieka	8a63efc413	move `BlockId` to `spec` (#3511 ) The spec implicitly talks about the slot of a block in several places, and keeping it readily available is useful in a number of context - might as well put this implicitly refereneced helper in the spec code directly	2022-03-16 16:00:18 +01:00
tersec	8fbcf29775	update unchanged specs/phase0/p2p-interface.md URL references from v1.1.9 to v1.1.10 (#3510 )	2022-03-16 10:40:35 +00:00
Jacek Sieka	c64bf045f3	remove StateData (#3507 ) One more step on the journey to reduce `BlockRef` usage across the codebase - this one gets rid of `StateData` whose job was to keep track of which block was last assigned to a state - these duties have now been taken over by `latest_block_root`, a fairly recent addition that computes this block root from state data (at a small cost that should be insignificant) 99% mechanical change.	2022-03-16 08:20:40 +01:00
Jacek Sieka	a3bd01b58d	move dependent root computations to `BeaconState` / `EpochRef` (#3478 ) * fewer deps on `BlockRef` traversal in anticipation of pruning * allows identifying EpochRef:s by their shuffling as a first step of * tighten error handling around missing blocks using the zero hash for signalling "missing block" is fragile and easy to miss - with checkpoint sync now, and pruning in the future, missing blocks become "normal".	2022-03-15 09:24:55 +01:00
Etan Kissling	ae408c279a	add option to collect light client data (#3474 ) Light clients require full nodes to serve additional data so that they can stay in sync with the network. This patch adds a new launch option `--import-light-client-data` to configure what data to make available. For now, data is only kept in memory; it is not persisted at this time. Note that data is only locally collected, a separate patch is needed to actually make it availble over the network. `--serve-light-client-data` will be used for serving data, but is not functional yet outside tests.	2022-03-11 21:28:10 +01:00
tersec	21b71bd29c	update URL and document Nim bug blocking further genericizing cleanups (#3483 )	2022-03-11 15:03:47 +00:00
Jacek Sieka	d0183ccd77	Historical state reindex for trusted node sync (#3452 ) When performing trusted node sync, historical access is limited to states after the checkpoint. Reindexing restores full historical access by replaying historical blocks against the state and storing snapshots in the database. The process can be initiated or resumed at any point in time.	2022-03-11 12:49:47 +00:00
Jacek Sieka	4363215a32	relax `BlockRef` database assumptions (#3472 ) * remove `getForkedBlock(BlockRef)` which assumes block data exists but doesn't support archive/backfilled blocks * fix REST `/eth/v1/beacon/headers` request not returning archive/backfilled blocks * avoid re-encoding in REST block SSZ requests (using `getBlockSSZ`)	2022-03-11 13:08:17 +01:00
Etan Kissling	8955edf158	allow using `BlockId` as key in tables (#3467 ) `BlockId` is a type that bundles a block root with its slot number. The type can be useful as key in tables that deal with non-finalized blocks (not uniquely identified by slot) and also support pruning (drop data about older blocks by slot). Instead of creating a custom type for those use cases, this patch suggests implementing `hash` for `BlockId` to re-use the existing type.	2022-03-07 14:56:58 +01:00
tersec	f0ada15dac	automated CL spec ref URL updates from v1.1.9 to v1.1.10 (#3455 )	2022-03-02 10:00:21 +00:00
Jacek Sieka	12ed537f75	catch wrong-fork-blocks earlier (#3444 ) Can't apply a phase0 block to a later phase state and vice versa. Since instantiation has been a topic, pre/post c file size: ``` 424K @mspec@sstate_transition.nim.c 892K @mspec@sstate_transition_block.nim.c ``` ``` 288K @mspec@sstate_transition.nim.c 880K @mspec@sstate_transition_block.nim.c ```	2022-02-28 12:58:34 +00:00
Jacek Sieka	40a4c01086	chaindag: don't keep backfill block table in memory (#3429 ) This PR names and documents the concept of the archive: a range of slots for which we have degraded functionality in terms of historical access - in particular: * we don't support rewinding to states in this range * we don't keep an in-memory representation of the block dag The archive de-facto exists in a trusted-node-synced node, but this PR gives it a name and drops the in-memory digest index. In order to satisfy `GetBlocksByRange` requests, we ensure that we have blocks for the entire archive period via backfill. Future versions may relax this further, adding a "pre-archive" period that is fully pruned. During by-slot searches in the archive (both for libp2p and rest requests), an extra database lookup is used to covert the given `slot` to a `root` - future versions will avoid this using era files which natively are indexed by `slot`. That said, the lookup is quite fast compared to the actual block loading given how trivial the table is - it's hard to measure, even. A collateral benefit of this PR is that checkpoint-synced nodes will see 100-200MB memory usage savings, thanks to the dropped in-memory cache - future pruning work will bring this benefit to full nodes as well. * document chaindag storage architecture and assumptions * look up parent using block id instead of full block in clearance (future-proofing the code against a future in which blocks come from era files) * simplify finalized block init, always writing the backfill portion to db at startup (to ensure lookups work as expected) * preallocate some extra memory for finalized blocks, to avoid immediate realloc	2022-02-26 19:16:19 +01:00
Jacek Sieka	92e7e288e7	Ignore seen aggregates (#3439 ) https://github.com/ethereum/consensus-specs/pull/2225 removed an ignore rule that would filter out duplicate aggregates from gossip publishing - however, this causes increased bandwidth and CPU usage as discussed in https://github.com/ethereum/consensus-specs/issues/2183 - the intent is to revert the removal and reinstate the rule. This PR implements ignore filtering which cuts down on CPU usage (fewer aggregates to validate) and bandwidth usage (less fanout of duplicates) - as #2225 points out, this may lead to a small increase in IHAVE messages.	2022-02-25 17:15:39 +01:00
tersec	7de3f00f35	generic putCorruptState; {Merge=>Bellatrix}BeaconStateNoImmutableValidators (#3427 )	2022-02-21 12:55:56 +01:00
Jacek Sieka	adfe655b16	db: make block loading generic (#3413 ) Streamline lookup with Forky and BeaconBlockFork (then we can do the same for era) We use type to avoid conditionals, as fork is often already known at a "higher" level. * load blockid before loading block by root - this is needed to map root to slot and will eventually be done via block summary table for "old" blocks Co-authored-by: tersec <tersec@users.noreply.github.com>	2022-02-21 09:48:02 +01:00
tersec	79761c78a4	proc -> func, mainly in spec/state transition and adjecent modules (#3405 )	2022-02-17 11:53:55 +00:00
tersec	5eecb9a21f	rename no{R=>r}eturn, no{I=>i}init, short{l=>L}og, E{T=>t}h2Node, Beacon{c=>C}hainDB (#3403 )	2022-02-16 23:24:44 +01:00
Jacek Sieka	7db5647a6e	clean up / document init (#3387 ) * clean up / document init * drop `immutable_validators` data (pre-altair) * document versions where data is first added * avoid needlessly loading genesis block data on startup * add a few more internal database consistency checks * remove duplicate state root lookup on state load * comment	2022-02-16 16:44:04 +01:00
tersec	873a8ec1e6	use isZeroMemory for Eth2Digest comparisons (#3386 ) * use isZeroMemory for Eth2Digest comparisons * use Eth2Digest.isZero abstraction	2022-02-14 05:26:19 +00:00
Jacek Sieka	40fe8f5336	fix missing backfill when restarting node When node is restarted before backfill has started but after some blocks have finalized with forward sync, we would not start the backfill. * also clean up one last `SomeSome`	2022-02-11 23:08:50 +02:00
tersec	d358299875	fork choice proposer boosting support (#3349 ) * fork choice proposer boosting support * detect nodeDelta underflow/overflow	2022-02-04 12:59:40 +01:00
Zahary Karadjov	215caa21ae	Eth1 monitor fixes * Fix a resource leak introduced in https://github.com/status-im/nimbus-eth2/pull/3279 * Don't restart the Eth1 syncing proggress from scratch in case of monitor failures during Eth2 syncing. * Switch to the primary operator as soon as it is back online. * Log the web3 credentials in fewer places Other changes: The 'web3 test' command has been enhanced to obtain and print more data regarding the selected provider.	2022-02-03 14:01:55 +02:00
tersec	8e6a920bf4	rename MERGE_FORK_EPOCH to BELLATRIX_FORK_EPOCH (#3350 ) * rename MERGE_FORK_EPOCH to BELLATRIX_FORK_EPOCH * fix REST test rules	2022-02-02 14:06:55 +01:00
Jacek Sieka	ff4f2a6b6c	better log on finalized slot failure	2022-02-01 21:23:18 +01:00
Jacek Sieka	3df9ffca9f	val-mon: remove redundant `_total` suffix from counters It turns out nim-metrics adds this suffix on its own - it also turns out some of the names are non-conventional and need follow-up.	2022-01-31 18:51:24 +02:00
Jacek Sieka	ad327a8769	Fix counters in validator monitor totals mode (#3332 ) The current counters set gauges etc to the value of the _last_ validator to be processed - as the name of the feature implies, we should be using sums instead. * fix missing beacon state metrics on startup, pre-first-head-selection * fix epoch metrics not being updated on cross-epoch reorg	2022-01-31 08:36:29 +01:00
Jacek Sieka	d583e8e4ac	Store finalized block roots in database (3s startup) (#3320 ) * Store finalized block roots in database (3s startup) When the chain has finalized a checkpoint, the history from that point onwards becomes linear - this is exploited in `.era` files to allow constant-time by-slot lookups. In the database, we can do the same by storing finalized block roots in a simple sparse table indexed by slot, bringing the two representations closer to each other in terms of conceptual layout and performance. Doing so has a number of interesting effects: * mainnet startup time is improved 3-5x (3s on my laptop) * the _first_ startup might take slightly longer as the new index is being built - ~10s on the same laptop * we no longer rely on the beacon block summaries to load the full dag - this is a lot faster because we no longer have to look up each block by parent root * a collateral benefit is that we no longer need to load the full summaries table into memory - we get the RSS benefits of #3164 without the CPU hit. Other random stuff: * simplify forky block generics * fix withManyWrites multiple evaluation * fix validator key cache not being updated properly in chaindag read-only mode * drop pre-altair summaries from `kvstore` * recreate missing summaries from altair+ blocks as well (in case database has lost some to an involuntary restart) * print database startup timings in chaindag load log * avoid allocating superfluos state at startup * use a recursive sql query to load the summaries of the unfinalized blocks	2022-01-30 18:51:04 +02:00
tersec	29e2169585	phase 0 & altair beacon chain and altair validator spec URL updates (#3339 )	2022-01-29 13:53:31 +00:00
tersec	89ffa8a1a7	spec URL & copyright year update (#3338 )	2022-01-29 01:05:39 +00:00
Jacek Sieka	e264276b36	keep unviables in quarantine (#3331 ) they remain unviable even after a reorg	2022-01-28 11:59:55 +01:00
tersec	2b4a960270	rename On{Merge,Bellatrix}BlockAdded and Rollback{Merge,Bellatrix}HashedProc (#3321 )	2022-01-26 13:21:29 +01:00
Jacek Sieka	f70aceef37	Harden handling of unviable forks (#3312 ) * Harden handling of unviable forks In our current handling of unviable forks, we allow peers to send us blocks that come from a different fork - this is not necessarily an error as it can happen naturally, but it does open up the client to a case where the same unviable fork keeps getting requested - rather than allowing this to happen, we'll now give these peers a small negative score - if it keeps happening, we'll disconnect them. * keep track of unviable forks in quarantine, to avoid filling it with known junk * collect peer scores in single module * descore peers when they send unviable blocks during sync * don't give score for duplicate blocks * increase quarantine size to a level that allows finality to happen under optimal conditions - this helps avoid downloading the same blocks over and over in case of an unviable fork * increase initial score for new peers to make room for one more failure before disconnection * log and score invalid/unviable blocks in requestmanager too * avoid ChainDAG dependency in quarantine * reject gossip blocks with unviable parent * continue processing unviable sync blocks in order to build unviable dag * docs * Update beacon_chain/consensus_object_pools/block_pools_types.nim * add unviable queue test	2022-01-26 13:20:08 +01:00
Jacek Sieka	d076e1a11b	ncli_db: import states and blocks from era file (#3313 )	2022-01-25 09:28:26 +01:00

1 2 3 4 5 ...

418 Commits