nimbus-eth2

Commit Graph

Author	SHA1	Message	Date
Jacek Sieka	d839b9d07e	State-only checkpoint state startup (#4251 ) Currently, we require genesis and a checkpoint block and state to start from an arbitrary slot - this PR relaxes this requirement so that we can start with a state alone. The current trusted-node-sync algorithm works by first downloading blocks until we find an epoch aligned non-empty slot, then downloads the state via slot. However, current [proposals](https://github.com/ethereum/beacon-APIs/pull/226) for checkpointing prefer finalized state as the main reference - this allows more simple access control and caching on the server side - in particular, this should help checkpoint-syncing from sources that have a fast `finalized` state download (like infura and teku) but are slow when accessing state via slot. Earlier versions of Nimbus will not be able to read databases created without a checkpoint block and genesis. In most cases, backfilling makes the database compatible except where genesis is also missing (custom networks). * backfill checkpoint block from libp2p instead of checkpoint source, when doing trusted node sync * allow starting the client without genesis / checkpoint block * perform epoch start slot lookahead when loading tail state, so as to deal with the case where the epoch start slot does not have a block * replace `--blockId` with `--state-id` in TNS command line * when replaying, also look at the parent of the last-known-block (even if we don't have the parent block data, we can still replay from a "parent" state) - in particular, this clears the way for implementing state pruning * deprecate `--finalized-checkpoint-block` option (no longer needed)	2022-11-02 10:02:38 +00:00
Etan Kissling	aff9147c31	avoid packing attestations from other forks (#4273 ) * avoid packing attestations from other forks Revisit #3893 using method based on Lighthouse (less heavy computation). * fix comment	2022-11-01 14:23:40 +02:00
tersec	0cfc1b776e	add all missing epoch transition tests (#4269 )	2022-10-28 08:02:33 +00:00
tersec	48e351d672	run EF Capella consensus test fixtures (#4267 )	2022-10-27 19:11:13 +00:00
tersec	7dd5c49c4e	use v1.3.0-alpha.0 test vectors (#4263 )	2022-10-27 11:54:39 +00:00
tersec	06ccf5b80c	capella test vector support (#4261 )	2022-10-27 06:29:24 +00:00
tersec	f9830836a9	deprecate --terminal-total-difficulty-override; remove launch script for deprecated ropsten (#4241 ) * deprecate --terminal-total-difficulty-override; remove launch script for deprecated ropsten * remove Makefile support for Ropsten	2022-10-24 23:32:52 +03:00
tersec	fb6e6d9cf4	remove `newPayload` from block production flow (#4186 ) * remove `newPayload` from block production flow * refactor block_processor to run `newPayload` as part of `storeBlock`	2022-10-14 22:48:56 +03:00
Jacek Sieka	819442acc3	Allow chain dag without genesis / block (#4230 ) * Allow chain dag without genesis / block This PR enables the initialization of the dag without access to blocks or genesis state - it is a prerequisite for implementing a number of interesting features: * checkpoint sync without any block download * pruning of blocks and states * backfill checkpoint block	2022-10-14 22:40:10 +03:00
Jacek Sieka	40bed02f60	Build block in parallel with attestation packing (#4185 ) * fix block proposal in first slot after checkpoint	2022-10-04 11:24:16 +00:00
tersec	ad7541567c	move LVH handling to tests/; increase maximum fork choice retries (#4205 )	2022-10-03 13:10:08 +00:00
tersec	0a4aa5fdb3	switch `withStateAndBlck` usage to `forkyState`; rm unused `tests/mocking/` modules (#4206 )	2022-10-03 13:08:50 +00:00
Jacek Sieka	af9ec577d0	nicer error message for failed backfill (#4188 ) * nicer error message for failed backfill Many checkpoint sources don't support block download * RestGenericError -> RestErrorMessage ...and other assorted fixes to bring rest types closer to spec * fix tests	2022-09-29 23:55:18 +03:00
Etan Kissling	5968ed586b	use LRU strategy for shuffling/epoch caches (#4196 ) When EL `newPayload` is slow (e.g., Raspberry Pi with Besu), the epoch and shuffling caches tend to fill up with multiple copies per epoch when processing gossip and performing validator duties close to wall slot. The old strategy of evicting oldest epoch led to the same item being evicted over and over, leading to blocking of over 5 minutes in extreme cases where alternate epochs/shuffling got loaded repeatedly. Changing the cache eviction strategy to least-recently-used seems to improve the situation drastically. A simple implementation was selected based on single linked-list without a hashtable.	2022-09-29 14:55:58 +00:00
tersec	c367b14ad9	deprecate `--safe-slots-to-import-optimistically` (#4182 )	2022-09-29 06:29:49 +00:00
tersec	1819d79e07	avoid potential database inconsistency after fcU `INVALID`+crash (#4192 ) * avoid database race-condition inconsistency after fcU `INVALID` then crash * ensure head doesn't fall behind finalized; add more tests for head movement/reloading DAG	2022-09-28 21:07:31 +00:00
Eugene Kabanov	8778e1cf8d	Fix REST generic error parsing. (#4189 ) * Fix REST generic error parser. * Unescape test vectors. * Fix RestGenericError writer and tests, to encode `code` as `Number`.	2022-09-28 18:47:15 +00:00
Jacek Sieka	b1bc830a92	Harden EpochRef loading against bogus block root at tail (#4178 ) * add more error information when things go wrong with database * lower log level when reloading attestations from no-block epoch start slot	2022-09-27 18:56:08 +02:00
tersec	0f6d19b4b3	implement v1.2.0 optimistic sync tests (#4174 ) * implement v1.2.0 optimistic sync tests * Update beacon_chain/consensus_object_pools/blockchain_dag.nim Co-authored-by: Etan Kissling <etan@status.im> * `lvh` -> `latestValidHash` and only invalidate one specific block" * `getEarliestInvalidRoot` -> `getEarliestInvalidBlockRoot`; `defaultEarliestInvalidRoot` -> `defaultEarliestInvalidBlockRoot` Co-authored-by: Etan Kissling <etan@status.im>	2022-09-27 15:11:47 +03:00
Jacek Sieka	7f9af78ddb	test randao skippping (complements #3837 ) (#4179 )	2022-09-27 09:22:24 +02:00
tersec	9750cd3a38	update state diffs to Bellatrix (#4177 )	2022-09-26 19:13:50 +00:00
tersec	3c03ba86c1	update consensus spec ref URLs to v1.2.0 (#4164 )	2022-09-23 07:56:06 +00:00
tersec	72e6b2021a	use v1.2.0 consensus spec test vectors (#4163 )	2022-09-22 22:24:13 +00:00
zah	ad63bba446	Support Prysm and Ethdo Keystores (Fixes #4107 ) (#4149 )	2022-09-20 01:09:56 +03:00
Eugene Kabanov	174292b7e4	Sync gaps fix (#4090 )	2022-09-19 12:37:42 +03:00
Eugene Kabanov	ca871a5435	Fix HTTP/REST clients HTTP Content-Type header parsers. (#4139 ) * Fix client HTTP content-type parsers. * Fix tests. * Address review comment and apply wildcard checks for generic decodeBytes.	2022-09-19 12:17:29 +03:00
Etan Kissling	9999362b11	detect mismatch of config and binary (#4132 ) * detect mismatch of config and binary When loading configuration that sets keys that Nimbus bakes into the binary at compile-time, raise an error if the config is incompatible instead of ignoring the conflicting value.	2022-09-19 12:07:46 +03:00
Jacek Sieka	ef8bab58eb	load suggested fee recipient file also when keymanager is disabled (#4078 ) Since these files may have been created in a previous run or manually, we want to keep loading them even on nodes that don't enable the keystore API (for example static setups) Other changes: * log keystore loading progressively (#3699) * print initial fee recipient when loading validators * log dynamic fee recipient updates	2022-09-17 08:30:07 +03:00
Etan Kissling	3ba016d75f	consistent peer scoring for missing non-finalized parent (#3381 ) When the sync queue processes results for a blocks by range request, and the requested range contained some slots that are already finalized, `BlockError.MissingParent` currently leads to `PeerScoreBadBlocks` even when the error occurs on a non-finalized slot in the requested range. This patch changes the scoring in that case to `PeerScoreMissingBlocks` for consistency with range requests solely covering non-finalized slots, and, likewise, rewinds the sync queue to the next `rewindSlot`.	2022-09-16 21:45:53 +02:00
tersec	0410aec9d8	remove rest of `withState.state` usage (#4120 ) * remove rest of `withState.state` usage * remove scaffolding	2022-09-16 15:35:00 +02:00
tersec	5b0b48f6e9	implement /eth/v1/validator/register_validator (#4115 )	2022-09-13 14:52:26 +03:00
tersec	8be964a152	update consensus layer spec ref URLs to v1.2.0-rc.3 (#4109 )	2022-09-10 17:16:38 +00:00
tersec	19bf460a3b	more `withState` `state` -> `forkyState` (#4104 )	2022-09-10 08:12:07 +02:00
tersec	1d620f0123	consensus spec URL updates to v1.2.0-rc.3 (#4105 )	2022-09-09 21:56:06 +00:00
tersec	bf3a014287	more efficient forkchoiceUpdated usage (#4055 ) * more efficient forkchoiceUpdated usage * await rather than asyncSpawn; ensure head update before dag.updateHead * use action tracker rather than attached validators to check for next slot proposal; use wall slot + 1 rather than state slot + 1 to correctly check when missing blocks * re-add two-fcU case for when newPayload not VALID * check dynamicFeeRecipientsStore for potential proposal * remove duplicate checks for whether next proposer	2022-09-07 20:34:52 +02:00
Etan Kissling	634408ff2c	use `nim-websock` instead of `news` (#4061 ) `news` has a few open issues that are not present in `nim-websock`: 1. There is a 1 second delay between each MB of sent data. 2. Cancelling an ongoing `send` makes the entire WebSocket unusable. 3. Control packets do not have priority over ongoing message frames. Using `news`, there are quite a few of these messages in Geth: ``` Previously seen beacon client is offline. Please ensure it is operational to follow the chain! ``` It may take quite some time to reconnect when this happens. Using `nim-websock`, this message still occurs because `eth1_monitor` reconnects the EL connection when no new blocks occurred for 5 minutes, but reconnecting is quick and the message is rarer.	2022-09-06 23:41:33 +02:00
tersec	ad0d30093f	state/forkyState cleanup; spec URL updates; rm unused imports (#4052 )	2022-08-31 13:29:34 +02:00
Etan Kissling	994339c7ee	adjust checkpoint tracking for devnets (#4039 ) Track checkpoints more defensively on devnets with low participation.	2022-08-29 09:26:01 +02:00
tersec	b60456fdf3	`withState`: `state` -> `forkyState` (#4038 )	2022-08-26 22:47:40 +00:00
tersec	66a5e88203	allow accessing withState forky state via `forkyState` (#4026 )	2022-08-26 17:14:18 +03:00
Etan Kissling	64972e3c8a	set `safe_block_hash` to fork choice justified (#4010 ) Implements the fork choice safe block spec, where `safe_block_hash` in `forkChoiceUpdated` is set to justified (used to be `ZERO_HASH`). https://github.com/ethereum/consensus-specs/blob/v1.2.0-rc.3/fork_choice/safe-block.md#get_safe_execution_payload_hash	2022-08-25 23:34:02 +00:00
Etan Kissling	9180f09641	reduce LC optsync latency (#4002 ) The optimistic sync spec was updated since the LC based optsync module was introduced. It is no longer necessary to wait for the justified checkpoint to have execution enabled; instead, any block is okay to be optimistically imported to the EL client, as long as its parent block has execution enabled. Complex syncing logic has been removed, and the LC optsync module will now follow gossip directly, reducing the latency when using this module. Note that because this is now based on gossip instead of using sync manager / request manager, that individual blocks may be missed. However, EL clients should recover from this by fetching missing blocks themselves.	2022-08-25 03:53:59 +00:00
tersec	1d55743ebb	allow execution clients several seconds to construct blocks (#4012 )	2022-08-23 19:19:52 +03:00
Jacek Sieka	9e9db216c5	Harden block proposal against expired slashings/exits (#4013 ) * Harden block proposal against expired slashings/exits When a message is signed in a phase0 domain, it can no longer be validated under bellatrix due to the correct fork no longer being available in the `BeaconState`. To ensure that all slashing/exits are still valid, in this PR we re-run the checks in the state that we're proposing for, thus hardening against both signatures and other changes in the state that might have invalidated the message. * fix same message added multiple times in case of attestation slashing of multiple validators in one go	2022-08-23 18:30:46 +03:00
Etan Kissling	f1ddcfff0f	support connecting to peers without bellatrix (#4011 ) * support connecting to peers without bellatrix Make discovery fork ID aware of scheduled Bellatrix fork to enable connections to peers that don't have Bellatrix scheduled yet. Without this, has peering issues with peers on older SW version. * expand tests with compatibility checks * more exhaustive compatibility checks	2022-08-21 19:36:46 +02:00
tersec	c65eaca1bf	update spec ref URLs (#4005 )	2022-08-20 16:03:32 +00:00
zah	b1ac9c9fe4	Fix a potential segfault and various potential stalls (#4003 ) * Fixes a segfault during block production when the Keymanager API is disabled. The Keymanager is now disabled on half of the local testnet nodes to catch such problems in the future. * Fixes multiple potential stalls from REST requests being done without a timeout. From practice, we know that such requests can hang forever if not cancelled with a timeout. At best, this would be a resource leak, at worst, it may lead to a full stall of the client and missed validator duties. * Changes some Options usages to Opt (for easier use of valueOr)	2022-08-19 21:51:30 +00:00
zah	fca20e08d6	Keymanager API for the validator client (#3976 ) * Keymanager API for the validator client * Properly treat the 'description' field as optional when loading Keystores * Spec-compliant serialization of the slashing data in Keymanager's DeleteKeys response () Fixes #3940 Fixes #3964 Closes #3884 by adding test	2022-08-19 13:30:07 +03:00
Jacek Sieka	0d9fd54857	cache shuffling separately from other EpochRef data (fixes #2677 ) (#3990 ) In order to avoid full replays when validating attestations hailing from untaken forks, it's better to keep shufflings separate from `EpochRef` and perform a lookahead on the shuffling when processing the block that determines them. This also helps performance in the case where REST clients are trying to perform lookahead on attestation duties and decreases memory usage by sharing shufflings between EpochRef instances of the same dependent root.	2022-08-18 21:07:01 +03:00
Etan Kissling	5c8e58ea23	update LC spec references for v1.2.0-rc.2 (#3982 ) Updates light client spec references for latest spec (no more `vFuture`)	2022-08-17 19:47:06 +00:00

1 2 3 4 5 ...

1150 Commits