When the EL fails to respond to `newPayload`, e.g., because the
connection to the EL was interrupted or due to misconfiguration,
optimistic blocks cannot be imported according to spec. This condition
is treated the same as if the peer had returned a block with a missing
parent, which gets the block out of our processing queue but can have
nasty side effects.
For example, if the sync manager asks for validation of a block known
to be in the finalized range and receives a `MissingParent` verdict,
the peer is immediately removed from the peer pool.
```
DBG 2022-08-24 11:45:26.874+02:00 newPayload: inserting block into execution engine parentHash=e4ca7424 blockHash=36cdc198 stateRoot=cf3902c1 receiptsRoot=56e81f17 prevRandao=0b49a172 blockNumber=1518089 gasLimit=30000000 gasUsed=0 timestamp=1657980396 extraDataLen=0 baseFeePerGas=7 numTransactions=0
ERR 2022-08-24 11:45:26.875+02:00 newPayload failed msg="Transport is not initialised (missing a call to connect?)"
DBG 2022-08-24 11:45:26.875+02:00 Block pool rejected peer's response topics="syncman" request=187232:32@1475 peer=16U*MsCJdx direction=forward blocks_map=xxxxxxxxxxxxxxxxxxxxxxxxxxxx.xxx blocks_count=31 ok=false unviable=false missing_parent=true sync_ident=main
ERR 2022-08-24 11:45:26.875+02:00 Unexpected missing parent at finalized epoch slot topics="syncman" request=187232:32@1475 peer=16U*MsCJdx direction=forward rewind_to_slot=187232 blocks_count=31 blocks_map=xxxxxxxxxxxxxxxxxxxxxxxxxxxx.xxx sync_ident=main
DBG 2022-08-24 11:45:26.875+02:00 Peer was removed from PeerPool due to low score topics="beacnde" peer=16U*MsCJdx peer_score=-1000 score_low_limit=0 score_high_limit=1000
DBG 2022-08-24 11:45:26.875+02:00 Lost connection to peer topics="networking" peer=16U*MsCJdx connections=0
```
By delaying the verdict until the EL connection is restored and
`newPayload` has run successfully, the problem should be fixed. This
also applies back pressure to the sync manager by stopping the download
of new blocks (or the re-downloading of the same block over and over
again).
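A minimal sketch of the approach in Nim, with hypothetical `ELClient`
and wrapper names; the key point is that no verdict reaches the sync
manager while the EL is unreachable:
```
# Sketch: retry `newPayload` until the EL answers, instead of treating
# a transport failure as `MissingParent`.
proc getExecutionValidity(elClient: ELClient,
                          blck: SignedBeaconBlock):
    Future[PayloadExecutionStatus] {.async.} =
  while true:
    try:
      # only a real EL response may influence peer scoring
      return await elClient.newPayload(blck.message.body.execution_payload)
    except CatchableError:
      # EL unreachable or misconfigured: wait and retry; meanwhile the
      # sync manager gets no verdict and stops fetching further blocks
      await sleepAsync(1.seconds)
```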
* Harden block proposal against expired slashings/exits
When a message is signed in a phase0 domain, it can no longer be
validated under bellatrix because the correct fork is no longer
available in the `BeaconState`.
To ensure that all slashings/exits are still valid, we re-run the
checks against the state that we're proposing for, thus hardening
against both stale signatures and other changes in the state that
might have invalidated the message.
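A sketch of the idea, reusing the spec-level check functions (the
surrounding plumbing is simplified):
```
# Sketch: filter pooled messages against the state being proposed for,
# dropping anything whose signature or content is no longer valid.
func validMessages(pool: ExitPool, state: BeaconState): tuple[
    attesterSlashings: seq[AttesterSlashing],
    proposerSlashings: seq[ProposerSlashing],
    exits: seq[SignedVoluntaryExit]] =
  for msg in pool.attester_slashings:
    if check_attester_slashing(state, msg, {}).isOk:
      result.attesterSlashings.add msg
  for msg in pool.proposer_slashings:
    if check_proposer_slashing(state, msg, {}).isOk:
      result.proposerSlashings.add msg
  for msg in pool.voluntary_exits:
    if check_voluntary_exit(state, msg, {}).isOk:
      result.exits.add msg
```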
* fix the same message being added multiple times when an attester
  slashing covers multiple validators in one go
* support connecting to peers without bellatrix
Make discovery fork ID aware of scheduled Bellatrix fork to enable
connections to peers that don't have Bellatrix scheduled yet.
Without this, the client has peering issues with peers running older
software versions.
* expand tests with compatibility checks
* more exhaustive compatibility checks
Aligns the default retention policy for LC data with the one for blocks.
The minimum spec requirement for both blocks and LC data is ~5 months.
Additional use cases are better supported by retaining data for longer.
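For reference, the ~5 months correspond to
`MIN_EPOCHS_FOR_BLOCK_REQUESTS` (33024 epochs per the p2p spec) under
mainnet timing:
```
33024 epochs * 32 slots/epoch * 12 s/slot = 12681216 s ≈ 146.8 days ≈ 5 months
```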
* Fixes a segfault during block production when the Keymanager API
is disabled. The Keymanager is now disabled on half of the local
testnet nodes to catch such problems in the future.
* Fixes multiple potential stalls from REST requests being made
  without a timeout. From practice, we know that such requests
  can hang forever if not cancelled via a timeout. At best,
  this is a resource leak; at worst, it may lead to a
  full stall of the client and missed validator duties
  (see the sketch after this list).
* Changes some Options usages to Opt (for easier use of valueOr)
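A sketch of the timeout pattern using chronos' `withTimeout` (the REST
call itself is a hypothetical stand-in):
```
# Sketch: never await a REST future without a deadline.
let fut = client.getStateValidators(stateId)   # hypothetical REST call
if await fut.withTimeout(30.seconds):
  let response = fut.read()
  # ... process response ...
else:
  # withTimeout cancels the future on expiry, so nothing leaks and the
  # caller regains control instead of stalling
  raise newException(CatchableError, "REST request timed out")
```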
When the client was started without any validators, the doppelganger
detection structures were never initialized properly. Later, when
validators were added through the Keymanager API, they interacted
with the uninitialized doppelganger detection structures and their
duties were inappropriately skipped.
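A sketch of the fix's shape, with hypothetical helper names:
registration now happens on every addition rather than only at startup.
```
# Sketch: doppelganger bookkeeping runs whenever a validator is
# attached, covering validators added later via the Keymanager API.
proc addValidator(pool: ValidatorPool, v: AttachedValidator) =
  pool.validators[v.pubkey] = v
  # previously this setup only happened for validators present at startup
  pool.doppelgangerDetection.register(v.pubkey)  # hypothetical helper
```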
* Keymanager API for the validator client
* Properly treat the 'description' field as optional when loading Keystores
* Spec-compliant serialization of the slashing data in Keymanager's DeleteKeys response
Fixes #3940, fixes #3964, closes #3884 by adding a test.
In order to avoid full replays when validating attestations hailing from
untaken forks, it's better to keep shufflings separate from `EpochRef`
and perform a lookahead on the shuffling when processing the block that
determines them.
This also helps performance in the case where REST clients are trying to
perform lookahead on attestation duties and decreases memory usage by
sharing shufflings between EpochRef instances of the same dependent
root.
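Roughly, the separated cache entry looks like this (a sketch; field
names follow the spec's terminology):
```
type
  ShufflingRef = ref object
    # root of the block that determines this shuffling; EpochRef
    # instances sharing a dependent root reuse one ShufflingRef
    attester_dependent_root: Eth2Digest
    epoch: Epoch
    shuffled_active_validator_indices: seq[ValidatorIndex]
```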
* packaging updates
* one package per binary (nimbus_beacon_node, nimbus_validator_client)
* use `-` in package name (`_` separates the version)
* don't include (un)installation scripts in package
* default metrics port 8108 for vc
* fix several upgrade/install errors in scripts
* add JWT option to service files
* don't attempt to remove user on purge
* import EL deposits even when EL is stuck
The `eth1_monitor` only starts importing deposits once the EL reports a
new head block. However, the EL may be stuck at a block, e.g., the TTD.
By polling the latest EL block once after subscribing to new EL block
events, it is ensured that deposits are still imported in this
situation (see the sketch after this list).
* also poll once on re-connects
* update `eth1_latest_head` metric in poll mode
* add comment about similar polling vs events parts
* replace check with assert
* `isNewLastBlock` helper
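A sketch of the combined subscribe-then-poll startup, with hypothetical
helper names:
```
proc startEth1Syncing(m: Eth1Monitor) {.async.} =
  await m.subscribeToNewHeads()   # events cover the normal case
  # if the EL is stuck (e.g., parked at the TTD block), no event will
  # ever fire, so poll the latest block once to kick off deposit import
  let latest = await m.web3.provider.eth_getBlockByNumber("latest", false)
  if m.isNewLastBlock(latest):
    eth1_latest_head.set latest.number.toGaugeValue
    await m.importDepositsUpTo(latest.number)   # hypothetical
```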
When fetching eth1 data and deposits for a new block proposal, the list
of deposits from previous eth1 data to the next one is fully loaded into
a `seq`. This can potentially be a very long list in active periods.
Changing this to an `iterator` saves memory by ensuring that the entire
list is no longer materialized; only the `DepositData` roots are needed.
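A sketch of the iterator-based variant (storage layout simplified):
```
# Sketch: stream deposits one at a time instead of materializing the
# whole range; callers fold the DepositData roots incrementally.
iterator getDepositsRange(chain: Eth1Chain,
                          first, last: uint64): DepositData =
  for blk in chain.blocks:
    if blk.number >= first and blk.number < last:
      for deposit in blk.deposits:
        yield deposit
```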
When the EL connection is interrupted, deposits are once more requested
in chunks of 5000 blocks. This is a problem when the response takes over
a minute to produce and consistently times out, as follow-up requests
with lower chunk sizes may no longer work after a request was cancelled,
e.g., when using Geth with websockets. By keeping track of
`blocksPerRequest` across EL reconnections, it is possible to recover
from this by avoiding continuously repeating the initial request with
the full 5000 blocks (see the sketch below).
Also cleans up one more "retry of retry" instance; `DataProviderTimeout`
is a `CatchableError` and already handled by the existing retry logic.
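A sketch of the adaptive chunk sizing (loop structure and helpers are
hypothetical):
```
var
  blocksPerRequest = 5000'u64              # kept across EL reconnections
  fromBlock = chain.lastImportedBlock + 1  # hypothetical progress marker
while not chain.fullySynced:
  try:
    await syncBlockRange(fromBlock, fromBlock + blocksPerRequest - 1)
    fromBlock += blocksPerRequest
  except CatchableError:
    # halve on failure; the next attempt, even after a reconnect,
    # starts from the reduced size rather than the full 5000 blocks
    blocksPerRequest = max(blocksPerRequest div 2, 1'u64)
```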
When connection to the EL is lost as part of EL deposits importing, the
targeted block range to sync would reset. This is changed to properly
remember import progress across reconnects.
https://github.com/status-im/nimbus-eth2/pull/3944
The use of nested `awaitWithRetries` calls would have
resulted in an unexpected number of retries (3x3).
We now use a regular `await` in the outer layer to avoid the problem.
https://github.com/status-im/nimbus-eth2/pull/3943
The new code has an invariant that the `headMerkleizer` field in
the `Eth1Chain` is always kept in sync with the blocks stored in
the chain.
This invariant is now enforced more robustly by performing the
necessary merkleizer updates in `Eth1Chain.addBlock`, `Eth1Chain.init`,
and `Eth1Chain.reset`.
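A sketch of the `addBlock` part (container layout simplified):
```
# Sketch: advancing the merkleizer inside addBlock keeps the invariant
# local to Eth1Chain instead of relying on every caller to remember it.
proc addBlock(chain: var Eth1Chain, blk: Eth1Block) =
  for deposit in blk.deposits:
    chain.headMerkleizer.addChunk hash_tree_root(deposit).data
  chain.blocks.addLast blk
  chain.blocksByHash[blk.hash] = blk
```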
When importing blocks with deposits from the EL, the timestamp is never
initialized for them. Therefore, only blocks without deposits (for which
the timestamp is obtained) are considered for `is_candidate_block`.
This is fixed by also importing timestamps for blocks with deposits.
* fix obtaining deposits after connection loss
When an error occurs during Eth1 deposits import, the already imported
blocks are kept while the connection to the EL is re-established.
However, the corresponding merkleizer is not persisted, leading to any
future deposits no longer being properly imported. This is quite common
when syncing a fresh Nimbus instance against an already-synced Geth EL.
Fixed by persisting the head merkleizer together with the blocks.
* MEV validator registration
* add nearby canary to detect new beacon chain forks
* remove special MEV graffiti
* web3signer support
* fix trace logging
* Nim 1.2 needs `Defect` in `raises` lists
* use template rather than proc in REST JSON parsing
* use --payload-builder-enable and --payload-builder-url
* explicitly default MEV to disabled
* explicitly empty default value for payload builder URL
* revert attestation pool to unstable version
* Use final `v1` version for light client protocols
* Unhide LC data collection options
* Default enable LC data serving
* rm unneeded import
* Connect to EL on startup
* Add docs for LC based EL sync