nimbus-eth2

Commit Graph

Author	SHA1	Message	Date
Etan Kissling	ebfb624557	bump `news` to fix concurrency bug with >1MB frames (#4028 ) When calling `newPayload` on a >1MB payload (can happen post-merge), `news` splits up that payload into 1MB chunks. The chunks are each sent individually, though, with `await` in-between. This means that when we send concurrent `forkChoiceUpdated` calls, that those may end up getting in-between the `newPayload` chunks, leading to invalid data being sent. The EL then returns an error message with a `null` `id` entry (as it could not read the request `id` due to the mangling) and disconnects. A PR has been submitted to fix this in `news`, and merged into `status` branch early as this fix is critical for reliable post-merge operation: https://github.com/Tormund/news/pull/22	2022-08-25 21:14:40 +00:00
Etan Kissling	d619b539f3	fix engine API crash when EL disconnected (#4027 ) When issuing an engine API call while the EL is disconnected, a `nil` pointer is dereferenced. Fixed by correctly initializing futures. ``` Traceback (most recent call last, using override) vendor/nim-libp2p/libp2p/protocols/pubsub/pubsub.nim(890) main beacon_chain/nimbus_beacon_node.nim(2139) main beacon_chain/nimbus_beacon_node.nim(0) handleStartUpCmd beacon_chain/nimbus_beacon_node.nim(0) doRunBeaconNode beacon_chain/nimbus_beacon_node.nim(0) start beacon_chain/nimbus_beacon_node.nim(1589) run vendor/nimbus-build-system/vendor/Nim/lib/system/iterators_1.nim(107) poll vendor/nim-chronos/chronos/asyncfutures2.nim(365) futureContinue beacon_chain/consensus_object_pools/consensus_manager.nim(297) updateHeadWithExecution vendor/nim-chronos/chronos/asyncmacro2.nim(213) runProposalForkchoiceUpdated vendor/nim-chronos/chronos/asyncfutures2.nim(365) futureContinue beacon_chain/consensus_object_pools/consensus_manager.nim(259) runProposalForkchoiceUpdated beacon_chain/eth1/eth1_monitor.nim(0) forkchoiceUpdated vendor/nim-chronos/chronos/asyncfutures2.nim(219) complete vendor/nim-chronos/chronos/asyncfutures2.nim(149) cancelled vendor/nimbus-build-system/vendor/Nim/lib/system/excpt.nim(610) signalHandler SIGSEGV: Illegal storage access. (Attempt to read from nil?) ```	2022-08-25 20:07:29 +02:00
Etan Kissling	9180f09641	reduce LC optsync latency (#4002 ) The optimistic sync spec was updated since the LC based optsync module was introduced. It is no longer necessary to wait for the justified checkpoint to have execution enabled; instead, any block is okay to be optimistically imported to the EL client, as long as its parent block has execution enabled. Complex syncing logic has been removed, and the LC optsync module will now follow gossip directly, reducing the latency when using this module. Note that because this is now based on gossip instead of using sync manager / request manager, that individual blocks may be missed. However, EL clients should recover from this by fetching missing blocks themselves.	2022-08-25 03:53:59 +00:00
dependabot[bot]	b6488d5245	Bump nbconvert from 6.4.0 to 6.5.1 in /ncli (#4018 ) Bumps [nbconvert](https://github.com/jupyter/nbconvert) from 6.4.0 to 6.5.1. - [Release notes](https://github.com/jupyter/nbconvert/releases) - [Commits](https://github.com/jupyter/nbconvert/compare/6.4.0...6.5.1) --- updated-dependencies: - dependency-name: nbconvert dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-08-24 16:56:45 +00:00
Etan Kissling	eec6c04d32	do not descore peer when EL connection fails (#4020 ) When the EL fails to respond to `newPayload`, e.g., because connection to the EL got interrupted, or due to misconfiguration, optimistic blocks cannot be imported according to spec. This condition is treated the same as if the peer returned a block with missing parent which gets the block out of our processing queue, but can have nasty side effects. For example, if sync manager asks for validation of a block known to be in the finalized range, if it receives a `MissingParent` verdict, the peer is immediately removed from the peer pool. ``` DBG 2022-08-24 11:45:26.874+02:00 newPayload: inserting block into execution engine parentHash=e4ca7424 blockHash=36cdc198 stateRoot=cf3902c1 receiptsRoot=56e81f17 prevRandao=0b49a172 blockNumber=1518089 gasLimit=30000000 gasUsed=0 timestamp=1657980396 extraDataLen=0 baseFeePerGas=7 numTransactions=0 ERR 2022-08-24 11:45:26.875+02:00 newPayload failed msg="Transport is not initialised (missing a call to connect?)" DBG 2022-08-24 11:45:26.875+02:00 Block pool rejected peer's response topics="syncman" request=187232:32@1475 peer=16UMsCJdx direction=forward blocks_map=xxxxxxxxxxxxxxxxxxxxxxxxxxxx.xxx blocks_count=31 ok=false unviable=false missing_parent=true sync_ident=main ERR 2022-08-24 11:45:26.875+02:00 Unexpected missing parent at finalized epoch slot topics="syncman" request=187232:32@1475 peer=16UMsCJdx direction=forward rewind_to_slot=187232 blocks_count=31 blocks_map=xxxxxxxxxxxxxxxxxxxxxxxxxxxx.xxx sync_ident=main DBG 2022-08-24 11:45:26.875+02:00 Peer was removed from PeerPool due to low score topics="beacnde" peer=16UMsCJdx peer_score=-1000 score_low_limit=0 score_high_limit=1000 DBG 2022-08-24 11:45:26.875+02:00 Lost connection to peer topics="networking" peer=16UMsCJdx connections=0 ``` By delaying issuing a verdict until the EL connection is restored and `newPayload` successfully ran, the problem should be fixed. This also induces back pressure to the sync manager by stopping download of new blocks (or re-downloading the same block over and over again).	2022-08-24 16:55:41 +00:00
Jacek Sieka	492a2ccfac	bump ssz-serialization (#4017 ) * free 7% speedup of `hash_tree_root` * harden `len` for uninitialized `BitSeq`	2022-08-23 18:32:34 +00:00
tersec	1d55743ebb	allow execution clients several seconds to construct blocks (#4012 )	2022-08-23 19:19:52 +03:00
Jacek Sieka	9e9db216c5	Harden block proposal against expired slashings/exits (#4013 ) * Harden block proposal against expired slashings/exits When a message is signed in a phase0 domain, it can no longer be validated under bellatrix due to the correct fork no longer being available in the `BeaconState`. To ensure that all slashing/exits are still valid, in this PR we re-run the checks in the state that we're proposing for, thus hardening against both signatures and other changes in the state that might have invalidated the message. * fix same message added multiple times in case of attestation slashing of multiple validators in one go	2022-08-23 18:30:46 +03:00
tersec	e70d5e6194	update spec ref URLs in state_transition_epoch (#4016 )	2022-08-23 13:06:12 +00:00
Zahary Karadjov	74fa8de886	Merge branch 'stable' into unstable	2022-08-23 01:13:06 +03:00
Zahary Karadjov	57f9974fe5	Version 22.8.0	2022-08-23 01:11:29 +03:00
zah	4e41ed1d5a	Require properly configured Engine API connection after the merge (#4006 )	2022-08-22 22:44:40 +03:00
Etan Kissling	f1ddcfff0f	support connecting to peers without bellatrix (#4011 ) * support connecting to peers without bellatrix Make discovery fork ID aware of scheduled Bellatrix fork to enable connections to peers that don't have Bellatrix scheduled yet. Without this, has peering issues with peers on older SW version. * expand tests with compatibility checks * more exhaustive compatibility checks	2022-08-21 19:36:46 +02:00
Etan Kissling	74dc388ad9	do not prune LC data by default (#4008 ) Aligns the default retention policy for LC data with the one for blocks. Minimum spec requirement for both blocks and LC data is ~5 months. Additional use cases are better supported by retaining data for longer.	2022-08-21 11:24:59 +02:00
tersec	c65eaca1bf	update spec ref URLs (#4005 )	2022-08-20 16:03:32 +00:00
Etan Kissling	bac50610fd	re-generate test report (#4007 ) A couple tests have been removed recently; re-ran `make -j test` to sync the test report.	2022-08-20 14:40:33 +00:00
zah	09de83af80	Reviewed the Engine API calls for missing error handling (#4004 )	2022-08-20 09:09:25 +03:00
zah	b1ac9c9fe4	Fix a potential segfault and various potential stalls (#4003 ) * Fixes a segfault during block production when the Keymanager API is disabled. The Keymanager is now disabled on half of the local testnet nodes to catch such problems in the future. * Fixes multiple potential stalls from REST requests being done without a timeout. From practice, we know that such requests can hang forever if not cancelled with a timeout. At best, this would be a resource leak, at worst, it may lead to a full stall of the client and missed validator duties. * Changes some Options usages to Opt (for easier use of valueOr)	2022-08-19 21:51:30 +00:00
tersec	f537f263df	don't use empty execution payload when newPayload rejects it (#3999 ) * don't use empty execution payload when newPayload rejects it * disallow optimistic import except when accepted/syncing	2022-08-20 00:20:57 +03:00
zah	df5ef95111	Doppelganger detection bug fix (#3997 ) When the client was started without any validators, the doppelganger detection structures were never initialized properly. Later, when validators were added through the Keymanager API, they interacted with the uninitialized doppelganger detection structures and their duties were inappropriately skipped.	2022-08-19 13:34:08 +03:00
zah	fca20e08d6	Keymanager API for the validator client (#3976 ) * Keymanager API for the validator client * Properly treat the 'description' field as optional when loading Keystores * Spec-compliant serialization of the slashing data in Keymanager's DeleteKeys response () Fixes #3940 Fixes #3964 Closes #3884 by adding test	2022-08-19 13:30:07 +03:00
zah	a7192f5d6c	Fix the block header computation when proposing an empty execution payload (#3991 ) * Fix the block header computation when proposing an empty execution payload * Spec compliant base fee calculation when producing empty payloads	2022-08-19 13:28:42 +03:00
tersec	b5b93e90c0	use v1.2.0-rc.3 test vectors (#3995 )	2022-08-19 04:32:53 +00:00
Jacek Sieka	c8fb447020	valmon: log autoregistration once only (#3993 )	2022-08-18 23:09:49 +00:00
tersec	b32ab88191	switch GitHub Actions CI images from Ubuntu 18.04 to 20.04 (#3988 )	2022-08-18 21:07:57 +03:00
Jacek Sieka	0d9fd54857	cache shuffling separately from other EpochRef data (fixes #2677 ) (#3990 ) In order to avoid full replays when validating attestations hailing from untaken forks, it's better to keep shufflings separate from `EpochRef` and perform a lookahead on the shuffling when processing the block that determines them. This also helps performance in the case where REST clients are trying to perform lookahead on attestation duties and decreases memory usage by sharing shufflings between EpochRef instances of the same dependent root.	2022-08-18 21:07:01 +03:00
Zahary Karadjov	d3dbfd6021	libnfuzz uses the mainnet RuntimeConfig (the default one might differ in subtle ways)	2022-08-18 19:48:16 +03:00
Zahary Karadjov	7f681af56f	Make it easier to test trustedNodeSync or different networks	2022-08-18 19:48:11 +03:00
tersec	2f62567624	distinguish between optimistic and non-optimistic sync progress (#3987 ) * distinguish between optimistic and non-optimistic sync progress * use consistent optimistic formatting for sync	2022-08-18 13:22:22 +00:00
tersec	3ad1d251ef	make newPayload/forkchoiceUpdated failures errors (#3989 )	2022-08-18 12:57:32 +00:00
tersec	c0668367de	don't register exited validators with builder API (#3985 )	2022-08-18 14:59:35 +03:00
tersec	c0f673dc09	spec ref URL updates: v1.2.0-rc.{1,2} for phase0/fork-choice altair/beacon-chain (#3986 )	2022-08-18 07:25:33 +00:00
Etan Kissling	5c8e58ea23	update LC spec references for v1.2.0-rc.2 (#3982 ) Updates light client spec references for latest spec (no more `vFuture`)	2022-08-17 19:47:06 +00:00
Etan Kissling	89bc466a9e	re-enable `LightClientUpdate` tests (#3981 ) Now that the 1.2.0-rc.2 spec contains the same `LightClientUpdate` definition that Nimbus was already using before, the corresponding SSZ test vectors can be re-enabled.	2022-08-17 17:50:28 +00:00
zah	74819729a5	Bellatrix metadata for Mainnet (#3980 )	2022-08-17 14:40:58 +00:00
tersec	8274d5373b	update spec ref URLs (#3979 )	2022-08-17 11:33:19 +00:00
Jacek Sieka	44ea08240c	web3 doc updates (#3971 )	2022-08-17 13:37:25 +03:00
Jacek Sieka	255be39e69	packaging updates (#3974 ) * packaging updates * one package per binary (nimbus_beacon_node, nimbus_validator_client) * use `-` in package name (`_` is separating the version) * don't include (un)installation scripts in package * default metrics port 8108 for vc * fix several upgrade/install errors in scripts * add JWT option to service files * don't attempt to remove user on purge	2022-08-17 13:26:31 +03:00
zah	ca3245c4f0	Doppelganger exit code changed from 1031 to 129 (addresses #3973 ) (#3977 )	2022-08-17 08:13:55 +02:00
zah	fa9e2b4ec4	Fix #3965 (potential crashes when routing certain gossip messages) (#3978 )	2022-08-17 01:47:21 +00:00
Jacek Sieka	94a479f1c9	ncli: time operations (#3966 )	2022-08-17 01:17:22 +03:00
Eugene Kabanov	cd9b50bbbc	Address #3962 (#3968 )	2022-08-17 01:14:32 +03:00
tersec	e732b080f7	use v1.2.0-rc.2 test vectors (#3975 )	2022-08-16 21:24:37 +00:00
Nikolay Mitev	5c91d29df0	Trivial: add wait_for_port function to script	2022-08-16 12:39:27 +03:00
Nikolay Mitev	33546f0fa9	Trivial: Fix typo	2022-08-15 16:46:14 +03:00
Nikolay Mitev	9e6d9b955d	Trivial: Make NIMBUS_EL_BINARY customizable	2022-08-14 15:51:42 +03:00
Jacek Sieka	4adf849d09	book edits (#3955 ) * reorganise navigation menus * update light client guide with comparison table * add suggested fee recipient and JWT secrets to the merge guide * add some background info to book readme * add JWT docs also limit toc to make it displayable with substeps.	2022-08-13 20:17:20 +02:00
Etan Kissling	052f9edfd4	import EL deposits even when EL is stuck (#3956 ) * import EL deposits even when EL is stuck The `eth1_monitor` only starts importing deposits once the EL reports a new head block. However, the EL may be stuck at a block, e.g., the TTD. By polling the latest EL block once after subscribing to new EL block events it is ensured that deposits are still imported in this situation. * also poll once on re-connects * update `eth1_latest_head` metric in poll mode * add comment about similar polling vs events parts * replace check with assert * `isNewLastBlock` helper	2022-08-12 19:44:55 +00:00
Miran	b1974d90eb	log NimVersion (#3937 )	2022-08-12 17:53:15 +03:00
Nikolay Mitev	607de676cb	launch_local_testnet script: add options to download eth2 binary (#3958 ) Some refactoring and cleanup	2022-08-12 17:41:11 +03:00

... 9 10 11 12 13 ...

5102 Commits All Branches Search

5102 Commits

All Branches