nimbus-eth2

Commit Graph

Author	SHA1	Message	Date
tersec	bf3ef987e4	deactivate doppelganger protection during genesis (#3362 ) * deactivate Doppelganger Protection during genesis * also don't actually flag supposed-doppelgangers (because they're before broadcastStartEpoch) on GENESIS_SLOT start	2022-02-07 07:12:36 +02:00
Jacek Sieka	a50e21e229	fix doppelganger detection logging * update action tracker on dependent-root-changing reorg (instead of epoch change) * don't try to log duties while syncing - we're not tracking actions yet * fix slot used for doppelganger loss detection	2022-02-04 12:25:32 +01:00
Jacek Sieka	49282e9477	val_mon: register locally produced aggregates (#3352 ) These use a separate flow, and were previously only registered from the network * don't log successes in totals mode (TMI) * remove `attestation-sent` event which is unused	2022-02-04 08:33:20 +01:00
tersec	89ffa8a1a7	spec URL & copyright year update (#3338 )	2022-01-29 01:05:39 +00:00
Jacek Sieka	61342c2449	limit by-root requests to non-finalized blocks (#3293 ) * limit by-root requests to non-finalized blocks Presently, we keep a mapping from block root to `BlockRef` in memory - this has simplified reasoning about the dag, but is not sustainable with the chain growing. We can distinguish between two cases where by-root access is useful: * unfinalized blocks - this is where the beacon chain is operating generally, by validating incoming data as interesting for future fork choice decisions - bounded by the length of the unfinalized period * finalized blocks - historical access in the REST API etc - no bounds, really In this PR, we limit the by-root block index to the first use case: finalized chain data can more efficiently be addressed by slot number. Future work includes: * limiting the `BlockRef` horizon in general - each instance is 40 bytes+overhead which adds up - this needs further refactoring to deal with the tail vs state problem * persisting the finalized slot-to-hash index - this one also keeps growing unbounded (albeit slowly) Anyway, this PR easily shaves ~128mb of memory usage at the time of writing. * No longer honor `BeaconBlocksByRoot` requests outside of the non-finalized period - previously, Nimbus would generously return any block through this libp2p request - per the spec, finalized blocks should be fetched via `BeaconBlocksByRange` instead. * return `Opt[BlockRef]` instead of `nil` when blocks can't be found - this becomes a lot more common now and thus deserves more attention * `dag.blocks` -> `dag.forkBlocks` - this index only carries unfinalized blocks from now - `finalizedBlocks` covers the other `BlockRef` instances * in backfill, verify that the last backfilled block leads back to genesis, or panic * add backfill timings to log * fix missing check that `BlockRef` block can be fetched with `getForkedBlock` reliably * shortcut doppelganger check when feature is not enabled * in REST/JSON-RPC, fetch blocks without involving `BlockRef` * fix dag.blocks ref	2022-01-21 13:33:16 +02:00
tersec	9c0c9c98ce	complete switch to beacon_chain/specs/datatypes/bellatrix (#3295 )	2022-01-18 13:36:52 +00:00
Jacek Sieka	836f6984bb	move `state_transition` to `Result` (#3284 ) * better error messages in api * avoid `BlockData` copies when replaying blocks	2022-01-17 12:19:58 +01:00
Jacek Sieka	805e85e1ff	time: spring cleaning (#3262 ) Time in the beacon chain is expressed relative to the genesis time - this PR creates a `beacon_time` module that collects helpers and utilities for dealing the time units - the new module does not deal with actual wall time (that's remains in `beacon_clock`). Collecting the time related stuff in one place makes it easier to find, avoids some circular imports and allows more easily identifying the code actually needs wall time to operate. * move genesis-time-related functionality into `spec/beacon_time` * avoid using `chronos.Duration` for time differences - it does not support negative values (such as when something happens earlier than it should) * saturate conversions between `FAR_FUTURE_XXX`, so as to avoid overflows * fix delay reporting in validator client so it uses the expected deadline of the slot, not "closest wall slot" * simplify looping over the slots of an epoch * `compute_start_slot_at_epoch` -> `start_slot` * `compute_epoch_at_slot` -> `epoch` A follow-up PR will (likely) introduce saturating arithmetic for the time units - this is merely code moves, renames and fixing of small bugs.	2022-01-11 11:01:54 +01:00
Jacek Sieka	20e700fae4	Harden CommitteeIndex, SubnetId, SyncSubcommitteeIndex (#3259 ) * Harden CommitteeIndex, SubnetId, SyncSubcommitteeIndex Harden the use of `CommitteeIndex` et al to prevent future issues by using a distinct type, then validating before use in several cases - datatypes in spec are kept simple though so that invalid data still can be read. * fix invalid epoch used in REST `/eth/v1/beacon/states/{state_id}/committees` committee length (could return invalid data) * normalize some variable names * normalize committee index loops * fix `RestAttesterDuty` to use `uint64` for `validator_committee_index` * validate `CommitteeIndex` on ingress in REST API * update rest rules with stricter parsing * better REST serializers * save lots of memory by not using `zip` ...at least a few bytes!	2022-01-09 01:28:49 +02:00
Jacek Sieka	0a4728a241	Handle access to historical data for which there is no state (#3217 ) With checkpoint sync in particular, and state pruning in the future, loading states or state-dependent data may fail. This PR adjusts the code to allow this to be handled gracefully. In particular, the new availability assumption is that states are always available for the finalized checkpoint and newer, but may fail for anything older. The `tail` remains the point where state loading de-facto fails, meaning that between the tail and the finalized checkpoint, we can still get historical data (but code should be prepared to handle this as an error). However, to harden the code against long replays, several operations which are assumed to work only with non-final data (such as gossip verification and validator duties) now limit their search horizon to post-finalized data. * harden several state-dependent operations by logging an error instead of introducing a panic when state loading fails * `withState` -> `withUpdatedState` to differentiate from the other `withState` * `updateStateData` can now fail if no state is found in database - it is also hardened against excessively long replays * `getEpochRef` can now fail when replay fails * reject blocks with invalid target root - they would be ignored previously * fix recursion bug in `isProposed`	2022-01-05 19:38:04 +01:00
Zahary Karadjov	54d0d588b1	Implementation of the Keymanager API (BETA) https://github.com/ethereum/keymanager-APIs	2022-01-04 18:51:45 +02:00
tersec	da017d2ca5	update from phase0/altair v1.1.6 URLs to v1.1.8 spec URLs (#3238 )	2022-01-04 03:57:15 +00:00
Jacek Sieka	c4ce59e55b	Assorted logging improvements (#3237 ) * log doppelganger detection when it activates and when it causes missed duties * less prominent eth1 sync progress * log in-progress sync at notice only when actually missing duties * better detail in replay log * don't log finalization checkpoints - this is quite verbose when syncing and already included in "Slot start"	2022-01-03 22:18:49 +01:00
Jacek Sieka	c270ec21e4	Validator monitoring (#2925 ) Validator monitoring based on and mostly compatible with the implementation in Lighthouse - tracks additional logs and metrics for specified validators so as to stay on top on performance. The implementation works more or less the following way: * Validator pubkeys are singled out for monitoring - these can be running on the node or not * For every action that the validator takes, we record steps in the process such as messages being seen on the network or published in the API * When the dust settles at the end of an epoch, we report the information from one epoch before that, which coincides with the balances being updated - this is a tradeoff between being correct (waiting for finalization) and providing relevant information in a timely manner)	2021-12-20 20:20:31 +01:00
tersec	6ef3834f4a	fix type-conversions-to-self, unexport from nimbus_beacon_node, and rm unused vars/procs (#3211 )	2021-12-20 12:21:17 +01:00
tersec	57974ce61b	forkchoiceUpdate support (#3199 )	2021-12-17 12:23:32 +00:00
tersec	f09686e835	update some spec URLs to v1.1.6 (#3188 )	2021-12-13 15:45:48 +00:00
Jacek Sieka	069bccd51b	batch-verify sync messages for a small perf boost (#3151 ) * batch-verify sync messages for a small perf boost Generally reuses the same structure as attestation and aggregate verification * normalize `signatures` and `signature_batch` to use the same pattern of verification * normalize parameter names, order etc for signature stuff in general * avoid calling `blsSign` directly - instead, go through `signatures` consistently	2021-12-09 14:56:54 +02:00
Jacek Sieka	850eece949	Broadcast blocks before integrating in ChainDAG This PR fixes two issues with block publishing: * Gossip-valid blocks are published before integrating them into the chain, giving broadcasting a head start, both for rest block and * Outright invalid blocks from the API that could lead to the descoring of the node are no longer broadcast Bonus: * remove undocumented and duplicated `post_v1_validator_block` JSON-RPC call	2021-12-06 17:15:47 +02:00
tersec	cc51f3fd12	v1.1.{5 -> 6} phase 0 and altair spec URL updates (#3157 )	2021-12-03 17:40:23 +00:00
Eugene Kabanov	e62c7c7c37	Remote signing client/server. (#3077 )	2021-11-30 03:20:21 +02:00
Zahary Karadjov	4570f6e4ec	Allow broadcasting "ignored" gossip messages Validator clients such as Vouch can be configured to work with multiple beacon nodes simultaneously. In this configuration, the validator client will try to broadcast the gossip messages through each of the connected beacon nodes which may lead to a situation where some of the nodes see a message arriving from the network before it arrives through the REST API. This should not be considered an error and the beacon node should still broadcast the message as the intented purpose of the Vouch strategy is to ensure that the message will reach as many peers as possible.	2021-11-28 01:54:36 +02:00
Zahary Karadjov	7902e7684c	Sync with Eth1 even when there are no validators attached	2021-11-27 18:43:01 +02:00
Jacek Sieka	a223d62b07	Cleanups (#3123 ) Renames and cleanups split out from the validator monitoring branch, so as to reduce conflict area vs other PR:s * add constants for expected message timing * name validators after the messages they validate, mostly, to make grepping easier * unify field naming of EpochInfo across forks to make cross-fork code easier	2021-11-25 13:20:36 +01:00
Jacek Sieka	9c2f43ed0e	Speed up altair block processing 2x (#3115 ) * Speed up altair block processing >2x Like #3089, this PR drastially speeds up historical REST queries and other long state replays. * cache sync committee validator indices * use ~80mb less memory for validator pubkey mappings * batch-verify sync aggregate signature (fixes #2985) * document sync committee hack with head block vs sync message block * add batch signature verification failure tests Before: ``` ../env.sh nim c -d:release -r ncli_db --db:mainnet_0/db bench --start-slot:-1000 All time are ms Average, StdDev, Min, Max, Samples, Test Validation is turned off meaning that no BLS operations are performed 5830.675, 0.000, 5830.675, 5830.675, 1, Initialize DB 0.481, 1.878, 0.215, 59.167, 981, Load block from database 8422.566, 0.000, 8422.566, 8422.566, 1, Load state from database 6.996, 1.678, 0.042, 14.385, 969, Advance slot, non-epoch 93.217, 8.318, 84.192, 122.209, 32, Advance slot, epoch 20.513, 23.665, 11.510, 201.561, 981, Apply block, no slot processing 0.000, 0.000, 0.000, 0.000, 0, Database load 0.000, 0.000, 0.000, 0.000, 0, Database store ``` After: ``` 7081.422, 0.000, 7081.422, 7081.422, 1, Initialize DB 0.553, 2.122, 0.175, 66.692, 981, Load block from database 5439.446, 0.000, 5439.446, 5439.446, 1, Load state from database 6.829, 1.575, 0.043, 12.156, 969, Advance slot, non-epoch 94.716, 2.749, 88.395, 100.026, 32, Advance slot, epoch 11.636, 23.766, 4.889, 205.250, 981, Apply block, no slot processing 0.000, 0.000, 0.000, 0.000, 0, Database load 0.000, 0.000, 0.000, 0.000, 0, Database store ``` * add comment	2021-11-24 13:43:50 +01:00
Jacek Sieka	95dd846a9b	Make sync horizon configurable (#3113 ) Currently, we don't have a good answer to the question "are we synced yet" - the sync manager syncs based on the peers it's connected to, but just because some peer looks like it should be synced from doesn't mean we're out of sync. Instead, we use a very silly time-based heuristic - the problem with that is that the network can go into a rut where nobody produces blocks - better heuristics would be needed here, but in the meantime, a command line option can get us out of a tight spot - this PR places such an option in the client, in the unlikely event it should be needed (most likely in a testnet).	2021-11-18 20:35:26 +01:00
Jacek Sieka	f19a497eec	ncli_db: add putState, putBlock (#3096 ) * ncli_db: add putState, putBlock These tools allow modifying an existing nimbus database for the purpose of recovery or reorg, moving the head, tail and genesis to arbitrary points. * remove potentially expensive `putState` in `BeaconStateDB` * introduce `latest_block_root` which computes the root of the latest applied block from the `latest_block_header` field (instead of passing it in separately) * avoid some unnecessary BeaconState copies during init * discover https://github.com/nim-lang/Nim/issues/19094 * prefer `HashedBeaconState` in a few places to avoid recomputing state root * fetch latest block root from state when creating blocks * harden `get_beacon_proposer_index` against invalid slots and document * move random spec function tests to `test_spec.nim` * avoid unnecessary state root computation before block proposal	2021-11-18 13:02:43 +01:00
Jacek Sieka	b22d86e161	REST/JSON-RPC: speed up several requests (#3092 ) REST/JSON-RPC and a few more also invalidate caches unnecessarily, similar to https://github.com/status-im/nimbus-eth2/pull/3089 * avoid copying validator on balance request	2021-11-12 23:29:28 +01:00
tersec	2e868dc2ba	mass/mechanical update of 1.1.4 phase0 and altair spec URLs to 1.1.5 (#3067 )	2021-11-09 07:40:41 +00:00
tersec	eb3ad25859	mass/mechanical update of 1.1.3 phase/altair spec URLs to 1.1.4 (#3058 )	2021-11-08 06:18:10 +00:00
Zahary Karadjov	29e5700838	Bugfix: Avoid the aggregation of duplicate signatures when creating sync committee contributions	2021-11-07 21:41:10 +02:00
Jacek Sieka	ea0a191723	Better REST/RPC error messages (#3046 ) * Better REST/RPC error messages * homogenise block logging (root first) * homegenise message verification pipeline (verify in `gossip_verification`, act in `eth2_processor`) * use `subcommitteeIdx` consistently * log each sent contribution * fix block_sim * fix block topic * don't recalc root on gossip block validation * move position loop into sync pool	2021-11-05 17:39:47 +02:00
tersec	8307e9c601	mechanical non-merge v1.1.2 to v1.1.3 spec URL updates (#3030 )	2021-10-26 16:44:23 +00:00
Jacek Sieka	421bf936ff	odds and ends (#3015 ) * `allSyncCommittees` => `allSyncSubcommittees` * simplify `_snappy` topic generation (avoid pointless string copies) * simplify gossip id generator (avoid pointless string copies) * avoid redundant syncnet ENR updates * simplify topic validation (allow only validated topics)	2021-10-21 15:09:19 +02:00
Jacek Sieka	9cf32c3748	clean up sync subcommittee handling * `SyncCommitteeIndex` -> `SyncSubcommitteeIndex` * `syncCommitteePeriod` -> `sync_committee_period` (spec spelling) * tighten period comparisons * fix assert when validating committee message with non-altair state in REST api	2021-10-20 22:59:13 +03:00
Jacek Sieka	bf6ad41d7d	add drop and sync committee metrics * use storeBlock for processing API blocks * avoid double block dump * count all gossip metrics at the same spot * simplify block broadcast	2021-10-20 18:20:12 +03:00
Jacek Sieka	c247702ebc	normalize subnet logging * call it subnet id everywhere * log aggregate sent from VC * log subnet with aggregate	2021-10-20 15:06:44 +03:00
Jacek Sieka	df3fc9525f	import cleanup (#2997 ) * import cleanup ...and remove some unused types * add random imports * more imports	2021-10-19 16:09:26 +02:00
Jacek Sieka	c40cc6cec1	clean up fork enum and field names * single naming strategy * simplify some fork code * simplify forked block production	2021-10-19 11:06:38 +03:00
Jacek Sieka	4f7a8cf79d	register vc duties with subnet tracker (#2949 ) * register vc duties with subnet tracker * fix activation logging during startup * cache slot signature to avoid duplicate signature work * schedule aggregation duties one slot at a time to avoid CPU spike at each epoch * lower aggregation subnet pre-subscription time to 4 slots (lowers bandwidth and CPU usage) * update stability subnets in ENR on startup * log gossip state * perform gossip subscriptions just before the next slot starts * document stuff * add random include * don't overwrite subscription state when not subscribed * log target gossip state * updating gossip status once is enough * add test * remove syncQueueLen - this one is not updated at the end of the sync and may cause gossip to disconnect itself completely - use a simple head distance instead * fix gossip disconnection - if in hysteresis, node.gossipState will be set to disabled even though we don't disable topic subscriptions * fix extra duty registration call	2021-10-18 11:11:44 +02:00
tersec	2eb9a608a4	add payloadId; add merge vector test script; remove consensusValidated (#2982 )	2021-10-13 16:08:50 +02:00
tersec	2ad1b7366a	update 62 spec URLs to v1.1.2 (#2979 )	2021-10-12 10:17:37 +00:00
Etan Kissling	9ee134324b	allow `withXxx` to access fork-specific fields (#2943 ) So far, `withState` and `withBlck` templates could only be used to have convenience access to fork-agnostic BeaconState and BeaconBlock fields. This patch: - injects an additional `stateFork` constant that allows to use `when` expressions to also access Altair and Merge-specific fields. - introduces a `withStateAndBlck` template to support operating on both a `BeaconState` and `BeaconBlock` at a time. - makes sync committee related functions Merge aware. - changes a couple if-else trees for forks into case statements so that forgotten future forks are promoted to compile-time errors.	2021-10-06 20:05:06 +03:00
Eugene Kabanov	65257b82f8	Validator key management API (#2755 ) Implements https://github.com/ethereum/beacon-APIs/pull/151	2021-10-04 22:08:31 +03:00
Etan Kissling	b217150f1d	use forked `getAttestationsForBlock` everywhere (#2937 ) There are a number of locations in the code that get attestations on a forked beacon state. For attestation pools test, a convenience wrapper was available to reduce clutter. This patch integrates that wrapper into the core component so that it can also take advantage of the wrapper.	2021-10-01 01:29:32 +00:00
Etan Kissling	ba84a55699	fix `makeBeaconBlockForHeadAndSlot` for merge (#2934 ) This fixes an if-else structure that was not aware of the merge phase in `makeBeaconBlockForHeadAndSlot`, avoiding a potential crash.	2021-09-30 16:27:17 +00:00
Etan Kissling	2e9fa87f8b	use `SyncAggregate.init()` everywhere (#2932 ) The initialization of a `SyncAggregate` to its default value is not very intuitive. There is an `init` function in `sync_committee_msg_pool` that provides a convenience wrapper. This patch exports that initializer so that the rest of the code base can also take advantage of it.	2021-09-30 13:56:07 +00:00
Etan Kissling	e243ba2c0b	revise `makeBeaconBlock` overloads (#2879 ) The phase0 and altair overloads of `makeBeaconBlock` slightly differ in their signatures which makes using them unnecessarily verbose. - A placeholder `sync_aggregate` argument similar to `executionPayload` is added to the phase0 overload to match the altair signature. - A wrapper operating on `ForkedHashedBeaconState` is introduced.	2021-09-29 12:10:44 +00:00
tersec	2b2846b468	implement forked merge state/block support (#2890 ) * implement forked state/block support * merge support for containsOrphan; import cleanup; 80-column lines * add merge block header operations and slot sanity fixture * add epoch state transition tests; implement is_valid_gas_limit(), is_merge_block(), is_execution_enabled(), and compute_timestamp_at_slot() * implement process_execution_payload() and add merge deposit operations tests * add merge block sanity tests * add merge case to syncCommitteeParticipants * v1.1.0-beta.5 updates * reduce getTestStates-based memory usage; don't try to REST-serialize ExecutionPayload transactions without underlying support * add execution payload tests; switch var to let in tests/official/	2021-09-27 14:22:58 +00:00
Eugene Kabanov	0c635334a2	Sync committee related REST API implementation. (#2856 )	2021-09-24 01:13:25 +03:00

1 2 3

101 Commits