nimbus-eth2

Commit Graph

Author	SHA1	Message	Date
Zahary Karadjov	4c01b77773	The remote Keymanager API was not using the URLs indicated in the spec	2022-03-03 11:10:00 +02:00
Etan Kissling	3ffab01b07	Refactor and optimize sync logs. (#3451 ) * Refactor and optimize logs. * Introduce shortLog(SyncRequest). * Address review comment. * make sync queue logs more consistent Adds a few minor logging improvements: - Fixes a typo (`was happened` -> `has happened`) - Avoids passing `reset_slot` argument to log statement multiple times - Uses same `rewind_to_slot` label when logging in both sync directions - Consistent rewind point logging Co-authored-by: cheatfate <eugene.kabanov@status.im>	2022-03-03 09:05:33 +01:00
Etan Kissling	33d084192f	consistent style in `light_client_sync.nim` (#3450 ) Uses consistent formatting in `light_client_sync.nim`, always refers to fork-dependent light client objects in full qualified notation, moves `get_safety_threshold` helper function to same location as in the spec.	2022-03-02 11:44:42 +01:00
tersec	f0ada15dac	automated CL spec ref URL updates from v1.1.9 to v1.1.10 (#3455 )	2022-03-02 10:00:21 +00:00
tersec	7b3d9d4e14	use v1.1.10 CL spec test vectors (#3454 )	2022-03-02 07:26:17 +00:00
Jacek Sieka	12ed537f75	catch wrong-fork-blocks earlier (#3444 ) Can't apply a phase0 block to a later phase state and vice versa. Since instantiation has been a topic, pre/post c file size: ``` 424K @mspec@sstate_transition.nim.c 892K @mspec@sstate_transition_block.nim.c ``` ``` 288K @mspec@sstate_transition.nim.c 880K @mspec@sstate_transition_block.nim.c ```	2022-02-28 12:58:34 +00:00
Etan Kissling	961c02fcba	document `GeneralizedIndex` constants (#3443 ) Updates the spec references for `GeneralizedIndex` constants used by the light client sync protocol, and adds a short explanation how they are derived and which SSZ fields they refer to.	2022-02-28 13:34:57 +01:00
tersec	ef9767eb7a	implement --jwt-secret and HS256 JWT/JWS signing for engine API alpha.7 (#3440 )	2022-02-27 16:55:02 +00:00
Jacek Sieka	40a4c01086	chaindag: don't keep backfill block table in memory (#3429 ) This PR names and documents the concept of the archive: a range of slots for which we have degraded functionality in terms of historical access - in particular: * we don't support rewinding to states in this range * we don't keep an in-memory representation of the block dag The archive de-facto exists in a trusted-node-synced node, but this PR gives it a name and drops the in-memory digest index. In order to satisfy `GetBlocksByRange` requests, we ensure that we have blocks for the entire archive period via backfill. Future versions may relax this further, adding a "pre-archive" period that is fully pruned. During by-slot searches in the archive (both for libp2p and rest requests), an extra database lookup is used to convert the given `slot` to a `root` - future versions will avoid this using era files which natively are indexed by `slot`. That said, the lookup is quite fast compared to the actual block loading given how trivial the table is - it's hard to measure, even. A collateral benefit of this PR is that checkpoint-synced nodes will see 100-200MB memory usage savings, thanks to the dropped in-memory cache - future pruning work will bring this benefit to full nodes as well. * document chaindag storage architecture and assumptions * look up parent using block id instead of full block in clearance (future-proofing the code against a future in which blocks come from era files) * simplify finalized block init, always writing the backfill portion to db at startup (to ensure lookups work as expected) * preallocate some extra memory for finalized blocks, to avoid immediate realloc	2022-02-26 19:16:19 +01:00
Jacek Sieka	92e7e288e7	Ignore seen aggregates (#3439 ) https://github.com/ethereum/consensus-specs/pull/2225 removed an ignore rule that would filter out duplicate aggregates from gossip publishing - however, this causes increased bandwidth and CPU usage as discussed in https://github.com/ethereum/consensus-specs/issues/2183 - the intent is to revert the removal and reinstate the rule. This PR implements ignore filtering which cuts down on CPU usage (fewer aggregates to validate) and bandwidth usage (less fanout of duplicates) - as #2225 points out, this may lead to a small increase in IHAVE messages.	2022-02-25 17:15:39 +01:00
Tanguy	1bfbcc48b6	Bump libp2p (#3438 )	2022-02-25 13:22:48 +01:00
zah	c29aa9d846	Support for Gnosis Chain (#3415 ) * Support for Gnosis Chain `make gnosis-chain-build` will build the Nimbus gnosis chain binary, stored in `build/nimbus_beacon_node_for_gnosis_chain`. `make gnosis-chain` will connect to the network. Other changes: * Restore compilation with -d:has_genesis_detection * Removed Makefile target related to testnet0 and testnet1 * Added more debug logging for failed peer handshakes * Report misconfigured builds which try to embed network metadata that is incompatible with the currently selected const preset. * Don't bundle network metadata in minimal builds, as they are not compatible	2022-02-25 10:22:44 +02:00
Ștefan Talpalaru	ebba093362	Nim-1.6 compatibility (#3434 )	2022-02-25 10:19:12 +02:00
tersec	fef71a78a0	bump nim-web3 for random -> prevRandao rename (#3435 )	2022-02-24 18:01:48 +01:00
zah	9c1ff78f84	Fix a reward calculation bug affecting Prater epoch 64781 (#3428 ) To calculate the deltas correctly, the `process_inactivity_updates` function must be called before the rewards and penalties processing code in order to update the `inactivity_scores` field in the state. This would have required duplicating more logic from the spec in the ncli modules, so I've decided to pay the price of introducing a run-time copy of the state at each epoch which eliminates the need to duplicate logic (both for this fix and the previous one). Other changes: * Fixes for the read-only mode of the `BeaconChainDb` * Fix an uint64 underflow in the debug output procedure for printing balance deltas * Allow Bellatrix states in the reward computation helpers	2022-02-22 14:14:17 +02:00
tersec	7de3f00f35	generic putCorruptState; {Merge=>Bellatrix}BeaconStateNoImmutableValidators (#3427 )	2022-02-21 12:55:56 +01:00
Jacek Sieka	adfe655b16	db: make block loading generic (#3413 ) Streamline lookup with Forky and BeaconBlockFork (then we can do the same for era) We use type to avoid conditionals, as fork is often already known at a "higher" level. * load blockid before loading block by root - this is needed to map root to slot and will eventually be done via block summary table for "old" blocks Co-authored-by: tersec <tersec@users.noreply.github.com>	2022-02-21 09:48:02 +01:00
tersec	84588b34da	var => let in specs/ and tests/ (#3425 )	2022-02-20 20:13:06 +00:00
Etan Kissling	9790c4958b	converter function for reducing blocks to headers (#3410 ) This introduces a function to convert `SignedBeaconBlock` to just their `BeaconBlockHeader` and updates the usages for reduced code duplication.	2022-02-18 21:35:52 +01:00
Jacek Sieka	a88427bd39	ncli_db: more readonly support (#3411 ) Update several `ncli_db` commands to run in readOnly mode, allowing them to be used with a running instance - in particular era export. * export all eras by default * skip already-exported eras	2022-02-18 07:37:44 +01:00
tersec	79761c78a4	proc -> func, mainly in spec/state transition and adjacent modules (#3405 )	2022-02-17 11:53:55 +00:00
Jacek Sieka	87e98b9e54	Revert "bump submodules (#3366 )" (#3406 ) This reverts commit `6e1ad080e8`.	2022-02-17 12:50:37 +01:00
tersec	5eecb9a21f	rename no{R=>r}eturn, no{I=>i}init, short{l=>L}og, E{T=>t}h2Node, Beacon{c=>C}hainDB (#3403 )	2022-02-16 23:24:44 +01:00
Jacek Sieka	7db5647a6e	clean up / document init (#3387 ) * clean up / document init * drop `immutable_validators` data (pre-altair) * document versions where data is first added * avoid needlessly loading genesis block data on startup * add a few more internal database consistency checks * remove duplicate state root lookup on state load * comment	2022-02-16 16:44:04 +01:00
Ștefan Talpalaru	6e1ad080e8	bump submodules (#3366 ) and add Nim-1.6 compatibility	2022-02-16 13:41:50 +02:00
Eugene Kabanov	3a80b9951c	VC: Fix forks handling. (#3389 ) * Trying to debug the finalization issue. * Add debug logs to understand signature issue. * Remove all the debugging helpers. * Initial commit. * Address review comments. * Remove unneeded checks for empty fork schedule. * Fix bellatrix ExecutionAddress serialization/deserialization procedures.	2022-02-16 12:31:23 +01:00
tersec	254e0fe2e2	avoid nimZeroMem and stack usage in is_merge_transition_complete and is_merge_transition_block (#3399 )	2022-02-16 07:16:01 +00:00
Dustin Brody	e1dbcfc02e	add --safe-slots-to-import-optimistically option	2022-02-15 23:08:49 +02:00
Zahary Karadjov	8e0330050d	Version 1.7.0	2022-02-15 22:55:29 +02:00
Zahary Karadjov	c672628be8	Hotfix: Fix a race condition leading to a busy loop preventing progress in Eth1 syncing	2022-02-15 22:45:55 +02:00
Ștefan Talpalaru	496d0266ec	bump nim-metrics (#3392 )	2022-02-14 21:57:06 +01:00
tersec	2275fad335	only show setting up doppelganger detection log message if enabled (#3391 ) * only show setting up doppelganger detection log message if enabled * correct indentation	2022-02-14 19:24:38 +00:00
Etan Kissling	6849536742	fix `firstSlot` computation for backfill sync When initializing backfill sync, the implementation intends to start at the first unknown slot (`1` before tail). However, an incorrect variable is passed, and backfill sync actually starts at the tail slot instead. This patch corrects this by passing the intended variable. The problem was introduced with the original backfill implementation at #3263.	2022-02-14 18:53:38 +02:00
Zahary Karadjov	922a0d264c	Add CORS support for the REST services The added options work in opt-in fashion. If they are not specified, the server will respond to all requests as if the CORS specification doesn't exist. This will result in errors in CORS-enabled clients. Please note that future versions may support more than one allowed origin. The option names will stay the same, but the user will be able to repeat them on the command line (similar to other options such as --web3-url). To be documented in the guide in a separate PR.	2022-02-14 18:52:17 +02:00
Etan Kissling	d1f97e209a	remove unused `sleepTime` from `SyncManager` (#3384 ) The `SyncManager` has a leftover optional `sleepTime` parameter in its constructor that used to configure the sync loop polling rate. This parameter was replaced with a constant in #1602 and is no longer functional. This patch removes the `sleepTime` leftovers.	2022-02-14 12:05:01 +01:00
Etan Kissling	a28900c348	fix slot number display during sync (#3383 ) #3304 introduced a regression to the sync status string displayed in the status bar; during the main forward sync, the current slot is no longer reported and always displays as `0`. This patch corrects the computation to accurately report the current slot once more.	2022-02-14 12:04:04 +01:00
tersec	873a8ec1e6	use isZeroMemory for Eth2Digest comparisons (#3386 ) * use isZeroMemory for Eth2Digest comparisons * use Eth2Digest.isZero abstraction	2022-02-14 05:26:19 +00:00
Eugene Kabanov	1a0bcf0b02	Fix #3267 (#3367 ) * Initial commit. * One more fix. * Trying to debug the finalization issue. * Add debug logs to understand signature issue. * Restore hash_tree_root calculation. * Remove all the debugging helpers. * Add `slot` check. * Address review comment.	2022-02-13 16:21:55 +01:00
Etan Kissling	15fc7534cf	remove unused `maxStatusAge` from `SyncManager` (#3382 ) The `SyncManager` has a leftover optional `maxStatusAge` parameter in its constructor that used to configure the libp2p `Status` polling rate. This parameter was replaced with a constant in #1827 and is no longer functional. This patch removes the `maxStatusAge` leftovers.	2022-02-13 16:17:13 +01:00
Jacek Sieka	1f89b7f7b9	speed up trusted node backfill (#3371 ) With these changes, we can backfill about 400-500 slots/sec, which means a full backfill of mainnet takes about 2-3h. However, the CPU is not saturated - neither in server nor in client meaning that somewhere, there's an artificial inefficiency in the communication - 16 parallel downloads should saturate the CPU. One plausible cause would be "too many async event loop iterations" per block request, which would introduce multiple "sleep-like" delays along the way. I can push the speed up to 800 slots/sec by increasing parallel downloads even further, but going after the root cause of the slowness would be better. * avoid some unnecessary block copies * double parallel requests	2022-02-12 12:09:59 +01:00
Jacek Sieka	40fe8f5336	fix missing backfill when restarting node When node is restarted before backfill has started but after some blocks have finalized with forward sync, we would not start the backfill. * also clean up one last `SomeSome`	2022-02-11 23:08:50 +02:00
Jacek Sieka	1760f4d7a7	move wallet/deposit commands to separate files (#3372 ) These commands have little to do with the "normal" beacon node operation - ergo, they deserve to live in their own module. * clean up imports/exports	2022-02-11 21:40:49 +01:00
Eugene Kabanov	b4eb150b9a	Revert restAccept workaround. (#3369 ) Bump fixed version of nim-presto.	2022-02-11 12:01:45 +01:00
Ștefan Talpalaru	70b38e37e6	Nim GC metrics for the main thread (#3108 ) * Nim GC metrics for the main thread	2022-02-08 20:19:21 +01:00
Eugene Kabanov	40c77e5928	Remote KeyManager API and number of fixes/tests for KeyManager API (#3360 ) * Initial commit. * Fix current test suite. * Fix keymanager api test. * Fix wss_sim. * Add more keystore_management tests. * Recover deleted isEmptyDir(). * Add `HttpHostUri` distinct type. Move keymanager calls away from rest_beacon_calls to rest_keymanager_calls. Add REST serialization of RemoteKeystore and Keystore object. Add tests for Remote Keystore management API. Add tests for Keystore management API (Add keystore). Fix serialization issues. * Fix test to use HttpHostUri instead of Uri. * Add links to specification in comments. * Remove debugging echoes.	2022-02-07 22:36:09 +02:00
Jacek Sieka	c7abc97545	harden and speed up block sync (#3358 ) * harden and speed up block sync The `GetBlockBy` server implementation currently reads SSZ bytes from database, deserializes them into a Nim object then serializes them right back to SSZ - here, we eliminate the deser/ser steps and send the bytes straight to the network. Unfortunately, the snappy recoding must still be done because of differences in framing. Also, the quota system makes one giant request for quota right before sending all blocks - this means that a 1024 block request will be "paused" for a long time, then all blocks will be sent at once causing a spike in database reads which potentially will see the reading client time out before any block is sent. Finally, on the reading side we make several copies of blocks as they travel through various queues - this was not noticeable before but becomes a problem in two cases: bellatrix blocks are up to 10mb (instead of .. 30-40kb) and when backfilling, we process a lot more of them a lot faster. fix status comparisons for nodes syncing from genesis (#3327 was a bit too hard) * don't hit database at all for post-altair slots in GetBlock v1 requests	2022-02-07 19:20:10 +02:00
tersec	bf3ef987e4	deactivate doppelganger protection during genesis (#3362 ) * deactivate Doppelganger Protection during genesis * also don't actually flag supposed-doppelgangers (because they're before broadcastStartEpoch) on GENESIS_SLOT start	2022-02-07 07:12:36 +02:00
Jacek Sieka	6f10e651ff	rest: fix ssz preference string (#3357 )	2022-02-04 15:26:27 +02:00
tersec	e0fb5d95a6	remove --subscribe-all{att,sync}nets (#3359 )	2022-02-04 12:34:03 +00:00
tersec	02349b4181	update to engine API alpha.6 (#3351 )	2022-02-04 12:12:19 +00:00
tersec	d358299875	fork choice proposer boosting support (#3349 ) * fork choice proposer boosting support * detect nodeDelta underflow/overflow	2022-02-04 12:59:40 +01:00
Jacek Sieka	a50e21e229	fix doppelganger detection logging * update action tracker on dependent-root-changing reorg (instead of epoch change) * don't try to log duties while syncing - we're not tracking actions yet * fix slot used for doppelganger loss detection	2022-02-04 12:25:32 +01:00
Jacek Sieka	49282e9477	val_mon: register locally produced aggregates (#3352 ) These use a separate flow, and were previously only registered from the network * don't log successes in totals mode (TMI) * remove `attestation-sent` event which is unused	2022-02-04 08:33:20 +01:00
Zahary Karadjov	215caa21ae	Eth1 monitor fixes * Fix a resource leak introduced in https://github.com/status-im/nimbus-eth2/pull/3279 * Don't restart the Eth1 syncing progress from scratch in case of monitor failures during Eth2 syncing. * Switch to the primary operator as soon as it is back online. * Log the web3 credentials in fewer places Other changes: The 'web3 test' command has been enhanced to obtain and print more data regarding the selected provider.	2022-02-03 14:01:55 +02:00
tersec	8e6a920bf4	rename MERGE_FORK_EPOCH to BELLATRIX_FORK_EPOCH (#3350 ) * rename MERGE_FORK_EPOCH to BELLATRIX_FORK_EPOCH * fix REST test rules	2022-02-02 14:06:55 +01:00
Jacek Sieka	ff4f2a6b6c	better log on finalized slot failure	2022-02-01 21:23:18 +01:00
Tanguy	bcd7b4598c	Tune peering (#3348 ) - Request metadata_v2 (altair) by default instead of the v1 - Change the metadata pinger to a 3 failure-then-kick, instead of being time based - Update kicker scorer to take into account topics which we're not subscribed to, to be sure that we will be able to publish correctly - Add some metrics to give "fanout" health (in the same spirit of mesh health)	2022-02-01 18:20:55 +01:00
tersec	0c814f49ee	rename sync_{committee_,}aggregate and execute_payload -> notify_new_payload (#3347 )	2022-02-01 07:31:53 +00:00
EmilIvanichkovv	336403d18b	Refactor `handleValidatorExitCommand` Make `validator exit command` work both with `JSON-RPC` and `REST` APIs Fix problem with specifying rest-url using `localhost` Change back exit error messages in `state_transition_block`	2022-02-01 01:24:05 +02:00
Jacek Sieka	3df9ffca9f	val-mon: remove redundant `_total` suffix from counters It turns out nim-metrics adds this suffix on its own - it also turns out some of the names are non-conventional and need follow-up.	2022-01-31 18:51:24 +02:00
tersec	c9aa1bee01	spec URL updates (#3342 )	2022-01-31 09:56:59 +00:00
Jacek Sieka	ad327a8769	Fix counters in validator monitor totals mode (#3332 ) The current counters set gauges etc to the value of the _last_ validator to be processed - as the name of the feature implies, we should be using sums instead. * fix missing beacon state metrics on startup, pre-first-head-selection * fix epoch metrics not being updated on cross-epoch reorg	2022-01-31 08:36:29 +01:00
Jacek Sieka	d583e8e4ac	Store finalized block roots in database (3s startup) (#3320 ) * Store finalized block roots in database (3s startup) When the chain has finalized a checkpoint, the history from that point onwards becomes linear - this is exploited in `.era` files to allow constant-time by-slot lookups. In the database, we can do the same by storing finalized block roots in a simple sparse table indexed by slot, bringing the two representations closer to each other in terms of conceptual layout and performance. Doing so has a number of interesting effects: * mainnet startup time is improved 3-5x (3s on my laptop) * the _first_ startup might take slightly longer as the new index is being built - ~10s on the same laptop * we no longer rely on the beacon block summaries to load the full dag - this is a lot faster because we no longer have to look up each block by parent root * a collateral benefit is that we no longer need to load the full summaries table into memory - we get the RSS benefits of #3164 without the CPU hit. Other random stuff: * simplify forky block generics * fix withManyWrites multiple evaluation * fix validator key cache not being updated properly in chaindag read-only mode * drop pre-altair summaries from `kvstore` * recreate missing summaries from altair+ blocks as well (in case database has lost some to an involuntary restart) * print database startup timings in chaindag load log * avoid allocating superfluous state at startup * use a recursive sql query to load the summaries of the unfinalized blocks	2022-01-30 18:51:04 +02:00
Emil	0051af430b	Put `application/json` as a higher preference than `application/octet-stream`	2022-01-30 18:50:14 +02:00
tersec	29e2169585	phase 0 & altair beacon chain and altair validator spec URL updates (#3339 )	2022-01-29 13:53:31 +00:00
tersec	89ffa8a1a7	spec URL & copyright year update (#3338 )	2022-01-29 01:05:39 +00:00
tersec	60bf5b8bf4	use v1.1.9 test vectors (#3337 )	2022-01-28 22:47:48 +00:00
tersec	95fee10328	clean up hashed rollback proc declarations (#3333 ) * clean up hashed rollback proc declarations * use generic hashed rollback proc type	2022-01-28 14:24:37 +00:00
cheatfate	1287a20b13	Use HTTP status codes instead of status in body.	2022-01-28 15:36:27 +02:00
Jacek Sieka	e264276b36	keep unviables in quarantine (#3331 ) they remain unviable even after a reorg	2022-01-28 11:59:55 +01:00
Zahary Karadjov	49b7daa39d	[ncli_db] bugfix: take into account finalization delay in reward calc post Altair This fixes a problem affecting Prater's epoch 64444.	2022-01-28 12:03:23 +02:00
tersec	dcb671617c	add/support TERMINAL_BLOCK_HASH_ACTIVATION_EPOCH (#3303 )	2022-01-27 19:52:08 +00:00
Jacek Sieka	84b6ad871d	harden status message handling Additional sanity checking of the status message exchanged during a fresh connection: * check that head and finalized make sense, slot-wise * verify that finalized root lies on the canonical chain, when possible * re-check these things for every status message during sync	2022-01-27 18:46:47 +02:00
Eugene Kabanov	aa27baacf5	Fix 408 Timeout error returned by REST server. (#3301 ) * Disable REST server timeouts. * Add options to CLI to tune REST server parameters.	2022-01-27 18:41:05 +02:00
tersec	7c51da037f	add block gossip validation condition (#3325 )	2022-01-26 17:22:06 +00:00
tersec	2b4a960270	rename On{Merge,Bellatrix}BlockAdded and Rollback{Merge,Bellatrix}HashedProc (#3321 )	2022-01-26 13:21:29 +01:00
Jacek Sieka	f70aceef37	Harden handling of unviable forks (#3312 ) * Harden handling of unviable forks In our current handling of unviable forks, we allow peers to send us blocks that come from a different fork - this is not necessarily an error as it can happen naturally, but it does open up the client to a case where the same unviable fork keeps getting requested - rather than allowing this to happen, we'll now give these peers a small negative score - if it keeps happening, we'll disconnect them. * keep track of unviable forks in quarantine, to avoid filling it with known junk * collect peer scores in single module * descore peers when they send unviable blocks during sync * don't give score for duplicate blocks * increase quarantine size to a level that allows finality to happen under optimal conditions - this helps avoid downloading the same blocks over and over in case of an unviable fork * increase initial score for new peers to make room for one more failure before disconnection * log and score invalid/unviable blocks in requestmanager too * avoid ChainDAG dependency in quarantine * reject gossip blocks with unviable parent * continue processing unviable sync blocks in order to build unviable dag * docs * Update beacon_chain/consensus_object_pools/block_pools_types.nim * add unviable queue test	2022-01-26 13:20:08 +01:00
tersec	bd0a3a9b10	rearrange MEV code (#3319 )	2022-01-25 19:43:28 +00:00
Emil	efbd939108	Make `handleValidatorExitCommand` work with `REST API`	2022-01-25 14:00:29 +02:00
Jacek Sieka	d076e1a11b	ncli_db: import states and blocks from era file (#3313 )	2022-01-25 09:28:26 +01:00
tersec	00a347457a	dynamic sync committee subscriptions (#3308 ) * dynamic sync committee subscriptions * fast-path trivial case rather than rely on RNG with probability 1 outcome Co-authored-by: zah <zahary@gmail.com> * use func instead of template; avoid calling async function unnecessarily * avoid unnecessary sync committee topic computation; use correct epoch lookahead; enforce exception/effect tracking * don't over-optimistically update ENR syncnets; non-looping version of nearSyncCommitteePeriod * allow separately setting --allow-all-{sub,att,sync}nets * remove unnecessary async Co-authored-by: zah <zahary@gmail.com>	2022-01-24 20:40:59 +00:00
tersec	062275461c	add flashbots (milestone 1) consensus beacon block types (#3314 ) * add flashbots (milestone 1) consensus beacon block types * remove MEV types from main bellatrix spec module	2022-01-24 20:15:22 +00:00
tersec	351c2fd48a	rename mergeData to bellatrixData and mergeFork to bellatrixFork (#3315 )	2022-01-24 16:23:13 +00:00
Jacek Sieka	61342c2449	limit by-root requests to non-finalized blocks (#3293 ) * limit by-root requests to non-finalized blocks Presently, we keep a mapping from block root to `BlockRef` in memory - this has simplified reasoning about the dag, but is not sustainable with the chain growing. We can distinguish between two cases where by-root access is useful: * unfinalized blocks - this is where the beacon chain is operating generally, by validating incoming data as interesting for future fork choice decisions - bounded by the length of the unfinalized period * finalized blocks - historical access in the REST API etc - no bounds, really In this PR, we limit the by-root block index to the first use case: finalized chain data can more efficiently be addressed by slot number. Future work includes: * limiting the `BlockRef` horizon in general - each instance is 40 bytes+overhead which adds up - this needs further refactoring to deal with the tail vs state problem * persisting the finalized slot-to-hash index - this one also keeps growing unbounded (albeit slowly) Anyway, this PR easily shaves ~128mb of memory usage at the time of writing. * No longer honor `BeaconBlocksByRoot` requests outside of the non-finalized period - previously, Nimbus would generously return any block through this libp2p request - per the spec, finalized blocks should be fetched via `BeaconBlocksByRange` instead. 
* return `Opt[BlockRef]` instead of `nil` when blocks can't be found - this becomes a lot more common now and thus deserves more attention * `dag.blocks` -> `dag.forkBlocks` - this index only carries unfinalized blocks from now - `finalizedBlocks` covers the other `BlockRef` instances * in backfill, verify that the last backfilled block leads back to genesis, or panic * add backfill timings to log * fix missing check that `BlockRef` block can be fetched with `getForkedBlock` reliably * shortcut doppelganger check when feature is not enabled * in REST/JSON-RPC, fetch blocks without involving `BlockRef` * fix dag.blocks ref	2022-01-21 13:33:16 +02:00
tersec	1a37cae329	allow Eth1 monitor to run without genesis_deposit_contract_snapshot.ssz (#3279 )	2022-01-21 12:59:09 +02:00
Eugene Kabanov	0ea6dfa517	Fix current slot value and finishing progress for backfilling. (#3304 )	2022-01-21 10:35:54 +01:00
Dustin Brody	9699858422	rename MERGE_FORK_VERSION to BELLATRIX_FORK_VERSION	2022-01-20 19:33:05 +02:00
Mamy Ratsimbazafy	9e9ccf4a1f	Slashing prot interchange tests v5.2.1 (#3277 ) * initial support for minification and new interchange tests. Removal of v1 and v1 migration. * Synthetic attestations: SQLite3 requires one statement/query per prepared statement * Fix DB import interrupted if no attestation was found * Skip test relying on undocumented test behavior (https://github.com/eth-clients/slashing-protection-interchange-tests/pull/12#issuecomment-1011158701) * Skip test relying on unclear minification behavior: creating an invalid minified attestation with source > target or setting target = max(source, target) * remove DB v1 and update submodule * Apply suggestions from code review Co-authored-by: Jacek Sieka <jacek@status.im> Co-authored-by: Jacek Sieka <jacek@status.im>	2022-01-20 17:14:06 +01:00
Jacek Sieka	1df549143e	rest: fix GraffitiBytes serialization (#3299 )	2022-01-20 13:31:55 +01:00
Jacek Sieka	570379d3d9	Backfiller (#3263 ) Backfilling is the process of downloading historical blocks via P2P that are required to fulfill `GetBlocksByRange` duties - this happens during both trusted node and finalized checkpoint syncs. In particular, backfilling happens after syncing to head, such that attestation work can start as soon as possible. * Fix SyncQueue initialization procedure. Remove usage of `awaitne`. Add cancellation support. Remove unneeded `sleepAsync()` if peer's head is older than needed. Add `direction` field to all logs. Fix syncmanager wedge issue. Add proper resource cleaning procedure on backward sync finish. Co-authored-by: cheatfate <eugene.kabanov@status.im>	2022-01-20 08:25:45 +01:00
Jacek Sieka	e4939538cd	fix non-totalized validator metric (#3297 ) ...when running on host with many validators, this explodes	2022-01-19 15:20:48 +01:00
tersec	2f635d3337	rename *_{MERGE => BELLATRIX} constant names (#3296 )	2022-01-18 16:31:05 +00:00
tersec	9c0c9c98ce	complete switch to beacon_chain/specs/datatypes/bellatrix (#3295 )	2022-01-18 13:36:52 +00:00
Zahary Karadjov	47f1f7ff1a	More efficient reward data persistence; Address review comments The new format is based on compressed CSV files in two channels: * Detailed per-epoch data * Aggregated "daily" summaries The use of append-only CSV files significantly speeds up epoch processing during data generation. The use of compression results in smaller storage requirements overall. The use of the aggregated files has a very minor cost in both CPU and storage, but leads to near interactive speed for report generation. Other changes: - Implemented support for graceful shut downs to avoid corrupting the saved files. - Fixed a memory leak caused by lacking `StateCache` clean up on each iteration. - Addressed review comments - Moved the rewards and penalties calculation code in a separate module Required invasive changes to existing modules: - The `data` field of the `KeyedBlockRef` type is made public to be used by the validator rewards monitor's Chain DAG update procedure. - The `getForkedBlock` procedure from the `blockchain_dag.nim` module is made public to be used by the validator rewards monitor's Chain DAG update procedure.	2022-01-18 01:56:56 +02:00
Zahary Karadjov	29aad0241b	Precise per-component ETH-denominated rewards tracking This is an alternative take on https://github.com/status-im/nimbus-eth2/pull/3107 that aims for more minimal interventions in the spec modules at the expense of duplicating more of the spec logic in ncli_db.	2022-01-18 01:56:56 +02:00
Jacek Sieka	4e2d2ff7f4	rest: fix invalid type `RestSyncCommitteeSubscription` Using the wrong type here causes requests to fail due to the overly zealous parameter validation - the failure is harmless in the current duty subscription model, but would have caused more serious failures down the line.	2022-01-17 22:33:24 +02:00
Jacek Sieka	6bf3330d73	fix ugly delay logging	2022-01-17 20:12:36 +01:00
Jacek Sieka	ff5b91cd58	Revert "Don't use GC memory for the initial beacon block summaries loading" (#3292 ) This reverts commit `7e2fc2b726`.	2022-01-17 12:07:49 +00:00
Jacek Sieka	836f6984bb	move `state_transition` to `Result` (#3284 ) * better error messages in api * avoid `BlockData` copies when replaying blocks	2022-01-17 12:19:58 +01:00
Jacek Sieka	68247f81b3	Trusted node sync (#3209 ) * Trusted node sync Trusted node sync, aka checkpoint sync, allows syncing the chain from a trusted node instead of relying on a full sync from genesis. Features include: * sync from any slot, including the latest finalized slot * backfill blocks either from the REST api (default) or p2p (#3263) Future improvements: * top up blocks between head in database and some other node - this makes for an efficient backup tool * recreate historical state to enable historical queries * fixes * load genesis from network metadata * check checkpoint block root against state * fix invalid block root in rest json decoding * odds and ends * retry looking for epoch-boundary checkpoint blocks	2022-01-17 10:27:08 +01:00
Zahary Karadjov	ebde027262	Re-enable the HTTP support in Eth1Monitor This reverts commit `6fddff524c`.	2022-01-16 18:26:21 +02:00
Zahary Karadjov	7e2fc2b726	Don't use GC memory for the initial beacon block summaries loading	2022-01-15 10:15:17 +02:00
Jacek Sieka	167068e739	valmon: fix `--validator-monitor-totals` feature (#3286 ) Else log size explodes for machines with many validators	2022-01-14 15:57:46 +01:00
Zahary Karadjov	bef13b6cce	Version 1.6.0	2022-01-14 13:52:06 +02:00
tersec	d878948ed2	update sync committee gossip validation comments; spec URL updates (#3280 )	2022-01-13 13:46:08 +00:00
Jacek Sieka	d57c2dc4e5	use tail block as sync pivot (#3276 ) When syncing, we show how much of the sync has completed - with checkpoint sync, the syncing does not always go from slot 0 to head, but rather can start in the middle. To show a consistent `%` between restarts, we introduce the concept of a pivot point, such that if I sync 10% of the chain, then restart the client, it picks up at 10% (instead of counting from 0). What it looks like: ``` INF ... sync="01d12h41m (15.96%) 13.5158slots/s (QDDQDDQQDP:339018)" ... ```	2022-01-13 10:37:53 +01:00
Jacek Sieka	e9486f5e5b	state_sim: clean up attestation production (#3274 ) * use same naming as everywhere * avoid iterator bug that leads to state copy	2022-01-12 21:42:03 +01:00
tersec	14aab2c13f	update 10 modules from using merge to bellatrix (#3272 )	2022-01-12 15:50:30 +01:00
Jacek Sieka	805e85e1ff	time: spring cleaning (#3262 ) Time in the beacon chain is expressed relative to the genesis time - this PR creates a `beacon_time` module that collects helpers and utilities for dealing with the time units - the new module does not deal with actual wall time (that remains in `beacon_clock`). Collecting the time related stuff in one place makes it easier to find, avoids some circular imports and allows more easily identifying the code that actually needs wall time to operate. * move genesis-time-related functionality into `spec/beacon_time` * avoid using `chronos.Duration` for time differences - it does not support negative values (such as when something happens earlier than it should) * saturate conversions between `FAR_FUTURE_XXX`, so as to avoid overflows * fix delay reporting in validator client so it uses the expected deadline of the slot, not "closest wall slot" * simplify looping over the slots of an epoch * `compute_start_slot_at_epoch` -> `start_slot` * `compute_epoch_at_slot` -> `epoch` A follow-up PR will (likely) introduce saturating arithmetic for the time units - this is merely code moves, renames and fixing of small bugs.	2022-01-11 11:01:54 +01:00
tersec	ae61512ee9	rename upgrade_to_{merge,bellatrix}; detect unchanging spec YAMLs (#3265 )	2022-01-10 09:39:43 +00:00
Jacek Sieka	20e700fae4	Harden CommitteeIndex, SubnetId, SyncSubcommitteeIndex (#3259 ) * Harden CommitteeIndex, SubnetId, SyncSubcommitteeIndex Harden the use of `CommitteeIndex` et al to prevent future issues by using a distinct type, then validating before use in several cases - datatypes in spec are kept simple though so that invalid data still can be read. * fix invalid epoch used in REST `/eth/v1/beacon/states/{state_id}/committees` committee length (could return invalid data) * normalize some variable names * normalize committee index loops * fix `RestAttesterDuty` to use `uint64` for `validator_committee_index` * validate `CommitteeIndex` on ingress in REST API * update rest rules with stricter parsing * better REST serializers * save lots of memory by not using `zip` ...at least a few bytes!	2022-01-09 01:28:49 +02:00
Jacek Sieka	6f7e0e3393	REST cleanups (#3255 ) * REST cleanups * reject out-of-range committee requests * print all hex values as lower-case * allow requesting state information by head state root * turn `DomainType` into array (follow spec) * `uint_to_bytesXX` -> `uint_to_bytes` (follow spec) * fix wrong dependent root in `/eth/v1/validator/duties/proposer/` * update documentation - `--subscribe-all-subnets` is no longer needed when using the REST interface with validator clients * more fixes * common helpers for dependent block * remove test rules obsoleted by more strict epoch tests * fix trailing commas * Update docs/the_nimbus_book/src/rest-api.md * Update docs/the_nimbus_book/src/rest-api.md Co-authored-by: sacha <sacha@status.im>	2022-01-08 22:06:34 +02:00
Jacek Sieka	25bb927e62	better web3monitor error message (#3260 ) ``` WRN 2022-01-08 11:35:00.963+01:00 Eth1 chain monitoring failure, restarting topics="eth1" err="Failed to setup web3 connection: (111) Connection refused" ```	2022-01-08 14:35:36 +01:00
tersec	bac0eaa92e	update 10 modules from using merge to bellatrix (#3257 )	2022-01-07 18:10:40 +01:00
Jacek Sieka	ba99c8fe4f	update era file documentation / impl (#3226 ) Overhaul of era files, including documentation and reference implementations * store blocks, then state, then slot indices for easy lookup at low cost * document era file rationale * altair+ support in era writer	2022-01-07 11:13:19 +01:00
tersec	0fd8bf7b56	spec URL updates (#3254 )	2022-01-06 18:35:38 +00:00
tersec	8242e57f41	initial migration from spec/datatypes/{merge => bellatrix} (#3249 )	2022-01-06 12:25:35 +01:00
Emil	2a12d1c49f	Support more content-types when specifying the Graffiti string	2022-01-06 10:56:59 +02:00
Jacek Sieka	0e2b4e39fa	REST JSON support improvements (#3232 ) * support downloading blocks / states via JSON in addition to SSZ - slow, but needed for infura support - SSZ is still used when server supports it * use common forked block/state reader in REST API * fix stack overflows in REST JSON decoder * fix invalid serialization of `justification_bits` in `/eth/v1/debug/beacon/states` and `/eth/v2/debug/beacon/states` * fix REST client to use `/eth/...` instead of `/api/eth/...`, update "default" urls to expose REST api via `/eth` as well as this is what the standard says - `/api` was added early on based on an example "base url" in the spec that has been removed since * expose Nimbus REST extensions via `/nimbus` in addition to `/api/nimbus` to stay consistent with `/eth` * fix invalid state root when reading states via REST * fix recursive imports in `spec/ssz_codec` * remove usages of `serialization.useCustomSerialization` - fickle	2022-01-06 08:38:40 +01:00
Jacek Sieka	0a4728a241	Handle access to historical data for which there is no state (#3217 ) With checkpoint sync in particular, and state pruning in the future, loading states or state-dependent data may fail. This PR adjusts the code to allow this to be handled gracefully. In particular, the new availability assumption is that states are always available for the finalized checkpoint and newer, but may fail for anything older. The `tail` remains the point where state loading de-facto fails, meaning that between the tail and the finalized checkpoint, we can still get historical data (but code should be prepared to handle this as an error). However, to harden the code against long replays, several operations which are assumed to work only with non-final data (such as gossip verification and validator duties) now limit their search horizon to post-finalized data. * harden several state-dependent operations by logging an error instead of introducing a panic when state loading fails * `withState` -> `withUpdatedState` to differentiate from the other `withState` * `updateStateData` can now fail if no state is found in database - it is also hardened against excessively long replays * `getEpochRef` can now fail when replay fails * reject blocks with invalid target root - they would be ignored previously * fix recursion bug in `isProposed`	2022-01-05 19:38:04 +01:00
zah	fba1f08a5e	Implement #3129 (Optimized history traversals in the REST API) (#3219 ) * Fix some REST call signatures and implement a simple API benchmark tool * Implement #3129 (Optimized history traversals in the REST API) Other notable changes: The `updateStateData` procedure in the `blockchain_dag.nim` module is optimized to not rewind down to the last snapshot state saved in the database if the supplied input state can be used as a starting point instead. * Disallow await in withStateForBlockSlot	2022-01-05 15:49:10 +01:00
tersec	5878d34117	rename forkDigests.merge to forkDigests.bellatrix (#3245 )	2022-01-05 14:24:15 +00:00
tersec	66c9b7fbce	shift block_sim fork epochs; allow VC to work with non-multiple-of-3 SECONDS_PER_SLOT (#3244 )	2022-01-05 13:41:39 +00:00
tersec	7594fa660d	copyright year and spec URL updates (#3243 )	2022-01-05 11:07:14 +00:00
Zahary Karadjov	54d0d588b1	Implementation of the Keymanager API (BETA) https://github.com/ethereum/keymanager-APIs	2022-01-04 18:51:45 +02:00
Tanguy	511f2d24f0	Tune getLowSubnets (#3241 ) * Tune getLowSubnets * Also aim for dHigh peers in gossipsub * Apply suggestions from code review Co-authored-by: Jacek Sieka <jacek@status.im> Co-authored-by: Jacek Sieka <jacek@status.im>	2022-01-04 14:37:04 +01:00
tersec	b81c06edab	rename Beacon{Block,State}Fork.Merge to Bellatrix; update copyright years (#3240 )	2022-01-04 09:45:38 +00:00
tersec	d20387e910	update copyright years and spec URLs (#3239 )	2022-01-04 06:08:19 +00:00
tersec	da017d2ca5	update from phase0/altair v1.1.6 URLs to v1.1.8 spec URLs (#3238 )	2022-01-04 03:57:15 +00:00
Jacek Sieka	c4ce59e55b	Assorted logging improvements (#3237 ) * log doppelganger detection when it activates and when it causes missed duties * less prominent eth1 sync progress * log in-progress sync at notice only when actually missing duties * better detail in replay log * don't log finalization checkpoints - this is quite verbose when syncing and already included in "Slot start"	2022-01-03 22:18:49 +01:00
tersec	3c63a78c01	use v1.1.8 test vectors (#3236 )	2022-01-03 17:43:00 +00:00
Jacek Sieka	61b6fc1016	3x speedup in snappy compression (#3234 ) * 3x speedup in snappy compression oh, the wonders of `copyMem` in `endians2` - speeds up all kinds of operations like database stores, sending gossip etc. * endian usage fixes	2022-01-03 18:17:10 +01:00
tersec	e78d12beb9	support GOSSIP_MAX_SIZE_MERGE blocks; prevent fork choice stutter via aggregate attestations (#3230 ) * support GOSSIP_MAX_SIZE_MERGE-sized blocks; prevent fork choice clock stutter via aggregate attestations * relay max gossip size to libp2p, use tight uncompressed bounds for fixed-size messages * Update beacon_chain/networking/eth2_network.nim Co-authored-by: Jacek Sieka <jacek@status.im> * Update beacon_chain/networking/eth2_network.nim Co-authored-by: Jacek Sieka <jacek@status.im> Co-authored-by: Jacek Sieka <jacek@status.im>	2022-01-03 16:20:15 +00:00
tersec	8be1699014	use v1.1.7 test vectors (#3231 ) * use v1.1.7 test vectors	2022-01-03 13:06:14 +00:00
tersec	d4680df8d2	convert between engine and consensus ExecutionPayloads (#3228 ) * convert between engine and consensus ExecutionPayloads	2022-01-03 13:22:56 +01:00
Jacek Sieka	7ec97a6b35	Fix missing checkpoint states (#3225 ) With the right sequence of events (for example a REST request or a validation), it can happen that the first traversal across a state checkpoint boundary is done without storing that state on disk - this causes problems when replaying states, because now states may be missing from the database. Here, we simply avoid using the caches when advancing a state that will go into the database, ensuring that the information lost during caching always is permanently stored. * fix recursion bug in `isProposed`	2021-12-30 12:33:03 +01:00
Jacek Sieka	6b60a774e0	Lazy aggregated batch verification (#3212 ) A novel optimisation for attestation and sync committee message validation: when batching, we look for signatures of the same message and aggregate these before batch-validating: this results in up to 60% fewer signature verifications on a busy server, leading to a significant reduction in CPU usage. * increase batch size slightly which helps finding more aggregates * add metrics for batch verification efficiency * use simple `blsVerify` when there is only one signature to verify in the batch, avoiding the RNG	2021-12-29 15:28:40 +01:00
Zahary Karadjov	a860cd6250	Restore the build support of the -d:has_genesis_detection feature	2021-12-23 16:58:54 +02:00
tersec	1a6a56bdb1	use BeaconTime instead of Slot in fork choice (#3138 ) * use v1.1.6 test vectors; use BeaconTime instead of Slot in fork choice * tick through every slot at least once * use div INTERVALS_PER_SLOT and use precomputed constants of them * use correct (even if numerically equal) constant	2021-12-21 18:56:08 +00:00
tersec	0d4e49f946	Merge fork gossip support (#3213 ) * Merge fork gossip support * index directly by BeaconStateFork and remove debugging log statement	2021-12-21 15:24:23 +01:00
Jacek Sieka	1021e3324e	Revert writing backfill root to database (#3215 ) Introduced in #3171, it turns out we can just follow the block headers to achieve the same effect * leaves the constant in the code so as to avoid confusion when reading database that had the constant written (such as the fleet nodes and other unstable users)	2021-12-21 11:40:14 +01:00
Jacek Sieka	c270ec21e4	Validator monitoring (#2925 ) Validator monitoring based on and mostly compatible with the implementation in Lighthouse - tracks additional logs and metrics for specified validators so as to stay on top of performance. The implementation works more or less the following way: * Validator pubkeys are singled out for monitoring - these can be running on the node or not * For every action that the validator takes, we record steps in the process such as messages being seen on the network or published in the API * When the dust settles at the end of an epoch, we report the information from one epoch before that, which coincides with the balances being updated - this is a tradeoff between being correct (waiting for finalization) and providing relevant information in a timely manner	2021-12-20 20:20:31 +01:00
tersec	6ef3834f4a	fix type-conversions-to-self, unexport from nimbus_beacon_node, and rm unused vars/procs (#3211 )	2021-12-20 12:21:17 +01:00
tersec	c7be88b432	some spec URL updates (#3210 )	2021-12-19 15:12:33 +00:00
tersec	57974ce61b	forkchoiceUpdate support (#3199 )	2021-12-17 12:23:32 +00:00
Tanguy	4a72def1d5	Bump libp2p (#3207 )	2021-12-17 12:39:24 +01:00
tersec	d7799ecdcc	v1.1.6 spec updates (#3206 )	2021-12-17 06:56:33 +00:00
Jacek Sieka	118840d241	SyncManager cleanups for backfill support (#3189 ) * SyncManager cleanups for backfill support Cleanups, fixes and simplifications, in anticipation of backfill support for the `SyncManager`: * reformat sync progress indicator to show time left and % done more prominently: * old: `sync="sPssPsssss:2:2.4229:00h57m (2706898)"` * new: `sync="14d12h31m (0.52%) 1.1378slots/s (wQQQQQDDQQ:1287520)"` * reset average speed when going out of sync * pass all block errors to sync manager, including duplicate/unviable * penalize peers for reporting a head block that is outside of our expected wall clock time (they're likely on a different network or trying to disrupt sync) * remove `SyncFailureKind` (unused) * remove `inRange` (unused) * add `Q` for sync queue requests that are in the `SyncQueue` but not yet in the `BlockProcessor` queue * update last slot in `SyncQueue` after getting peer status * fix race condition between `wakeupWaiters` and `resetWait`, where workers would not be correctly reset if block verification returned a completed future without event loop * log syncmanager direction * Fix ordering issue. Requests whose size is not equal to `chunkSize` could be processed in the wrong order, which could cause the sync process to freeze. Co-authored-by: cheatfate <eugene.kabanov@status.im>	2021-12-16 15:57:16 +01:00
Etan Kissling	0037e6b89c	reject malformed keystore files (#3201 ) PBKDF2 based keystore files are required to have `dklen >= 32`. This patch ensures that keystores not fulfilling that requirement are properly rejected.	2021-12-15 19:55:11 +01:00
Jacek Sieka	0f44d2eff7	additional startup logging	2021-12-15 11:13:48 +01:00

2521 Commits