nimbus-eth2

Commit Graph

Author	SHA1	Message	Date
Jacek Sieka	9c2f43ed0e	Speed up altair block processing 2x (#3115 ) * Speed up altair block processing >2x Like #3089, this PR drastially speeds up historical REST queries and other long state replays. * cache sync committee validator indices * use ~80mb less memory for validator pubkey mappings * batch-verify sync aggregate signature (fixes #2985) * document sync committee hack with head block vs sync message block * add batch signature verification failure tests Before: ``` ../env.sh nim c -d:release -r ncli_db --db:mainnet_0/db bench --start-slot:-1000 All time are ms Average, StdDev, Min, Max, Samples, Test Validation is turned off meaning that no BLS operations are performed 5830.675, 0.000, 5830.675, 5830.675, 1, Initialize DB 0.481, 1.878, 0.215, 59.167, 981, Load block from database 8422.566, 0.000, 8422.566, 8422.566, 1, Load state from database 6.996, 1.678, 0.042, 14.385, 969, Advance slot, non-epoch 93.217, 8.318, 84.192, 122.209, 32, Advance slot, epoch 20.513, 23.665, 11.510, 201.561, 981, Apply block, no slot processing 0.000, 0.000, 0.000, 0.000, 0, Database load 0.000, 0.000, 0.000, 0.000, 0, Database store ``` After: ``` 7081.422, 0.000, 7081.422, 7081.422, 1, Initialize DB 0.553, 2.122, 0.175, 66.692, 981, Load block from database 5439.446, 0.000, 5439.446, 5439.446, 1, Load state from database 6.829, 1.575, 0.043, 12.156, 969, Advance slot, non-epoch 94.716, 2.749, 88.395, 100.026, 32, Advance slot, epoch 11.636, 23.766, 4.889, 205.250, 981, Apply block, no slot processing 0.000, 0.000, 0.000, 0.000, 0, Database load 0.000, 0.000, 0.000, 0.000, 0, Database store ``` * add comment	2021-11-24 13:43:50 +01:00
Jacek Sieka	95dd846a9b	Make sync horizon configurable (#3113 ) Currently, we don't have a good answer to the question "are we synced yet" - the sync manager syncs based on the peers it's connected to, but just because some peer looks like it should be synced from doesn't mean we're out of sync. Instead, we use a very silly time-based heuristic - the problem with that is that the network can go into a rut where nobody produces blocks - better heuristics would be needed here, but in the meantime, a command line option can get us out of a tight spot - this PR places such an option in the client, in the unlikely event it should be needed (most likely in a testnet).	2021-11-18 20:35:26 +01:00
Jacek Sieka	f19a497eec	ncli_db: add putState, putBlock (#3096 ) * ncli_db: add putState, putBlock These tools allow modifying an existing nimbus database for the purpose of recovery or reorg, moving the head, tail and genesis to arbitrary points. * remove potentially expensive `putState` in `BeaconStateDB` * introduce `latest_block_root` which computes the root of the latest applied block from the `latest_block_header` field (instead of passing it in separately) * avoid some unnecessary BeaconState copies during init * discover https://github.com/nim-lang/Nim/issues/19094 * prefer `HashedBeaconState` in a few places to avoid recomputing state root * fetch latest block root from state when creating blocks * harden `get_beacon_proposer_index` against invalid slots and document * move random spec function tests to `test_spec.nim` * avoid unnecessary state root computation before block proposal	2021-11-18 13:02:43 +01:00
tersec	9e395011d9	update 22 spec URLs to v1.1.5 (#3111 )	2021-11-18 08:08:00 +00:00
Jacek Sieka	b22d86e161	REST/JSON-RPC: speed up several requests (#3092 ) REST/JSON-RPC and a few more also invalidate caches unnecessarily, similar to https://github.com/status-im/nimbus-eth2/pull/3089 * avoid copying validator on balance request	2021-11-12 23:29:28 +01:00
tersec	2e868dc2ba	mass/mechanical update of 1.1.4 phase0 and altair spec URLs to 1.1.5 (#3067 )	2021-11-09 07:40:41 +00:00
tersec	eb3ad25859	mass/mechanical update of 1.1.3 phase/altair spec URLs to 1.1.4 (#3058 )	2021-11-08 06:18:10 +00:00
Zahary Karadjov	29e5700838	Bugfix: Avoid the aggregation of duplicate signatures when creating sync committee contributions	2021-11-07 21:41:10 +02:00
Jacek Sieka	ea0a191723	Better REST/RPC error messages (#3046 ) * Better REST/RPC error messages * homogenise block logging (root first) * homegenise message verification pipeline (verify in `gossip_verification`, act in `eth2_processor`) * use `subcommitteeIdx` consistently * log each sent contribution * fix block_sim * fix block topic * don't recalc root on gossip block validation * move position loop into sync pool	2021-11-05 17:39:47 +02:00
tersec	36e37bda40	v1.1.3 spec refs URLs (#3036 )	2021-10-27 18:40:17 +00:00
tersec	8307e9c601	mechanical non-merge v1.1.2 to v1.1.3 spec URL updates (#3030 )	2021-10-26 16:44:23 +00:00
Jacek Sieka	421bf936ff	odds and ends (#3015 ) * `allSyncCommittees` => `allSyncSubcommittees` * simplify `_snappy` topic generation (avoid pointless string copies) * simplify gossip id generator (avoid pointless string copies) * avoid redundant syncnet ENR updates * simplify topic validation (allow only validated topics)	2021-10-21 15:09:19 +02:00
Jacek Sieka	9cf32c3748	clean up sync subcommittee handling * `SyncCommitteeIndex` -> `SyncSubcommitteeIndex` * `syncCommitteePeriod` -> `sync_committee_period` (spec spelling) * tighten period comparisons * fix assert when validating committee message with non-altair state in REST api	2021-10-20 22:59:13 +03:00
Jacek Sieka	bf6ad41d7d	add drop and sync committee metrics * use storeBlock for processing API blocks * avoid double block dump * count all gossip metrics at the same spot * simplify block broadcast	2021-10-20 18:20:12 +03:00
Jacek Sieka	c247702ebc	normalize subnet logging * call it subnet id everywhere * log aggregate sent from VC * log subnet with aggregate	2021-10-20 15:06:44 +03:00
tersec	c0a2f1c98e	refactor executionPayload tests; reduce HashSet creation (#3003 )	2021-10-20 13:36:38 +02:00
Jacek Sieka	df3fc9525f	import cleanup (#2997 ) * import cleanup ...and remove some unused types * add random imports * more imports	2021-10-19 16:09:26 +02:00
Jacek Sieka	c40cc6cec1	clean up fork enum and field names * single naming strategy * simplify some fork code * simplify forked block production	2021-10-19 11:06:38 +03:00
Jacek Sieka	4f7a8cf79d	register vc duties with subnet tracker (#2949 ) * register vc duties with subnet tracker * fix activation logging during startup * cache slot signature to avoid duplicate signature work * schedule aggregation duties one slot at a time to avoid CPU spike at each epoch * lower aggregation subnet pre-subscription time to 4 slots (lowers bandwidth and CPU usage) * update stability subnets in ENR on startup * log gossip state * perform gossip subscriptions just before the next slot starts * document stuff * add random include * don't overwrite subscription state when not subscribed * log target gossip state * updating gossip status once is enough * add test * remove syncQueueLen - this one is not updated at the end of the sync and may cause gossip to disconnect itself completely - use a simple head distance instead * fix gossip disconnection - if in hysteresis, node.gossipState will be set to disabled even though we don't disable topic subscriptions * fix extra duty registration call	2021-10-18 11:11:44 +02:00
tersec	2eb9a608a4	add payloadId; add merge vector test script; remove consensusValidated (#2982 )	2021-10-13 16:08:50 +02:00
tersec	2ad1b7366a	update 62 spec URLs to v1.1.2 (#2979 )	2021-10-12 10:17:37 +00:00
tersec	0ae736f397	update 67 spec URLs to v1.1.2 (#2977 )	2021-10-12 08:09:59 +00:00
zah	29ee9ec81e	Bugfix: don't crash on keystores without description (#2967 )	2021-10-07 18:30:34 +02:00
Etan Kissling	9ee134324b	allow `withXxx` to access fork-specific fields (#2943 ) So far, `withState` and `withBlck` templates could only be used to have convenience access to fork-agnostic BeaconState and BeaconBlock fields. This patch: - injects an additional `stateFork` constant that allows to use `when` expressions to also access Altair and Merge-specific fields. - introduces a `withStateAndBlck` template to support operating on both a `BeaconState` and `BeaconBlock` at a time. - makes sync committee related functions Merge aware. - changes a couple if-else trees for forks into case statements so that forgotten future forks are promoted to compile-time errors.	2021-10-06 20:05:06 +03:00
Eugene Kabanov	65257b82f8	Validator key management API (#2755 ) Implements https://github.com/ethereum/beacon-APIs/pull/151	2021-10-04 22:08:31 +03:00
Etan Kissling	b217150f1d	use forked `getAttestationsForBlock` everywhere (#2937 ) There are a number of locations in the code that get attestations on a forked beacon state. For attestation pools test, a convenience wrapper was available to reduce clutter. This patch integrates that wrapper into the core component so that it can also take advantage of the wrapper.	2021-10-01 01:29:32 +00:00
Etan Kissling	ba84a55699	fix `makeBeaconBlockForHeadAndSlot` for merge (#2934 ) This fixes an if-else structure that was not aware of the merge phase in `makeBeaconBlockForHeadAndSlot`, avoiding a potential crash.	2021-09-30 16:27:17 +00:00
Etan Kissling	2e9fa87f8b	use `SyncAggregate.init()` everywhere (#2932 ) The initialization of a `SyncAggregate` to its default value is not very intuitive. There is an `init` function in `sync_committee_msg_pool` that provides a convenience wrapper. This patch exports that initializer so that the rest of the code base can also take advantage of it.	2021-09-30 13:56:07 +00:00
tersec	47a6045b05	update a dozen spec URLs (#2929 )	2021-09-29 23:43:28 +00:00
Etan Kissling	e243ba2c0b	revise `makeBeaconBlock` overloads (#2879 ) The phase0 and altair overloads of `makeBeaconBlock` slightly differ in their signatures which makes using them unnecessarily verbose. - A placeholder `sync_aggregate` argument similar to `executionPayload` is added to the phase0 overload to match the altair signature. - A wrapper operating on `ForkedHashedBeaconState` is introduced.	2021-09-29 12:10:44 +00:00
tersec	2b2846b468	implement forked merge state/block support (#2890 ) * implement forked state/block support * merge support for containsOrphan; import cleanup; 80-column lines * add merge block header operations and slot sanity fixture * add epoch state transition tests; implement is_valid_gas_limit(), is_merge_block(), is_execution_enabled(), and compute_timestamp_at_slot() * implement process_execution_payload() and add merge deposit operations tests * add merge block sanity tests * add merge case to syncCommitteeParticipants * v1.1.0-beta.5 updates * reduce getTestStates-based memory usage; don't try to REST-serialize ExecutionPayload transactions without underlying support * add execution payload tests; switch var to let in tests/official/	2021-09-27 14:22:58 +00:00
Jacek Sieka	e47a8cbe42	fixes (#2901 ) * export kvstore from beacon_chain_db * fix rest HashList deserialization * fix asTrusted	2021-09-27 11:24:58 +02:00
Eugene Kabanov	0c635334a2	Sync committee related REST API implementation. (#2856 )	2021-09-24 01:13:25 +03:00
Eugene Kabanov	b566d4657f	REST /eth/v1/events API call implementation. (#2878 ) * Placing callbacks into strategic places. * Initial events call implementation. * Post rebase fixes. * Change addSyncContribution() implementation. * Add `attestation-sent` event. Remove gcsafe, raises from callbacks implementations. Move `attestation-received` fire at the end of attestation processing. * Address review comments.	2021-09-22 14:17:15 +02:00
tersec	0ad7216bc4	mass update from beta.3 spec refs to beta.4 spec refs (#2862 )	2021-09-10 18:56:03 +00:00
tersec	9e145afda3	update 29 Altair spec ref URLs to beta.3 (#2839 )	2021-08-31 12:16:27 +00:00
Zahary Karadjov	7d1efa443d	Restore the sync committee pool pruning and add tests	2021-08-30 11:06:45 +03:00
zah	3689c68cbf	Carry out the sync committee gossip duties Other changes: * Add server getBlockV2(), and produceBlockV2(). * Add getBlockV2() to REST test suite. * Add client getBlockV2(), and produceBlockV2(). * Fix URLs in comments. * Add some primitives and fix some issues in forks.nim. * Switch `validator_client` to V2 calls usage. * Bump `chronos` with imports fixes. * Bump `nim-json-serialization` for `requireAllFields`.	2021-08-30 03:58:30 +03:00
tersec	2d8a796a93	altair-capable beacon block creation (#2834 ) * altair-capable beacon block creation * update block_sim to use sync committees and the new block production interface	2021-08-29 14:50:21 +00:00
tersec	0418fbada2	introduce SyncCommitteeMsgPool to eth2_processor and nimbus_beacon_node (#2831 )	2021-08-28 22:27:51 +00:00
Jacek Sieka	6d47d96c84	altair upgrade for prater (#2828 ) and a few import fixes for free	2021-08-27 16:54:51 +00:00
Jacek Sieka	6a4bf98ea2	better error messages for keystore operations (#2812 ) in particular, incluse os error string	2021-08-27 16:53:21 +00:00
Jacek Sieka	01596c45dd	cleanups and fixes (#2827 ) * import cleanup * fix json-rpc exception handlers * avoid unnecessary presto client import * introduce ForkedBeaconBlock, some altair logging * url fixes	2021-08-27 11:00:06 +02:00
Jacek Sieka	ba06f13942	cleanups (#2809 ) * cleanups * use ForkedTrustedSignedBeaconBlock.ionit where appropriate * move `is_aggregator` to `spec/` * use `errReject` in a few more places * update enr fork id when time is auspicious * use network broadcast functions * Return Ignore for aggregate signature validation timeouts ...consistently between aggregates and attestations. * clean up some more reject/ignore rules * shorten texts a bit * errReject->checkedReject, use err helpers throughout * get rid of quarantine in exitpool as well	2021-08-24 21:49:51 +02:00
Eugene Kabanov	66cb18d69b	Number of REST fixes for Altair. (#2790 ) * Fix getForkSchedule call. Create cache of all configuration endpoints at node startup. Add prepareJsonResponse() call to create cached responses. Mark all procedures with `raises`. * Add getForkSchedule to VC. Fix getForkSchedule return type for API. More `raises` annotations. Fix VC fork_service.nim. * Use `push raises` instead of inline `raises`. * Improvements for REST API aggregated attestations and attestations processing. * Rename eth2_network.sendXXX procedures to eth2_network.broadcastXXX. Add broadcastBeaconBlock() and broadcastAggregateAndProof(). Fix links to specification in REST API declarations. Add implementation for v2 getStateV2(). Add validator_duties.sendXXX procedures which not only broadcast data, but also validate it. Fix JSON-RPC/REST to use new validator_duties.sendXXX procedures instead of own implementations. * Fix validator_client online nodes count incorrect value. Fix aggregate and proof attestation could be sent too late. * Adding timeout for block wait in attestations processing. Fix compilation errors. * Attempt to debug aggregate and proofs. * Fix Beacon AIP to use `sendAttestation`. Add link comment to produceBlockV2. * Add debug logs before publish operation for blocks, attestations and aggregated attestations. Fix attestations publishing issue. * logging fixes `indexInCommnittee` already logged in attestation Co-authored-by: Jacek Sieka <jacek@status.im>	2021-08-23 12:41:48 +02:00
tersec	092d9350de	eth2.0-specs -> consensus-specs repo rename (#2801 )	2021-08-20 23:37:45 +00:00
tersec	317b6de4e6	send attestations and exit messages on fork-appropriate topic (#2773 ) * send attestations and exit messages on fork-appropriate topic * document why use wall clock over attestation slot * centralize some fork-topic-picking-logic in eth2_network * pick up new test in summary * allow specified GetTimeFn for testing purposes * add GenesisTime and use it in eth2_network * replace GetTimeFn and GenesisTime with GetBeaconTimeFn	2021-08-19 10:45:31 +00:00
Jacek Sieka	a7a65bce42	disentangle eth2 types from the ssz library (#2785 ) * reorganize ssz dependencies This PR continues the work in https://github.com/status-im/nimbus-eth2/pull/2646, https://github.com/status-im/nimbus-eth2/pull/2779 as well as past issues with serialization and type, to disentangle SSZ from eth2 and at the same time simplify imports and exports with a structured approach. The principal idea here is that when a library wants to introduce SSZ support, they do so via 3 files: * `ssz_codecs` which imports and reexports `codecs` - this covers the basic byte conversions and ensures no overloads get lost * `xxx_merkleization` imports and exports `merkleization` to specialize and get access to `hash_tree_root` and friends * `xxx_ssz_serialization` imports and exports `ssz_serialization` to specialize ssz for a specific library Those that need to interact with SSZ always import the `xxx_` versions of the modules and never `ssz` itself so as to keep imports simple and safe. This is similar to how the REST / JSON-RPC serializers are structured in that someone wanting to serialize spec types to REST-JSON will import `eth2_rest_serialization` and nothing else. * split up ssz into a core library that is independendent of eth2 types * rename `bytes_reader` to `codec` to highlight that it contains coding and decoding of bytes and native ssz types * remove tricky List init overload that causes compile issues * get rid of top-level ssz import * reenable merkleization tests * move some "standard" json serializers to spec * remove `ValidatorIndex` serialization for now * remove test_ssz_merkleization * add tests for over/underlong byte sequences * fix broken seq[byte] test - seq[byte] is not an SSZ type There are a few things this PR doesn't solve: * like #2646 this PR is weak on how to handle root and other dontSerialize fields that "sometimes" should be computed - the same problem appears in REST / JSON-RPC etc * Fix a build problem on macOS * Another way to fix the macOS builds Co-authored-by: Zahary Karadjov <zahary@gmail.com>	2021-08-18 20:57:58 +02:00
tersec	a0c518cb4f	sync committee/aggregate signature signing and verification (#2784 ) * sync committee/aggregate signature signing and verification * add message signature tests	2021-08-17 08:07:17 +00:00
Jacek Sieka	7a622e8505	rework spec imports (#2779 ) The spec imports are a mess to work with, so this branch cleans them up a bit to ensure that we avoid generic sandwitches and that importing stuff generally becomes easier. * reexport crypto/digest/presets because these are part of the public symbol set of the rest of the spec types * don't export `merge` types from `base` - this causes circular deps * fix circular deps in `ssz/spec_types` - this is the first step in disentangling ssz from spec * be explicit about phase0 vs altair - longer term, `altair` will become the "natural" type set, then merge and so on, so no point in giving `phase0` special preferential treatment	2021-08-12 13:08:20 +00:00
Jacek Sieka	9697b73e71	forkedbeaconstate_helpers -> forks (#2772 ) Simpler module name for stuff that covers forks * check that runtime config matches database state * also include some assorted altair cleanups * use "standard" genesis fork in local testnet to work around missing runtime config support	2021-08-10 22:46:35 +02:00
tersec	d638ca0c7f	log overall sent aggregated attestation message signatures (#2754 ) * log overall sent aggregated attestation message signatures * log aggregated attestations via REST API	2021-08-03 10:32:55 +00:00
Jacek Sieka	2d6a661ac6	Syncv2 (#2723 ) * bump libp2p * altair sync v2 Use V2 sync requests after the altair fork has happened, according to the wall clock * Fix the behavior of the v1 req/resp calls after Altair Co-authored-by: Zahary Karadjov <zahary@gmail.com>	2021-07-15 21:01:07 +02:00
tersec	e4afc36d71	use ForkedTrustedSignedBeaconBlock (#2720 ) * use ForkedTrustedSignedBeaconBlock * remove --subscribe-all-subnets * https://ethereum.github.io/eth2.0-APIs/#/Beacon/getBlock implementation was passing through forked beaconblocks	2021-07-14 12:18:52 +00:00
Ștefan Talpalaru	840935ddc2	limit validator balance metric label values (#2719 )	2021-07-14 08:22:03 +02:00
Jacek Sieka	3f9c1fdf4e	More RuntimeConfig cleanup (#2716 ) * remove from BeaconChainDB (doesn't depend on runtime config) * eth2-testnets -> eth2-networks * use `cfg` name throughout	2021-07-13 16:27:10 +02:00
Eugene Kabanov	3b6f4fab4a	New validator client using REST API. (#2651 ) * Initial commit. * Exporting getConfig(). * Add beacon node checking procedures. * Post rebase fixes. * Use runSlotLoop() from nimbus_beacon_node. Fallback implementation. Fixes for ETH2 REST serialization. * Add beacon_clock.durationToNextSlot(). Move type declarations from beacon_rest_api to json_rest_serialization. Fix seq[ValidatorIndex] serialization. Refactor ValidatorPool and add some utility procedures. Create separate version of validator_client. * Post-rebase fixes. Remove CookedPubKey from validator_pool.nim. * Now we should be able to produce attestations and aggregate and proofs. But its not working yet. * Debugging attestation sending. * Add durationToNextAttestation. Optimize some debug logs. Fix aggregation_bits encoding. Bump chronos/presto. * Its alive. * Fixes for launch_local_testnet script. Bump chronos. * Switch client API to not use `/api` prefix. * Post-rebase adjustments. * Fix endpoint for publishBlock(). * Add CONFIG_NAME. Add more checks to ensure that beacon_node is compatible. * Add beacon committee subscription support to validator_client. * Fix stacktrace should be an array of strings. Fix committee subscriptions should not be `data` keyed. * Log duration to next block proposal. * Fix beacon_node_status import. * Use jsonMsgResponse() instead of jsonError(). * Fix graffityBytes usage. Remove unnecessary `await`. Adjust creation of SignedBlock instance. Remove legacy files. * Rework durationToNextSlot() and durationToNextEpoch() to use `fromNow`. * Fix race condition for block proposal and attestations for same slot. Fix local_testnet script to properly kill tasks on Windows. Bump chronos and nim-http-tools, to allow connections to infura.io (basic auth). * Catch services errors. Improve performance of local_testnet.sh script on Windows. Fix race condition when attestation producing. * Post-rebase fixes. * Bump chronos and presto. * Calculate block publishing delay. Fix pkill in one more place. * Add error handling and timeouts to firstSuccess() template. Add onceToAll() template. Add checkNodes() procedure. Refactor firstSuccess() template. Add error checking to api.nim calls. * Deprecated usage onceToAll() for better stability. Address comment and send attestations asap. * Avoid unnecessary loop when calculating minimal duration.	2021-07-13 13:15:07 +02:00
Jacek Sieka	23eea197f6	Implement split preset/config support (#2710 ) * Implement split preset/config support This is the initial bulk refactor to introduce runtime config values in a number of places, somewhat replacing the existing mechanism of loading network metadata. It still needs more work, this is the initial refactor that introduces runtime configuration in some of the places that need it. The PR changes the way presets and constants work, to match the spec. In particular, a "preset" now refers to the compile-time configuration while a "cfg" or "RuntimeConfig" is the dynamic part. A single binary can support either mainnet or minimal, but not both. Support for other presets has been removed completely (can be readded, in case there's need). There's a number of outstanding tasks: * `SECONDS_PER_SLOT` still needs fixing * loading custom runtime configs needs redoing * checking constants against YAML file * yeerongpilly support `build/nimbus_beacon_node --network=yeerongpilly --discv5:no --log-level=DEBUG` * load fork epoch from config * fix fork digest sent in status * nicer error string for request failures * fix tools * one more * fixup * fixup * fixup * use "standard" network definition folder in local testnet Files are loaded from their standard locations, including genesis etc, to conform to the format used in the `eth2-networks` repo. * fix launch scripts, allow unknown config values * fix base config of rest test * cleanups * bundle mainnet config using common loader * fix spec links and names * only include supported preset in binary * drop yeerongpilly, add altair-devnet-0, support boot_enr.yaml	2021-07-12 15:01:38 +02:00
zah	eb2dc5cbbb	Implement the new Altair req/resp protocols (#2676 ) * Implement the new Altair req/resp protocols Also fixes the altair message-id computation by providing the correct forkdigest prefix in `isAltairTopic`. Co-authored-by: Tanguy Cizain <tanguycizain@gmail.com>	2021-07-07 12:09:47 +03:00
Jacek Sieka	7825d12448	increase block attestation wait time (#2705 ) We generally send out attestations 250 ms after the block arrives. Recent efficiency improvements have led to a slightly increased incidence of "slot 0" issues where attestations are dropped by other nodes because they have not yet had time to process the block due to epoch processing taking time. This PR mitigates the problem by increasing the window between receiving the block and sending out attestations.	2021-07-06 15:11:18 +02:00
tersec	7577f8c2ef	add blockchain_dag altair database reading; add rollback tests (#2683 ) * add blockchain_dag altair database reading; add rollback tests; fix some unnecessary type conversions * remove debugging scaffolding * proposeSignedBlock() will need to be async for merge; introduce altair types to VC	2021-06-29 15:09:29 +00:00
tersec	ae1abf24af	add Altair support to block quarantine/clearance and block_sim (#2662 ) * add Altair support to the block quarantine * switch some spec/datatypes imports to spec/datatypes/base * add Altair support to block_clearance * allow runtime configuration of Altair transition slot * enable Altair in block_sim, including in CI	2021-06-23 14:43:18 +00:00
tersec	b1d5609171	remove false OnBlockAdded dependency on phase0 HashedBeaconState (#2661 ) * remove false OnBlockAdded dependency on phase.HashedBeaconState * introduce altair data types into block_clearance; update some alpha.6 spec refs to alpha.7; add get_active_validator_indices_len ForkedHashedBeaconState wrapper * switch many modules from using datatypes (with phase0 states/blocks) to datatypes/base (fork-independent); update spec refs from alpha.6 to alpha.7 and remove rm'd G2_POINT_AT_INFINITY * switch more modules from using datatypes (with phase0 states/blocks) to datatypes/base (fork-independent); update spec refs from alpha.6 to alpha.7 * remove unnecessary phase0-only wrapper of get_attesting_indices(); allow signatures_batch to process either fork; remove O(n^2) nested loop in process_inactivity_updates(); add altair support to getAttestationsforTestBlock() * add Altair versions of asSigVerified(), asTrusted(), and makeBeaconBlock() * fix spec URL to be Altair for Altair makeBeaconBlock()	2021-06-21 08:35:24 +00:00
tersec	53d05060c9	fix assertion in beacon block creation rollback/restore (#2655 )	2021-06-17 09:22:39 +02:00
tersec	146fa48454	use ForkedHashedBeaconState in StateData (#2634 ) * use ForkedHashedBeaconState in StateData * fix FAR_FUTURE_EPOCH -> slot overflow; almost always use assign() * avoid stack allocation in maybeUpgradeStateToAltair() * create and use dispatch functions for check_attester_slashing(), check_proposer_slashing(), and check_voluntary_exit() * use getStateRoot() instead of various state.data.hbsPhase0.root * remove withStateVars.hashedState(), which doesn't work as a design anymore * introduce spec/datatypes/altair into beacon_chain_db * fix inefficient codegen for getStateField(largeStateField) * state_transition_slots() doesn't either need/use blocks or runtime presets * combine process_slots(HBS)/state_transition_slots(HBS) which differ only in last-slot htr optimization * getStateField(StateData, ...) was replaced by getStateField(ForkedHashedBeaconState, ...) * fix rollback * switch some state_transition(), process_slots, makeTestBlocks(), etc to use ForkedHashedBeaconState * remove state_transition(phase0.HashedBeaconState) * remove process_slots(phase0.HashedBeaconState) * remove state_transition_block(phase0.HashedBeaconState) * remove unused callWithBS(); separate case expression from if statement * switch back from nested-ref-object construction to (ref Foo)(Bar())	2021-06-11 20:51:46 +03:00
Jacek Sieka	d859bc12f0	write uncompressed validator keys to database (#2639 ) * write uncompressed validator keys to database Loading 150k+ validator keys on startup in compressed format takes a lot of time - better store them in uncompressed format which makes behaviour just after startup faster / more predictable. * refactor cached validator key access * fix isomorphic cast to work with non-var instances * remove cooked pubkey cache - directly use database cache in chaindag as well (one less cache to keep in sync) * bump blscurve, introduce loadValid for known-to-be-valid keys	2021-06-10 10:37:02 +03:00
Jacek Sieka	abe0d7b4ae	singe validator key cache Instead of keeping a validator key list per EpochRef, this PR introduces a single shared validator key list in ChainDAG, and cleans up some other ChainDAG and key-related issues. The PR does not introduce the validator key list in the state transition - this is because we batch-check all signatures before entering the spec code, thus the spec code never hits the cache. A future refactor should _probably_ remove the threadvar altogether. There's a few other small fixes in here that make the flow easier to read: * fix `var ChainDAGRef` -> `ChainDAGRef` * fix `var QuarantineRef` -> `QuarantineRef` * consistent `dag` variable name * avoid using threadvar pubkey cache in most cases * better error messages in batch signature checking	2021-06-01 20:43:44 +03:00
tersec	46c5a0110a	log doppelganger attestation signature; rm withState.HashedBeaconState uses (#2608 )	2021-05-28 15:51:15 +03:00
Jacek Sieka	d16da06c92	ncli_db: validator performance database tool Record attestation performance per epoch in sqlite database	2021-05-27 19:14:26 +03:00
tersec	0b0bfd1de0	use StateData in place of BeaconState outside state transition code (#2551 ) * use StateData in place of BeaconState outside state transition code * propagate more StateData usage * remove withStateVars().state * wrap get_beacon_committee(BeaconState, ...) as gbc(StateData, ...) * switch makeAttestation() to use StateData * use StateData wrapper/dispatcher for get_committee_count_per_slot() * convert AttestationCache.init(), weak subjectivity functions, and updateValidatorMetrics() * add get_shuffled_active_validator_indices(StateData) and get_block_root_at_slot(StateData) * switch makeAttestationData() to StateData * sync AllTests-mainnet.md after rebase	2021-05-21 09:23:28 +00:00
Zahary Karadjov	dc49a51654	Merge stable into unstable (take 2)	2021-05-20 13:52:09 +03:00
Zahary Karadjov	b7aa30adfd	Merge stable into unstable	2021-05-20 13:50:40 +03:00
tersec	d8bb91d9a9	partially integrate eth1 merge changes (#2548 ) * partially integrate eth1 merge changes * use hexToSeqByte() and validate execution engine opaque transaction length * remove incorrect REST serialization code	2021-05-20 10:44:13 +00:00
Zahary Karadjov	2cb1396969	Log the slashing DB pruning time	2021-05-17 21:42:28 +03:00
Zahary Karadjov	b9924214ab	Better error-handling for the slashingdb import/export feature * Error when specifying an invalid --data-dir (or --validator-dir) * Error when entering an invalid validator public key (e.g. invalid hex value) * Warning when attempting to export a validator not present in the local database Some unnecessary remains of the v1 mode has been removed as well	2021-05-17 21:42:23 +03:00
Jacek Sieka	97f4e1fffe	Db1 cont (#2573 ) * Revert "Revert "Upgrade database schema" (#2570)" This reverts commit `6057c2ffb4`. * ssz: fix loading empty lists into existing instances Not a problem earlier because we didn't reuse instances * bump nim-eth * bump nim-web3	2021-05-17 18:37:26 +02:00
Zahary Karadjov	5c313b958e	Simplify the slashing db import/export CLI	2021-05-17 17:12:03 +03:00
tersec	6057c2ffb4	Revert "Upgrade database schema" (#2570 ) This reverts commit `22ddf74752`.	2021-05-17 06:34:44 +00:00
Mamy André-Ratsimbazafy	dacc508992	slashing import integrated in NBC	2021-05-16 21:48:38 +03:00
Mamy André-Ratsimbazafy	0574531c43	add restore from slashing DB	2021-05-16 21:45:24 +03:00
Jacek Sieka	895ccd1c95	clean up imports (#2557 )	2021-05-14 20:08:07 +03:00
Jacek Sieka	22ddf74752	Upgrade database schema The `kvstore` design we're using now turns out to not be the best way to use `sqlite` - in particular, there are some significant benefits to using rowid in certain situations and to keep data in separate tables. With this branch, there are massive improvements in startup time (seconds instead of minutes) and state/block storage and pruning times (milliseconds instead of seconds) - these improvements can in particular be seen on slow drives and translate directly into better attestation performance. * update kvstore to new keyspace design * remove `DirStoreRef` and the hidden `--state-db-kind` option - this was an experiment to store large blobs in files, but with the new kvstore, there's no compelling reason to do so * remove `DbMap` - unused and would need updating for new keyspace design * introduce separate tables for each data type (blocks, states etc) * remove "WITHOUT ROWID" pessimization for tables with large blobs * close DbSeq statements explicitly (and earlier) * store beacon block summaries in separate table, without SSZ compression and load them all with single query on startup * stop storing backwards compat full states * mark genesis beacon block as trusted * avoid faststreams when loading SSZ data * remove `DisagreementBehavior` (unused)	2021-05-14 20:05:23 +03:00
Jacek Sieka	0022015a91	clean up imports (#2557 )	2021-05-12 14:31:02 +02:00
Mamy Ratsimbazafy	149ff49c8e	Remove correlated queries in finalization pruning, all use indexes (#2554 )	2021-05-11 10:41:37 +02:00
Mamy Ratsimbazafy	e6b559a35a	Slashing db pruning [Merge only after v2 has been default for 1 noticeable release] (#2452 ) * Enable slashing DB pruning * integrate slashing DB pruning with onSlotEnd * rebase tests	2021-05-10 16:32:28 +02:00
Jacek Sieka	867d8f3223	Perform attestation check before broadcast (#2550 ) Currently, we have a bit of a convoluted flow where when sending attestations, we start broadcasting them over gossip then pass them to the attestation validation to include them in the local attestation pool - it should be the other way around: we should be checking attestations _before_ gossipping them - this serves as an additional safety net to ensure that we don't publish junk - this becomes more important when publishing attestations from the API. Also, the REST API was performing its own validation meaning attestations coming from REST would be validated twice - finally, the JSON RPC wasn't pre-validating and would happily broadcast invalid attestations. * Unified attestation production pipeline with the same flow for gossip, locally and API-produced attestations: all are now validated and entered into the pool, then broadcast/republished * Refactor subnet handling with specific SubnetId alias, streamlining where subnets are computed, avoiding the need to pass around the number of active validators * Move some of the subnet handling code to eth2_network * Use BitArray throughout for subnet handling	2021-05-10 09:13:36 +02:00
Jacek Sieka	efdf759cc0	avoid some slashing protection queries (#2528 ) This PR reduces the number of database queries for slashing protection from 5 reads and 1 write to 2 reads and 1 write in the optimistic case. In the process, it removes user-level support for writing the database in the version 1 format in order to simplify the code flow, and prevent code rot. In particular, the v1 format was not covered by any unit tests and has no advantages over v2. The concrete code to read and write it remains for now, in particular to support upgrades from v1 to v2. The branch also removes the use of concepts which doesn't work with checked exceptions - in particular, this highlights code that both raises exceptions and returns error codes, which could be cleaned up in the future. * Cache internal validator ID * Rely on unique index to check for trivial duplicate votes * Combine two surround vote queries into one * Combine API for checking and registering slashing into single function The slashing DB is normally not a bottleneck, but may become one with high attached validator counts.	2021-05-04 15:17:28 +02:00
Jacek Sieka	7dba1b37dd	remove attestation/aggregate queue (#2519 ) With the introduction of batching and lazy attestation aggregation, it no longer makes sense to enqueue attestations between the signature check and adding them to the attestation pool - this only takes up valuable CPU without any real benefit. * add successfully validated attestations to attestion pool directly * avoid copying participant list around for single-vote attestations, pass single validator index instead * release decompressed gossip memory earlier, specially during async message validation * use cooked signatures in a few more places to avoid reloads and errors * remove some Defect-raising versions of signature-loading * release decompressed data memory before validating message	2021-04-26 22:39:44 +02:00
tersec	99fccaee6e	more abstraction over BeaconState (#2509 ) * more abstraction over BeaconState * use HashedBeaconState copy of htr	2021-04-16 08:49:37 +00:00
Jacek Sieka	f1f424cc2d	attestation processing speedups * avoid creating indexed attestation just to check signatures - above all, don't create it when not checking signatures ;) * avoid pointer op when adding attestation to pool * better iterator for yielding attestations * add metric / log for attestation packing time	2021-04-14 21:51:17 +03:00
tersec	050e3ac48b	abstract over more BeaconState usage (#2496 )	2021-04-14 11:34:35 +02:00
Jacek Sieka	4ed2e34a9e	Revamp attestation pool This is a revamp of the attestation pool that cleans up several aspects of attestation processing as the network grows larger and block space becomes more precious. The aim is to better exploit the divide between attestation subnets and aggregations by keeping the two kinds separate until it's time to either produce a block or aggregate. This means we're no longer eagerly combining single-vote attestations, but rather wait until the last moment, and then try to add singles to all aggregates, including those coming from the network. Importantly, the branch improves on poor aggregate quality and poor attestation packing in cases where block space is running out. A basic greed scoring mechanism is used to select attestations for blocks - attestations are added based on how much many new votes they bring to the table. * Collect single-vote attestations separately and store these until it's time to make aggregates * Create aggregates based on single-vote attestations * Select _best_ aggregate rather than _first_ aggregate when on aggregation duty * Top up all aggregates with singles when it's time make the attestation cut, thus improving the chances of grabbing the best aggregates out there * Improve aggregation test coverage * Improve bitseq operations * Simplify aggregate signature creation * Make attestation cache temporary instead of storing it in attestation pool - most of the time, blocks are not being produced, no need to keep the data around * Remove redundant aggregate storage that was used only for RPC * Use tables to avoid some linear seeks when looking up attestation data * Fix long cleanup on large slot jumps * Avoid some pointers * Speed up iterating all attestations for a slot (fixes #2490)	2021-04-13 20:24:02 +03:00
Dustin Brody	398c151b7d	revert change	2021-04-13 18:50:06 +02:00
Dustin Brody	d6fa4d06bc	abstract over more BeaconState usage	2021-04-13 18:47:44 +02:00
tersec	498c998552	abstract over most withStateVars/withState state var usage (#2484 ) * abstract over most withStateVars/withState state var usage * cleanups	2021-04-13 15:05:44 +02:00
tersec	d3cad92693	remove some BeaconState use and abstract over other uses (#2482 ) * remove some BeaconState use and abstract over other uses * remove out-of-context comment	2021-04-08 08:24:25 +00:00
Mamy Ratsimbazafy	6b13cdce36	Batch attestations (#2439 ) * batch attestations * Fixes (but now need to investigate the chronos 0 .. 4095 crash similar to https://github.com/status-im/nimbus-eth2/issues/1518 * Try to remove the processing loop to no avail :/ * batch aggregates * use resultsBuffer size for triggering deadline schedule * pass attestation pool tests * Introduce async gossip validators. May fix the 4096 bug (reentrancy issue?) (similar to sync unknown blocks #1518) * Put logging at debug level, add speed info * remove unnecessary batch info when it is known to be one * downgrade some logs to trace level * better comments [skip ci] * Address most review comments * only use ref for async proc * fix exceptions in eth2_network * update async exceptions in gossip_validation * eth2_network 2nd pass * change to sleepAsync * Update beacon_chain/gossip_processing/batch_validation.nim Co-authored-by: Jacek Sieka <jacek@status.im> Co-authored-by: Jacek Sieka <jacek@status.im>	2021-04-02 16:36:43 +02:00
Jacek Sieka	74732a23fe	json cleanups (#2456 ) * move json-rpc specific marshalling to rpc * serialize Epoch/Slot with cast to avoid Defect * avoid a few eth1 deps * simplify imports	2021-03-26 15:11:06 +01:00
Jacek Sieka	2695cfa864	EH cleanup (#2455 ) almost 100% raises in nimbus-eth2 now! * fix some rare exception-related crashes in json-rpc	2021-03-26 07:52:01 +01:00
Zahary Karadjov	2eacfc4685	Bump modules to take advantage of the new Json format flavors support Since quite a lot of additional procs were now compiled as generics, this lead to compiler bugs that had to be worked-around: * The `Domain` type was renamed to `Eth2Domain` to avoid compilation errors due to conflicts with `nativesockets.Domain`. Similarly, `eth2_network.KeyPair` was renamed to `NetKeyPair`. * A new more robust version of `hexToByteArray` was added to stew	2021-03-25 09:37:35 +02:00
Jacek Sieka	8b76ceed52	Fix minor exception effect issues (#2448 ) Makes code compatible with https://github.com/status-im/nim-chronos/pull/166 without requiring it.	2021-03-24 17:20:55 +01:00
tersec	3076f5c3b6	rm std/random from beacon_chain and rm attestation timing randomness (#2442 ) * remove added attestation timing randomness * remove os/random from rest of beacon_chain, primarily deposit_contract * remove scaffolding * randomize std/random seed in beacon node and validator client * use CSPRNG to more securely seed std/random	2021-03-23 06:57:10 +00:00
tersec	97850741a0	remove unused imports in slashing prtection (#2436 )	2021-03-19 08:26:02 +00:00
Jacek Sieka	3cb31e66b4	set upper bound on EpochRef cache (#2403 ) * set upper bound on EpochRef cache * max 32 EpochRef instances * less memory waste in BlockRef by removing EpochRef seq that is mostly unused (~20mb) * less memory waste in dag block lookup by not keeping an extra copy of digest (~70mb) * fix `==` and `$` for Eth2Digest * remove `ChainDAG.tmpState` (~50mb?) all in all, this branch cuts mainnet memory usage by ~160-180mb and puts limits on EpochRef cache usage - where normally it hovered around 950mb before, it's now sitting at 600-700mb on my machine. * docs	2021-03-17 11:17:15 +01:00
Mamy Ratsimbazafy	c47d636cb3	Split Eth2Processor in prep for batching (#2396 ) * Split Eth2Processor in gossip and consensus part and materialize the shared block queue * Update initialization in test_sync_manager	2021-03-11 11:10:57 +01:00
Mamy Ratsimbazafy	f7cddcc8ab	Fix #2393 (#2395 ) * Fix #2393 * check both * Fix shortLog(int64)	2021-03-10 16:53:42 +01:00
tersec	82c300186b	annotate slashing protection v2 with uint64 -> int64 overflow conditions (#2392 ) * annotate slashing protection v2 with uint64 -> int64 overflow conditions * fix variables * remove assertion which gets tripped by interchange tests	2021-03-10 08:35:04 +00:00
Mamy Ratsimbazafy	de1060e7f3	centralize p2p validation in a single file and address https://github.com/status-im/nimbus-eth2/pull/2377#issuecomment-791313118 (#2383 )	2021-03-06 08:32:55 +01:00
Mamy Ratsimbazafy	d47f53cd9d	Reorg (5/5) (#2377 ) * Reorg things left into networking and gossip_processing * time -> beacon_clock * fix builds	2021-03-05 14:12:00 +01:00
Mamy Ratsimbazafy	5d7f9c3a04	Consensus object pools [reorg 4/5] (#2374 ) * Add documentation * make test doesn't try to build the beacon node :/	2021-03-04 10:13:44 +01:00
Mamy Ratsimbazafy	2f17ac7b64	Move SSZ, deposit_contracts & eth1_monitor [reorg files 3/5] (#2371 ) * move deposit_contract * Move SSZ * fix ssz import in tests * move also eth1_monitor * forgot to delete the original * fix comma [skip ci] * Fix "make" & tools imports * Fix import * Fix import again * rename deposit_contract -> eth1 * Revert ssz move to subfolder * path fixes [skip ci]	2021-03-03 07:23:05 +01:00
tersec	2b5a3a6810	remove more int64 usage (#2369 ) * remove more int64 usage * explain loop bounds	2021-03-02 13:40:28 +00:00
Mamy Ratsimbazafy	3276dfc683	Consolidate modules by areas [part 1] (#2365 ) * Move sync in subfolder * move validator related thingies in validators * fix binary builds * update bounds comment [skip ci]	2021-03-02 11:27:45 +01:00

... 4 5 6 7 8

363 Commits