nimbus-eth2

Commit Graph

Author	SHA1	Message	Date
tersec	ff6c581273	keep proposer boosting permanently enabled (#3565 )	2022-04-12 12:06:30 +02:00
Jacek Sieka	4207b127f9	era: load blocks and states (#3394 ) * era: load blocks and states Era files contain finalized history and can be thought of as an alternative source for block and state data that allows clients to avoid syncing this information from the P2P network - the P2P network is then used to "top up" the client with the most recent data. They can be freely shared in the community via whatever means (http, torrent, etc) and serve as a permanent cold store of consensus data (and, after the merge, execution data) for history buffs and bean counters alike. This PR gently introduces support for loading blocks and states in two cases: block requests from rest/p2p and frontfilling when doing checkpoint sync. The era files are used as a secondary source if the information is not found in the database - compared to the database, there are a few key differences: * the database stores the block indexed by block root while the era file indexes by slot - the former is used only in rest, while the latter is used both by p2p and rest. * when loading blocks from era files, the root is no longer trivially available - if it is needed, it must either be computed (slow) or cached (messy) - the good news is that for p2p requests, it is not needed * in era files, "framed" snappy encoding is used while in the database we store unframed snappy - for p2p2 requests, the latter requires recompression while the former could avoid it * front-filling is the process of using era files to replace backfilling - in theory this front-filling could happen from any block and front-fills with gaps could also be entertained, but our backfilling algorithm cannot take advantage of this because there's no (simple) way to tell it to "skip" a range. * front-filling, as implemented, is a bit slow (10s to load mainnet): we load the full BeaconState for every era to grab the roots of the blocks - it would be better to partially load the state - as such, it would also be good to be able to partially decompress snappy blobs * lookups from REST via root are served by first looking up a block summary in the database, then using the slot to load the block data from the era file - however, there needs to be an option to create the summary table from era files to fully support historical queries To test this, `ncli_db` has an era file exporter: the files it creates should be placed in an `era` folder next to `db` in the data directory. What's interesting in particular about this setup is that `db` remains as the source of truth for security purposes - it stores the latest synced head root which in turn determines where a node "starts" its consensus participation - the era directory however can be freely shared between nodes / people without any (significant) security implications, assuming the era files are consistent / not broken. There's lots of future improvements to be had: * we can drop the in-memory `BlockRef` index almost entirely - at this point, resident memory usage of Nimbus should drop to a cool 500-600 mb * we could serve era files via REST trivially: this would drop backfill times to whatever time it takes to download the files - unlike the current implementation that downloads block by block, downloading an era at a time almost entirely cuts out request overhead * we can "reasonably" recreate detailed state history from almost any point in time, turning an O(slot) process into O(1) effectively - we'll still need caches and indices to do this with sufficient efficiency for the rest api, but at least it cuts the whole process down to minutes instead of hours, for arbitrary points in time * CI: ignore failures with Nim-1.6 (temporary) * test fixes Co-authored-by: Ștefan Talpalaru <stefantalpalaru@yahoo.com>	2022-03-23 09:58:17 +01:00
tersec	f0ada15dac	automated CL spec ref URL updates from v1.1.9 to v1.1.10 (#3455 )	2022-03-02 10:00:21 +00:00
tersec	79761c78a4	proc -> func, mainly in spec/state transition and adjecent modules (#3405 )	2022-02-17 11:53:55 +00:00
tersec	5eecb9a21f	rename no{R=>r}eturn, no{I=>i}init, short{l=>L}og, E{T=>t}h2Node, Beacon{c=>C}hainDB (#3403 )	2022-02-16 23:24:44 +01:00
tersec	873a8ec1e6	use isZeroMemory for Eth2Digest comparisons (#3386 ) * use isZeroMemory for Eth2Digest comparisons * use Eth2Digest.isZero abstraction	2022-02-14 05:26:19 +00:00
tersec	d358299875	fork choice proposer boosting support (#3349 ) * fork choice proposer boosting support * detect nodeDelta underflow/overflow	2022-02-04 12:59:40 +01:00
tersec	29e2169585	phase 0 & altair beacon chain and altair validator spec URL updates (#3339 )	2022-01-29 13:53:31 +00:00
Jacek Sieka	61342c2449	limit by-root requests to non-finalized blocks (#3293 ) * limit by-root requests to non-finalized blocks Presently, we keep a mapping from block root to `BlockRef` in memory - this has simplified reasoning about the dag, but is not sustainable with the chain growing. We can distinguish between two cases where by-root access is useful: * unfinalized blocks - this is where the beacon chain is operating generally, by validating incoming data as interesting for future fork choice decisions - bounded by the length of the unfinalized period * finalized blocks - historical access in the REST API etc - no bounds, really In this PR, we limit the by-root block index to the first use case: finalized chain data can more efficiently be addressed by slot number. Future work includes: * limiting the `BlockRef` horizon in general - each instance is 40 bytes+overhead which adds up - this needs further refactoring to deal with the tail vs state problem * persisting the finalized slot-to-hash index - this one also keeps growing unbounded (albeit slowly) Anyway, this PR easily shaves ~128mb of memory usage at the time of writing. * No longer honor `BeaconBlocksByRoot` requests outside of the non-finalized period - previously, Nimbus would generously return any block through this libp2p request - per the spec, finalized blocks should be fetched via `BeaconBlocksByRange` instead. * return `Opt[BlockRef]` instead of `nil` when blocks can't be found - this becomes a lot more common now and thus deserves more attention * `dag.blocks` -> `dag.forkBlocks` - this index only carries unfinalized blocks from now - `finalizedBlocks` covers the other `BlockRef` instances * in backfill, verify that the last backfilled block leads back to genesis, or panic * add backfill timings to log * fix missing check that `BlockRef` block can be fetched with `getForkedBlock` reliably * shortcut doppelganger check when feature is not enabled * in REST/JSON-RPC, fetch blocks without involving `BlockRef` * fix dag.blocks ref	2022-01-21 13:33:16 +02:00
Jacek Sieka	805e85e1ff	time: spring cleaning (#3262 ) Time in the beacon chain is expressed relative to the genesis time - this PR creates a `beacon_time` module that collects helpers and utilities for dealing the time units - the new module does not deal with actual wall time (that's remains in `beacon_clock`). Collecting the time related stuff in one place makes it easier to find, avoids some circular imports and allows more easily identifying the code actually needs wall time to operate. * move genesis-time-related functionality into `spec/beacon_time` * avoid using `chronos.Duration` for time differences - it does not support negative values (such as when something happens earlier than it should) * saturate conversions between `FAR_FUTURE_XXX`, so as to avoid overflows * fix delay reporting in validator client so it uses the expected deadline of the slot, not "closest wall slot" * simplify looping over the slots of an epoch * `compute_start_slot_at_epoch` -> `start_slot` * `compute_epoch_at_slot` -> `epoch` A follow-up PR will (likely) introduce saturating arithmetic for the time units - this is merely code moves, renames and fixing of small bugs.	2022-01-11 11:01:54 +01:00
Jacek Sieka	20e700fae4	Harden CommitteeIndex, SubnetId, SyncSubcommitteeIndex (#3259 ) * Harden CommitteeIndex, SubnetId, SyncSubcommitteeIndex Harden the use of `CommitteeIndex` et al to prevent future issues by using a distinct type, then validating before use in several cases - datatypes in spec are kept simple though so that invalid data still can be read. * fix invalid epoch used in REST `/eth/v1/beacon/states/{state_id}/committees` committee length (could return invalid data) * normalize some variable names * normalize committee index loops * fix `RestAttesterDuty` to use `uint64` for `validator_committee_index` * validate `CommitteeIndex` on ingress in REST API * update rest rules with stricter parsing * better REST serializers * save lots of memory by not using `zip` ...at least a few bytes!	2022-01-09 01:28:49 +02:00
tersec	bac0eaa92e	update 10 modules from using merge to bellatrix (#3257 )	2022-01-07 18:10:40 +01:00
Jacek Sieka	0a4728a241	Handle access to historical data for which there is no state (#3217 ) With checkpoint sync in particular, and state pruning in the future, loading states or state-dependent data may fail. This PR adjusts the code to allow this to be handled gracefully. In particular, the new availability assumption is that states are always available for the finalized checkpoint and newer, but may fail for anything older. The `tail` remains the point where state loading de-facto fails, meaning that between the tail and the finalized checkpoint, we can still get historical data (but code should be prepared to handle this as an error). However, to harden the code against long replays, several operations which are assumed to work only with non-final data (such as gossip verification and validator duties) now limit their search horizon to post-finalized data. * harden several state-dependent operations by logging an error instead of introducing a panic when state loading fails * `withState` -> `withUpdatedState` to differentiate from the other `withState` * `updateStateData` can now fail if no state is found in database - it is also hardened against excessively long replays * `getEpochRef` can now fail when replay fails * reject blocks with invalid target root - they would be ignored previously * fix recursion bug in `isProposed`	2022-01-05 19:38:04 +01:00
tersec	1a6a56bdb1	use BeaconTime instead of Slot in fork choice (#3138 ) * use v1.1.6 test vectors; use BeaconTime instead of Slot in fork choice * tick through every slot at least once * use div INTERVALS_PER_SLOT and use precomputed constants of them * use correct (even if numerically equal) constant	2021-12-21 18:56:08 +00:00
Mamy Ratsimbazafy	97da6e1365	Fork choice EF consensus tests (#3041 ) * add EF fork choice tests to CI * checkpoints * compilation fixes and add test to preset dependent suite * support longpaths on Windows CI * skip minimal tests (long paths issue + impl detals tested) * fix stackoverflow on some platforms * rebase on top of https://github.com/status-im/nimbus-eth2/pull/3054 * fix stack usage	2021-11-25 19:41:39 +01:00
tersec	39f6a6534e	document how to run merge local testnet with Nethermind (#3110 )	2021-11-17 20:45:39 +01:00
tersec	2b2846b468	implement forked merge state/block support (#2890 ) * implement forked state/block support * merge support for containsOrphan; import cleanup; 80-column lines * add merge block header operations and slot sanity fixture * add epoch state transition tests; implement is_valid_gas_limit(), is_merge_block(), is_execution_enabled(), and compute_timestamp_at_slot() * implement process_execution_payload() and add merge deposit operations tests * add merge block sanity tests * add merge case to syncCommitteeParticipants * v1.1.0-beta.5 updates * reduce getTestStates-based memory usage; don't try to REST-serialize ExecutionPayload transactions without underlying support * add execution payload tests; switch var to let in tests/official/	2021-09-27 14:22:58 +00:00
tersec	092d9350de	eth2.0-specs -> consensus-specs repo rename (#2801 )	2021-08-20 23:37:45 +00:00
Jacek Sieka	7a622e8505	rework spec imports (#2779 ) The spec imports are a mess to work with, so this branch cleans them up a bit to ensure that we avoid generic sandwitches and that importing stuff generally becomes easier. * reexport crypto/digest/presets because these are part of the public symbol set of the rest of the spec types * don't export `merge` types from `base` - this causes circular deps * fix circular deps in `ssz/spec_types` - this is the first step in disentangling ssz from spec * be explicit about phase0 vs altair - longer term, `altair` will become the "natural" type set, then merge and so on, so no point in giving `phase0` special preferential treatment	2021-08-12 13:08:20 +00:00
tersec	7577f8c2ef	add blockchain_dag altair database reading; add rollback tests (#2683 ) * add blockchain_dag altair database reading; add rollback tests; fix some unnecessary type conversions * remove debugging scaffolding * proposeSignedBlock() will need to be async for merge; introduce altair types to VC	2021-06-29 15:09:29 +00:00
tersec	b1d5609171	remove false OnBlockAdded dependency on phase0 HashedBeaconState (#2661 ) * remove false OnBlockAdded dependency on phase.HashedBeaconState * introduce altair data types into block_clearance; update some alpha.6 spec refs to alpha.7; add get_active_validator_indices_len ForkedHashedBeaconState wrapper * switch many modules from using datatypes (with phase0 states/blocks) to datatypes/base (fork-independent); update spec refs from alpha.6 to alpha.7 and remove rm'd G2_POINT_AT_INFINITY * switch more modules from using datatypes (with phase0 states/blocks) to datatypes/base (fork-independent); update spec refs from alpha.6 to alpha.7 * remove unnecessary phase0-only wrapper of get_attesting_indices(); allow signatures_batch to process either fork; remove O(n^2) nested loop in process_inactivity_updates(); add altair support to getAttestationsforTestBlock() * add Altair versions of asSigVerified(), asTrusted(), and makeBeaconBlock() * fix spec URL to be Altair for Altair makeBeaconBlock()	2021-06-21 08:35:24 +00:00
Jacek Sieka	7dba1b37dd	remove attestation/aggregate queue (#2519 ) With the introduction of batching and lazy attestation aggregation, it no longer makes sense to enqueue attestations between the signature check and adding them to the attestation pool - this only takes up valuable CPU without any real benefit. * add successfully validated attestations to attestion pool directly * avoid copying participant list around for single-vote attestations, pass single validator index instead * release decompressed gossip memory earlier, specially during async message validation * use cooked signatures in a few more places to avoid reloads and errors * remove some Defect-raising versions of signature-loading * release decompressed data memory before validating message	2021-04-26 22:39:44 +02:00
Mamy Ratsimbazafy	5d7f9c3a04	Consensus object pools [reorg 4/5] (#2374 ) * Add documentation * make test doesn't try to build the beacon node :/	2021-03-04 10:13:44 +01:00
Mamy André-Ratsimbazafy	597374605a	Fork choice update for HF1	2021-02-19 15:57:20 +02:00
tersec	8d25663681	remove several IntSet usages in lieu of seq[ValidatorIndex] (#2288 ) * remove several IntSet usages in lieu of seq[ValidatorIndex] * convert smaller types to larger types * larger type, again	2021-02-08 08:27:30 +01:00
tersec	1bdbf099cc	use IntSet rather than HashSet[ValidatorIndex] (#2267 ) * use IntSet rather than HashSet[ValidatorIndex] * add bounds check before uint64 -> int conversion * use intsets in block transitions * remove superfluous Nim issue explanation/reference	2021-01-26 12:52:00 +01:00
Zahary Karadjov	14b2d4324d	openarray -> openArray	2020-11-03 23:23:10 +02:00
Jacek Sieka	7c0b4d28d2	speed up reward/penalty calculation Calculating rewards/penalties is slow due to how we compute sets of attestations validators then use the sets for inclusion checks, to see who attested. The dominant function during validated block processing / epoch processing is hash set building and lookup. This PR inverts the flow by removing the sets and creating a single large validator status list, then applying all relevant state attestations, then updating rewards and penalties. This provides a 10x speedup to epoch processing which in turn speeds up both empty slot and block processing - for example, on startup, we replay all non-finalized blocks to prime fork choice - the same when validating attestations or replaying states on reorg.	2020-10-23 19:23:36 +03:00
Jacek Sieka	499e5ca991	misc memory and perf fixes (#1899 ) * misc memory and perf fixes * use EpochRef for attestation aggregation * compress effective balances in memory (medalla unfinalized: 4gb -> 1gb) * avoid hitting db when rewinding to head or clearance state * avoid hitting db when blocks can be applied to in-memory state - speeds up startup considerably * avoid storing epochref in fork choice * simplify and speed up beacon block creation flow - avoids state reload thanks to head rewind optimization * iterator-based committee and attestation participation help avoid lots of small memory allocations throughout epoch transition (40% speedup on epoch processing, for example during startup) * add constant for threshold	2020-10-22 12:53:33 +02:00
Dustin Brody	9a543e0af7	partial hotfix for #1879 crash	2020-10-16 11:46:19 +03:00
tersec	b79e5f8af5	update nim-beacon-chain to nimbus-eth2 in beacon_chain/, ncli/, tests/, and README.md (#1843 )	2020-10-08 19:02:05 +00:00
Jacek Sieka	99afafecd7	fix quadratic seq assignment in fork choice (#1805 ) this would reallocate the attestation queue on every attestation and other call to update_time, causing quite the overhead (~10% cpu spent when gossiping)	2020-10-03 23:43:27 +02:00
Mamy Ratsimbazafy	0280d6c73e	Revisiting log levels (#1788 ) * Update log level - https://github.com/status-im/nim-beacon-chain/issues/1779 https://github.com/status-im/nim-beacon-chain/issues/1785 * Address review comments * Document the logging strategy [skip ci]	2020-10-01 20:56:42 +02:00
tersec	02ddc41960	ignore sqlite WAL journals in git; increase logging priority of attestation/block sending (#1590 ) * ignore sqlite WAL journal files in git; switch attestation resolved from info to debug * promote sent attestations/blocks to notice rather than demote resolved attestations/blocks to debug	2020-08-31 14:34:04 +00:00
Jacek Sieka	fa1621db46	implement clock disparity for attestation validation (#1568 ) This implements disparity, resolving a part of https://github.com/status-im/nim-beacon-chain/issues/1367 * make BeaconTime a duration for fractional seconds * factor out attestation/aggregate validation * simplify recording of queued attestations * simplify attestation signature check * fix blocks_received metric * add some trivial validation tests * remove unresolved attestation table - attestations for unknown blocks are dropped instead (cannot verify their signature)	2020-08-27 09:34:12 +02:00
Mamy Ratsimbazafy	81788becfc	Fork choice - almost free pruning - fix #1534 (#1535 ) * initial - cheaper pruning - addresses #1534 * Pass tests: update offset when pruning, proper handling of pruned parents * Use options instead of nil for nilable newHead (finalization passing but rootcause not solved) * First line of defense against stackoverflow in tests * Fix compute_delta offset after pruning * Rebase fix - medalla ready * Remove Option[BlockRef]	2020-08-26 17:23:34 +02:00
Jacek Sieka	9244ae7a38	more speedups * evaluate block attestations under the epochref of the block - this is what the state transition function does * avoid copying attestation seq unnecessarily * avoid unnecessary hashset for unslashed indices	2020-08-19 14:51:04 +03:00
Jacek Sieka	9da8b2692f	simplify fork choice code (#1521 ) * standardize init * avoid loading state on init * avoid some inefficient exception-based code * remove some TODO	2020-08-18 16:56:32 +02:00
Jacek Sieka	79ff4f7c41	fork choice refresh (#1520 ) * add attestation processing queue so attestations don't get processed too early * rework justified slot delay to match spec / lighthouse better * keep less state in fork choice * request epochref less	2020-08-17 20:36:13 +02:00
Jacek Sieka	5da25e76be	avoid rewind in fork choice application (#1489 )	2020-08-12 04:49:52 +00:00
Jacek Sieka	c6674de5d2	use epoch ref to update fork choice this dramatically speeds up startup in long periods of non-finality	2020-08-04 20:00:31 +03:00
Viktor Kirilov	0a96e5f564	renamed CandidateChains to ChainDagRef and made the Quarantine type a ref type so there is a single instance in the beacon node (#1407 )	2020-07-31 14:49:06 +00:00
Viktor Kirilov	c032366547	removed the BlockPool type and all of the proxy functions around it (#1401 ) * removed the BlockPool type and all of the proxy functions around it - passing the chain DAG and the quarantine explicitly where appropriately - they don't need to be bundled in a type * fixed the build after the rebase	2020-07-30 21:18:17 +02:00
Jacek Sieka	c5fecd472f	more fork-choice fixes (#1388 ) * more fork-choice fixes * use target block/epoch to validate attestations * make addLocalValidators sync * add current and previous epoch to cache before doing state transition * update head state using clearance state as a shortcut, when possible * use blockslot for fork choice balances * send attestations using epochref cache * fix invalid finalized parent being used also simplify epoch block traversal * single error handling style in fork choice * import fix, remove unused async	2020-07-30 17:48:25 +02:00
Jacek Sieka	157ddd2ac4	Fork choice fixes 5 (#1381 ) * limit attestations kept in attestation pool With fork choice updated, the attestation pool only needs to keep track of attestations that will eventually end up in blocks - we can thus limit the horizon of attestations that we keep more aggressively. To get here, we expose getEpochRef which gets metadata about a particular epochref, and make sure to populate it when a block is added - this ensures that state rewinds during block addition are minimized. In addition, we'll use the target root/epoch when validating attestations - this helps minimize the number of different states that we need to rewind to, in general. * remove CandidateChains.justifiedState unused * remove BlockPools.Head object * avoid quadratic quarantine loop * fix	2020-07-28 13:54:32 +00:00
Jacek Sieka	fd4d319450	Use fork v2 (#1358 ) * fork choice fixes, round 3 * introduce checkpoint tracker * split out fork choice backend that is independent of dag * correctly update best checkpoint to use for head selection * correctly consider wall clock when processing attestations * preload head history only (only one history is loaded from database anyway) * love the DAG * switch to fork choice v2 also remove BlockRef.children * fix	2020-07-25 21:41:12 +02:00
Jacek Sieka	fb2f742972	Fork choice fixes 2 (#1356 ) * fork choice cleanup * enable v2 pruning * prefer `get_current_epoch` * fix finalization check to use correct epoch * small cleanups * add `count_active_validators` * remove misleading logs * fix justified checkpoint slot calculation in rpc	2020-07-22 23:01:44 +02:00
Jacek Sieka	f0720faf17	Fork choice fixes (#1350 ) * remove cruft * reenable fork choice and fix several issues * in addForkChoice_v2, the `.error` field would be accessed even when Result is ok * remove workaround for invalid block structure in fork choice * fix `tmpState` being used recursively in callback, causing state corruption while processing attestation * fix block callback being called twice per block * pass state to callback to avoid unnecessary rewinding * enable head select, fix another bug * never use `get` without `isOk` * log nil blockref in case blockref is nil * add missing error checking * use correct epoch when updating attestation message	2020-07-22 11:42:55 +02:00
Jacek Sieka	8b01284b0e	cache block hash (#1329 ) hash_tree_root was turning up when running beacon_node, turns out to be repeated hash_tree_root invocations - this pr brings them back down to normal. this PR caches the root of a block in the SignedBeaconBlock object - this has the potential downside that even invalid blocks will be hashed (as part of deserialization) - later, one could imagine delaying this until checks have passed there's also some cleanup of the `cat=` logs which were applied randomly and haphazardly, and to a large degree are duplicated by other information in the log statements - in particular, topics fulfill the same role	2020-07-16 15:16:51 +02:00
Mamy Ratsimbazafy	3cdae9f6be	Dual headed fork choice [Revolution] (#1238 ) * Dual headed fork choice * fix finalizedEpoch not moving * reduce fork choice verbosity * Add failing tests due to pruning * Properly handle duplicate blocks in sync * test_block_pool also add a test for duplicate blocks * comments addressing review * Fix fork choice v2, was missing integrating block proposed * remove a spurious debug writeStackTrace * update block_sim * Use OrderedTable to ensure that we always load parents before children in fork choice * Load the DAG data in fork choice at init if there is some (can sync witti) * Cluster of quarantined blocks were not properly added to the fork choice * Workaround async gcsafe warnings * Update blockpoool tests * Do the callback before clearing the quarantine * Revert OrderedTable, implement topological sort of DAG, allow forkChoice to be initialized from arbitrary finalized heads * Make it work with latest devel - Altona readyness * Add a recovery mechanism when forkchoice desyncs with blockpool * add the current problematic node to the stack * Fix rebase indentation bug (but still producing invalid block) * Fix cache at epoch boundaries and lateBlock addition	2020-07-09 11:29:32 +02:00

1 2

57 Commits