nimbus-eth2

Commit Graph

Author	SHA1	Message	Date
Jacek Sieka	05ffe7b2bf	Prune `BlockRef` on finalization (#3513) Up until now, the block dag has been using `BlockRef`, a structure adapted for a full DAG, to represent all of chain history. This is a correct and simple design, but does not exploit the linearity of the chain once parts of it finalize. By pruning the in-memory `BlockRef` structure at finalization, we save, at the time of writing, a cool ~250mb (or 25%:ish) chunk of memory landing us at a steady state of ~750mb normal memory usage for a validating node. Above all though, we prevent memory usage from growing proportionally with the length of the chain, something that would not be sustainable over time - instead, the steady state memory usage is roughly determined by the validator set size which grows much more slowly. With these changes, the core should remain sustainable memory-wise post-merge all the way to withdrawals (when the validator set is expected to grow). In-memory indices are still used for the "hot" unfinalized portion of the chain - this ensures that consensus performance remains unchanged. What changes is that for historical access, we use a db-based linear slot index which is cache-and-disk-friendly, keeping the cost for accessing historical data at a similar level as before, achieving the savings at no perceivable cost to functionality or performance. A nice collateral benefit is the almost-instant startup since we no longer load any large indices at dag init. The cost of this functionality instead can be found in the complexity of having to deal with two ways of traversing the chain - by `BlockRef` and by slot. 
* use `BlockId` instead of `BlockRef` where finalized / historical data may be required * simplify clearance pre-advancement * remove dag.finalizedBlocks (~50:ish mb) * remove `getBlockAtSlot` - use `getBlockIdAtSlot` instead * `parent` and `atSlot` for `BlockId` now require a `ChainDAGRef` instance, unlike `BlockRef` traversal * prune `BlockRef` parents on finality (~200:ish mb) * speed up ChainDAG init by not loading finalized history index * mess up light client server error handling - this needs revisiting :)	2022-03-17 17:42:56 +00:00
Jacek Sieka	c64bf045f3	remove StateData (#3507 ) One more step on the journey to reduce `BlockRef` usage across the codebase - this one gets rid of `StateData` whose job was to keep track of which block was last assigned to a state - these duties have now been taken over by `latest_block_root`, a fairly recent addition that computes this block root from state data (at a small cost that should be insignificant) 99% mechanical change.	2022-03-16 08:20:40 +01:00
Jacek Sieka	4363215a32	relax `BlockRef` database assumptions (#3472 ) * remove `getForkedBlock(BlockRef)` which assumes block data exists but doesn't support archive/backfilled blocks * fix REST `/eth/v1/beacon/headers` request not returning archive/backfilled blocks * avoid re-encoding in REST block SSZ requests (using `getBlockSSZ`)	2022-03-11 13:08:17 +01:00
zah	9c1ff78f84	Fix a reward calculation bug affecting Prater epoch 64781 (#3428 ) To calculate the deltas correctly, the `process_inactivity_updates` function must be called before the rewards and penalties processing code in order to update the `inactivity_scores` field in the state. This would have required duplicating more logic from the spec in the ncli modules, so I've decided to pay the price of introducing a run-time copy of the state at each epoch which eliminates the need to duplicate logic (both for this fix and the previous one). Other changes: * Fixes for the read-only mode of the `BeaconChainDb` * Fix an uint64 underflow in the debug output procedure for printing balance deltas * Allow Bellatrix states in the reward computation helpers	2022-02-22 14:14:17 +02:00
Jacek Sieka	adfe655b16	db: make block loading generic (#3413 ) Streamline lookup with Forky and BeaconBlockFork (then we can do the same for era) We use type to avoid conditionals, as fork is often already known at a "higher" level. * load blockid before loading block by root - this is needed to map root to slot and will eventually be done via block summary table for "old" blocks Co-authored-by: tersec <tersec@users.noreply.github.com>	2022-02-21 09:48:02 +01:00
Jacek Sieka	a88427bd39	ncli_db: more readonly support (#3411 ) Update several `ncli_db` commands to run in readOnly mode, allowing them to be used with a running instance - in particular era export. * export all eras by default * skip already-exported eras	2022-02-18 07:37:44 +01:00
tersec	9c18765b3b	remove ncli_db pruneDatabase (#3356 )	2022-02-03 20:03:01 +01:00
Zahary Karadjov	ac16eb4691	Streamline the validator reward analysis Notable improvements: * A separate aggregation pass is no longer required. * The user can opt to produce only aggregated data (resulting in a much smaller data set). * A large portion of the number crunching in Jupyter is now done in C through the rich DataFrames API. * Added support for comparisons against the "median" validator performance in the network.	2022-02-01 11:30:14 +02:00
Jacek Sieka	d076e1a11b	ncli_db: import states and blocks from era file (#3313 )	2022-01-25 09:28:26 +01:00
Zahary Karadjov	54a745cb0e	Bugfix: Take into account the finalization delay in the ncli_db rewards calculation This fixes a reward calculation error affecting Prater's epoch 31256	2022-01-23 23:10:56 +02:00
Jacek Sieka	61342c2449	limit by-root requests to non-finalized blocks (#3293 ) * limit by-root requests to non-finalized blocks Presently, we keep a mapping from block root to `BlockRef` in memory - this has simplified reasoning about the dag, but is not sustainable with the chain growing. We can distinguish between two cases where by-root access is useful: * unfinalized blocks - this is where the beacon chain is operating generally, by validating incoming data as interesting for future fork choice decisions - bounded by the length of the unfinalized period * finalized blocks - historical access in the REST API etc - no bounds, really In this PR, we limit the by-root block index to the first use case: finalized chain data can more efficiently be addressed by slot number. Future work includes: * limiting the `BlockRef` horizon in general - each instance is 40 bytes+overhead which adds up - this needs further refactoring to deal with the tail vs state problem * persisting the finalized slot-to-hash index - this one also keeps growing unbounded (albeit slowly) Anyway, this PR easily shaves ~128mb of memory usage at the time of writing. * No longer honor `BeaconBlocksByRoot` requests outside of the non-finalized period - previously, Nimbus would generously return any block through this libp2p request - per the spec, finalized blocks should be fetched via `BeaconBlocksByRange` instead. 
* return `Opt[BlockRef]` instead of `nil` when blocks can't be found - this becomes a lot more common now and thus deserves more attention * `dag.blocks` -> `dag.forkBlocks` - this index only carries unfinalized blocks from now - `finalizedBlocks` covers the other `BlockRef` instances * in backfill, verify that the last backfilled block leads back to genesis, or panic * add backfill timings to log * fix missing check that `BlockRef` block can be fetched with `getForkedBlock` reliably * shortcut doppelganger check when feature is not enabled * in REST/JSON-RPC, fetch blocks without involving `BlockRef` * fix dag.blocks ref	2022-01-21 13:33:16 +02:00
Zahary Karadjov	8a7cdc61f6	[ncli db] Add a requirements file for the Jupyter notebook	2022-01-18 20:24:20 +02:00
tersec	2f635d3337	rename *_{MERGE => BELLATRIX} constant names (#3296 )	2022-01-18 16:31:05 +00:00
tersec	9c0c9c98ce	complete switch to beacon_chain/specs/datatypes/bellatrix (#3295 )	2022-01-18 13:36:52 +00:00
Zahary Karadjov	47f1f7ff1a	More efficient reward data persistence; Address review comments The new format is based on compressed CSV files in two channels: * Detailed per-epoch data * Aggregated "daily" summaries The use of append-only CSV files significantly speeds up epoch processing during data generation. The use of compression results in smaller storage requirements overall. The use of the aggregated files has a very minor cost in both CPU and storage, but leads to near interactive speed for report generation. Other changes: - Implemented support for graceful shut downs to avoid corrupting the saved files. - Fixed a memory leak caused by lacking `StateCache` clean up on each iteration. - Addressed review comments - Moved the rewards and penalties calculation code in a separate module Required invasive changes to existing modules: - The `data` field of the `KeyedBlockRef` type is made public to be used by the validator rewards monitor's Chain DAG update procedure. - The `getForkedBlock` procedure from the `blockchain_dag.nim` module is made public to be used by the validator rewards monitor's Chain DAG update procedure.	2022-01-18 01:56:56 +02:00
Zahary Karadjov	29aad0241b	Precise per-component ETH-denominated rewards tracking This is an alternative take on https://github.com/status-im/nimbus-eth2/pull/3107 that aims for more minimal interventions in the spec modules at the expense of duplicating more of the spec logic in ncli_db.	2022-01-18 01:56:56 +02:00
Jacek Sieka	836f6984bb	move `state_transition` to `Result` (#3284 ) * better error messages in api * avoid `BlockData` copies when replaying blocks	2022-01-17 12:19:58 +01:00
Jacek Sieka	805e85e1ff	time: spring cleaning (#3262) Time in the beacon chain is expressed relative to the genesis time - this PR creates a `beacon_time` module that collects helpers and utilities for dealing with the time units - the new module does not deal with actual wall time (that remains in `beacon_clock`). Collecting the time related stuff in one place makes it easier to find, avoids some circular imports and allows more easily identifying the code that actually needs wall time to operate. * move genesis-time-related functionality into `spec/beacon_time` * avoid using `chronos.Duration` for time differences - it does not support negative values (such as when something happens earlier than it should) * saturate conversions between `FAR_FUTURE_XXX`, so as to avoid overflows * fix delay reporting in validator client so it uses the expected deadline of the slot, not "closest wall slot" * simplify looping over the slots of an epoch * `compute_start_slot_at_epoch` -> `start_slot` * `compute_epoch_at_slot` -> `epoch` A follow-up PR will (likely) introduce saturating arithmetic for the time units - this is merely code moves, renames and fixing of small bugs.	2022-01-11 11:01:54 +01:00
Jacek Sieka	20e700fae4	Harden CommitteeIndex, SubnetId, SyncSubcommitteeIndex (#3259 ) * Harden CommitteeIndex, SubnetId, SyncSubcommitteeIndex Harden the use of `CommitteeIndex` et al to prevent future issues by using a distinct type, then validating before use in several cases - datatypes in spec are kept simple though so that invalid data still can be read. * fix invalid epoch used in REST `/eth/v1/beacon/states/{state_id}/committees` committee length (could return invalid data) * normalize some variable names * normalize committee index loops * fix `RestAttesterDuty` to use `uint64` for `validator_committee_index` * validate `CommitteeIndex` on ingress in REST API * update rest rules with stricter parsing * better REST serializers * save lots of memory by not using `zip` ...at least a few bytes!	2022-01-09 01:28:49 +02:00
Jacek Sieka	ba99c8fe4f	update era file documentation / impl (#3226 ) Overhaul of era files, including documentation and reference implementations * store blocks, then state, then slot indices for easy lookup at low cost * document era file rationale * altair+ support in era writer	2022-01-07 11:13:19 +01:00
Jacek Sieka	0e2b4e39fa	REST JSON support improvements (#3232 ) * support downloading blocks / states via JSON in addition to SSZ - slow, but needed for infura support - SSZ is still used when server supports it * use common forked block/state reader in REST API * fix stack overflows in REST JSON decoder * fix invalid serialization of `justification_bits` in `/eth/v1/debug/beacon/states` and `/eth/v2/debug/beacon/states` * fix REST client to use `/eth/...` instead of `/api/eth/...`, update "default" urls to expose REST api via `/eth` as well as this is what the standard says - `/api` was added early on based on an example "base url" in the spec that has been removed since * expose Nimbus REST extensions via `/nimbus` in addition to `/api/nimbus` to stay consistent with `/eth` * fix invalid state root when reading states via REST * fix recursive imports in `spec/ssz_codec` * remove usages of `serialization.useCustomSerialization` - fickle	2022-01-06 08:38:40 +01:00
Jacek Sieka	0a4728a241	Handle access to historical data for which there is no state (#3217 ) With checkpoint sync in particular, and state pruning in the future, loading states or state-dependent data may fail. This PR adjusts the code to allow this to be handled gracefully. In particular, the new availability assumption is that states are always available for the finalized checkpoint and newer, but may fail for anything older. The `tail` remains the point where state loading de-facto fails, meaning that between the tail and the finalized checkpoint, we can still get historical data (but code should be prepared to handle this as an error). However, to harden the code against long replays, several operations which are assumed to work only with non-final data (such as gossip verification and validator duties) now limit their search horizon to post-finalized data. * harden several state-dependent operations by logging an error instead of introducing a panic when state loading fails * `withState` -> `withUpdatedState` to differentiate from the other `withState` * `updateStateData` can now fail if no state is found in database - it is also hardened against excessively long replays * `getEpochRef` can now fail when replay fails * reject blocks with invalid target root - they would be ignored previously * fix recursion bug in `isProposed`	2022-01-05 19:38:04 +01:00
tersec	b81c06edab	rename Beacon{Block,State}Fork.Merge to Bellatrix; update copyright years (#3240 )	2022-01-04 09:45:38 +00:00
Jacek Sieka	1021e3324e	Revert writing backfill root to database (#3215 ) Introduced in #3171, it turns out we can just follow the block headers to achieve the same effect * leaves the constant in the code so as to avoid confusion when reading database that had the constant written (such as the fleet nodes and other unstable users)	2021-12-21 11:40:14 +01:00
Jacek Sieka	c270ec21e4	Validator monitoring (#2925) Validator monitoring based on and mostly compatible with the implementation in Lighthouse - tracks additional logs and metrics for specified validators so as to stay on top of performance. The implementation works more or less the following way: * Validator pubkeys are singled out for monitoring - these can be running on the node or not * For every action that the validator takes, we record steps in the process such as messages being seen on the network or published in the API * When the dust settles at the end of an epoch, we report the information from one epoch before that, which coincides with the balances being updated - this is a tradeoff between being correct (waiting for finalization) and providing relevant information in a timely manner	2021-12-20 20:20:31 +01:00
Jacek Sieka	a223d62b07	Cleanups (#3123 ) Renames and cleanups split out from the validator monitoring branch, so as to reduce conflict area vs other PR:s * add constants for expected message timing * name validators after the messages they validate, mostly, to make grepping easier * unify field naming of EpochInfo across forks to make cross-fork code easier	2021-11-25 13:20:36 +01:00
Jacek Sieka	f19a497eec	ncli_db: add putState, putBlock (#3096 ) * ncli_db: add putState, putBlock These tools allow modifying an existing nimbus database for the purpose of recovery or reorg, moving the head, tail and genesis to arbitrary points. * remove potentially expensive `putState` in `BeaconStateDB` * introduce `latest_block_root` which computes the root of the latest applied block from the `latest_block_header` field (instead of passing it in separately) * avoid some unnecessary BeaconState copies during init * discover https://github.com/nim-lang/Nim/issues/19094 * prefer `HashedBeaconState` in a few places to avoid recomputing state root * fetch latest block root from state when creating blocks * harden `get_beacon_proposer_index` against invalid slots and document * move random spec function tests to `test_spec.nim` * avoid unnecessary state root computation before block proposal	2021-11-18 13:02:43 +01:00
Jacek Sieka	ec650c7fd7	Support starting from altair (#3054 ) * Support starting from altair * hide `finalized-checkpoint-` - they are incomplete and usage may cause crashes * remove genesis detection code (broken, obsolete) * enable starting ChainDAG from altair checkpoints - this is a prerequisite for checkpoint sync (TODO: backfill) * tighten checkpoint state conditions * show error when starting from checkpoint with existing database (not supported) * print rest-compatible JSON in ncli/state_sim * altair/merge support in ncli * more altair/merge support in ncli_db * pre-load header to speed up loading * fix forked block decoding	2021-11-10 13:39:08 +02:00
Jacek Sieka	a086cf01ac	altair fork handling cleanups (#3050) * fix stack overflow crash in REST/debug/getStateV2 * introduce `ForkyXxx` for generic type matching of `Xxx` across branches (SomeHashedBeaconState -> ForkyHashedBeaconState et al) - `Some` is already used for other types of type classes * consolidate function naming in BeaconChainDB, use some generics * import `forks.nim` from other spec modules and move `Forked` helpers around to resolve circular imports * remove `ForkedBeaconState`, use `ForkedHashedBeaconState` throughout (less data shuffling between the types) * fix several cases of states being stored on stack in tests, causing random failures on some platforms * remove reading json support from ncli - this should be ported to the rest json reading instead (doesn't currently work because stack sizes)	2021-11-05 08:34:34 +01:00
Jacek Sieka	c40cc6cec1	clean up fork enum and field names * single naming strategy * simplify some fork code * simplify forked block production	2021-10-19 11:06:38 +03:00
Jacek Sieka	f90b2b8b1f	reward accounting for altair+ (#2981) Similar to the existing `RewardInfo`, this PR adds the infrastructure needed to export epoch processing information from altair+. Because accounting is done somewhat differently, the PR uses a fork-specific object to extract the information in order to make the cost on the spec side low. * RewardInfo -> EpochInfo, ForkedEpochInfo * use array for computing new sync committee * avoid repeated total active balance computations in block processing * simplify proposer index check * simplify epoch transition tests * pre-compute base increment and reuse in epoch processing, and a few other small optimizations This PR introduces the type and does the heavy lifting in terms of refactoring - the tools that use the accounting will need separate PR:s (as well as refinements to the exported information)	2021-10-13 16:24:36 +02:00
Jacek Sieka	850bec6ae1	speed up `validatorDb`/`validatorPerf` (#2885 ) avoid needless hashing and use bigger txn:s	2021-09-30 15:21:06 +02:00
tersec	43a976f89b	proc -> func in ncli/, research/, and test/ (#2818 )	2021-08-25 14:51:52 +00:00
Jacek Sieka	a7a65bce42	disentangle eth2 types from the ssz library (#2785 ) * reorganize ssz dependencies This PR continues the work in https://github.com/status-im/nimbus-eth2/pull/2646, https://github.com/status-im/nimbus-eth2/pull/2779 as well as past issues with serialization and type, to disentangle SSZ from eth2 and at the same time simplify imports and exports with a structured approach. The principal idea here is that when a library wants to introduce SSZ support, they do so via 3 files: * `ssz_codecs` which imports and reexports `codecs` - this covers the basic byte conversions and ensures no overloads get lost * `xxx_merkleization` imports and exports `merkleization` to specialize and get access to `hash_tree_root` and friends * `xxx_ssz_serialization` imports and exports `ssz_serialization` to specialize ssz for a specific library Those that need to interact with SSZ always import the `xxx_` versions of the modules and never `ssz` itself so as to keep imports simple and safe. This is similar to how the REST / JSON-RPC serializers are structured in that someone wanting to serialize spec types to REST-JSON will import `eth2_rest_serialization` and nothing else. 
* split up ssz into a core library that is independent of eth2 types * rename `bytes_reader` to `codec` to highlight that it contains coding and decoding of bytes and native ssz types * remove tricky List init overload that causes compile issues * get rid of top-level ssz import * reenable merkleization tests * move some "standard" json serializers to spec * remove `ValidatorIndex` serialization for now * remove test_ssz_merkleization * add tests for over/underlong byte sequences * fix broken seq[byte] test - seq[byte] is not an SSZ type There are a few things this PR doesn't solve: * like #2646 this PR is weak on how to handle root and other dontSerialize fields that "sometimes" should be computed - the same problem appears in REST / JSON-RPC etc * Fix a build problem on macOS * Another way to fix the macOS builds Co-authored-by: Zahary Karadjov <zahary@gmail.com>	2021-08-18 20:57:58 +02:00
tersec	89d9aa9240	sync committee topic names; borrow -> template/distinctbase (#2781 )	2021-08-15 16:50:31 +02:00
Jacek Sieka	7a622e8505	rework spec imports (#2779) The spec imports are a mess to work with, so this branch cleans them up a bit to ensure that we avoid generic sandwiches and that importing stuff generally becomes easier. * reexport crypto/digest/presets because these are part of the public symbol set of the rest of the spec types * don't export `merge` types from `base` - this causes circular deps * fix circular deps in `ssz/spec_types` - this is the first step in disentangling ssz from spec * be explicit about phase0 vs altair - longer term, `altair` will become the "natural" type set, then merge and so on, so no point in giving `phase0` special preferential treatment	2021-08-12 13:08:20 +00:00
Jacek Sieka	9697b73e71	forkedbeaconstate_helpers -> forks (#2772 ) Simpler module name for stuff that covers forks * check that runtime config matches database state * also include some assorted altair cleanups * use "standard" genesis fork in local testnet to work around missing runtime config support	2021-08-10 22:46:35 +02:00
Jacek Sieka	3f9c1fdf4e	More RuntimeConfig cleanup (#2716 ) * remove from BeaconChainDB (doesn't depend on runtime config) * eth2-testnets -> eth2-networks * use `cfg` name throughout	2021-07-13 16:27:10 +02:00
Jacek Sieka	23eea197f6	Implement split preset/config support (#2710 ) * Implement split preset/config support This is the initial bulk refactor to introduce runtime config values in a number of places, somewhat replacing the existing mechanism of loading network metadata. It still needs more work, this is the initial refactor that introduces runtime configuration in some of the places that need it. The PR changes the way presets and constants work, to match the spec. In particular, a "preset" now refers to the compile-time configuration while a "cfg" or "RuntimeConfig" is the dynamic part. A single binary can support either mainnet or minimal, but not both. Support for other presets has been removed completely (can be readded, in case there's need). There's a number of outstanding tasks: * `SECONDS_PER_SLOT` still needs fixing * loading custom runtime configs needs redoing * checking constants against YAML file * yeerongpilly support `build/nimbus_beacon_node --network=yeerongpilly --discv5:no --log-level=DEBUG` * load fork epoch from config * fix fork digest sent in status * nicer error string for request failures * fix tools * one more * fixup * fixup * fixup * use "standard" network definition folder in local testnet Files are loaded from their standard locations, including genesis etc, to conform to the format used in the `eth2-networks` repo. * fix launch scripts, allow unknown config values * fix base config of rest test * cleanups * bundle mainnet config using common loader * fix spec links and names * only include supported preset in binary * drop yeerongpilly, add altair-devnet-0, support boot_enr.yaml	2021-07-12 15:01:38 +02:00
tersec	445def6c8b	block_clearance, ncli, and ncli_db Altair state saving (#2672 ) * block_clearance, ncli, and ncli_db Altair state saving * avoid invalidating SSZ hash caches with every assignment	2021-06-24 18:34:08 +00:00
tersec	146fa48454	use ForkedHashedBeaconState in StateData (#2634 ) * use ForkedHashedBeaconState in StateData * fix FAR_FUTURE_EPOCH -> slot overflow; almost always use assign() * avoid stack allocation in maybeUpgradeStateToAltair() * create and use dispatch functions for check_attester_slashing(), check_proposer_slashing(), and check_voluntary_exit() * use getStateRoot() instead of various state.data.hbsPhase0.root * remove withStateVars.hashedState(), which doesn't work as a design anymore * introduce spec/datatypes/altair into beacon_chain_db * fix inefficient codegen for getStateField(largeStateField) * state_transition_slots() doesn't either need/use blocks or runtime presets * combine process_slots(HBS)/state_transition_slots(HBS) which differ only in last-slot htr optimization * getStateField(StateData, ...) was replaced by getStateField(ForkedHashedBeaconState, ...) * fix rollback * switch some state_transition(), process_slots, makeTestBlocks(), etc to use ForkedHashedBeaconState * remove state_transition(phase0.HashedBeaconState) * remove process_slots(phase0.HashedBeaconState) * remove state_transition_block(phase0.HashedBeaconState) * remove unused callWithBS(); separate case expression from if statement * switch back from nested-ref-object construction to (ref Foo)(Bar())	2021-06-11 20:51:46 +03:00
tersec	8ebd496fbe	Altair transition tests (#2624 ) * Working Altair transition tests * with fixed upstream test vectors, remove state root workaround * switch upgrade_to_altair() to returning a reference * remove test_state_transition * fix invalid fork state/block combinations error messages * avoid memory copies by reintroducing state_transition_slots(var SomeHashedBeaconState)	2021-06-04 10:38:00 +00:00
tersec	28a5bca71a	split state_transition() into slots/block parts and use only block where appropriate (#2630 )	2021-06-03 11:42:25 +02:00
Jacek Sieka	d16da06c92	ncli_db: validator performance database tool Record attestation performance per epoch in sqlite database	2021-05-27 19:14:26 +03:00
Jacek Sieka	584fcd50c1	ncli: fix inclusion distance statistic (#2587 )	2021-05-24 10:40:45 +02:00
tersec	0b0bfd1de0	use StateData in place of BeaconState outside state transition code (#2551 ) * use StateData in place of BeaconState outside state transition code * propagate more StateData usage * remove withStateVars().state * wrap get_beacon_committee(BeaconState, ...) as gbc(StateData, ...) * switch makeAttestation() to use StateData * use StateData wrapper/dispatcher for get_committee_count_per_slot() * convert AttestationCache.init(), weak subjectivity functions, and updateValidatorMetrics() * add get_shuffled_active_validator_indices(StateData) and get_block_root_at_slot(StateData) * switch makeAttestationData() to StateData * sync AllTests-mainnet.md after rebase	2021-05-21 09:23:28 +00:00
Jacek Sieka	97f4e1fffe	Db1 cont (#2573 ) * Revert "Revert "Upgrade database schema" (#2570)" This reverts commit `6057c2ffb4`. * ssz: fix loading empty lists into existing instances Not a problem earlier because we didn't reuse instances * bump nim-eth * bump nim-web3	2021-05-17 18:37:26 +02:00
tersec	6057c2ffb4	Revert "Upgrade database schema" (#2570 ) This reverts commit `22ddf74752`.	2021-05-17 06:34:44 +00:00
Jacek Sieka	22ddf74752	Upgrade database schema The `kvstore` design we're using now turns out to not be the best way to use `sqlite` - in particular, there are some significant benefits to using rowid in certain situations and to keep data in separate tables. With this branch, there are massive improvements in startup time (seconds instead of minutes) and state/block storage and pruning times (milliseconds instead of seconds) - these improvements can in particular be seen on slow drives and translate directly into better attestation performance. * update kvstore to new keyspace design * remove `DirStoreRef` and the hidden `--state-db-kind` option - this was an experiment to store large blobs in files, but with the new kvstore, there's no compelling reason to do so * remove `DbMap` - unused and would need updating for new keyspace design * introduce separate tables for each data type (blocks, states etc) * remove "WITHOUT ROWID" pessimization for tables with large blobs * close DbSeq statements explicitly (and earlier) * store beacon block summaries in separate table, without SSZ compression and load them all with single query on startup * stop storing backwards compat full states * mark genesis beacon block as trusted * avoid faststreams when loading SSZ data * remove `DisagreementBehavior` (unused)	2021-05-14 20:05:23 +03:00
tersec	39da640beb	use getStateField() in ncli_db (#2549 )	2021-05-07 13:14:20 +00:00
Jacek Sieka	646923c3dd	add attestation stats tool to ncli_db (#2539 ) This also makes future efforts to provide metrics and logs for attestation efficiency easier * Export rewards from epoch transition * Use less memory for reward calculation (bool -> set[enum], field alignment) * Reuse reward memory when replaying, avoiding spike * Allow replaying any range in ncli_db benchmark	2021-05-07 13:36:21 +02:00
Jacek Sieka	beceb060c4	Write state diffs to separate table (and experimentally, files instead of db) (#2460 )	2021-04-06 21:56:45 +03:00
Jacek Sieka	3cb31e66b4	set upper bound on EpochRef cache (#2403 ) * set upper bound on EpochRef cache * max 32 EpochRef instances * less memory waste in BlockRef by removing EpochRef seq that is mostly unused (~20mb) * less memory waste in dag block lookup by not keeping an extra copy of digest (~70mb) * fix `==` and `$` for Eth2Digest * remove `ChainDAG.tmpState` (~50mb?) all in all, this branch cuts mainnet memory usage by ~160-180mb and puts limits on EpochRef cache usage - where normally it hovered around 950mb before, it's now sitting at 600-700mb on my machine. * docs	2021-03-17 11:17:15 +01:00
tersec	8def2486b0	immutable validator database factoring (#2297 ) * initial immutable validator database factoring * remove changes from chain_dag: this abstraction properly belongs in beacon_chain_db * add merging mutable/immutable validator portions; individually test database roundtripping of immutable validators and states-sans-immutable-validators * update test summaries * use stew/assign2 instead of Nim assignment * add reading/writing of immutable validators in chaindag * remove unused import * replace chunked k/v store of immutable validators with per-row SQL table storage * use List instead of HashList * un-stub some ncli_db code so that it uses * switch HashArray to array; move BeaconStateNoImmutableValidators from datatypes to beacon_chain_db * begin only-mutable-part state storage * uncomment some assigns * work around https://github.com/nim-lang/Nim/issues/17253 * fix most of the issues/oversights; local sim runs again * fix test suite by adding missing beaconstate field to copy function * have ncli bench also store immutable validators * extract some immutable-validator-specific code from the beacon chain db module * add more rigorous database state roundtripping, with changing validator sets * adjust ncli_db to use new schema * simplify putState/getState by moving all immutable validator accounting into beacon state DB * remove redundant test case and move code to immutable-beacon-chain module * more efficient, but still brute-force, mutable+immutable validator merging * reuse BeaconState in getState * ensure HashList/HashArray caches are cleared when reusing getState buffers; add ncli_db and a unit test to verify this * HashList.clear() -> HashList.clearCache() * only copy incrementally necessary immutable validators * increase strictness of test cases and fix/work around resulting HashList cache invalidation issues * remove explanatory scaffolding * allow for storage of full (with all validators) states for backwards/forwards-compatibility * adjust DbSeq type usage * store full, with-validators, state every 64 epochs to enable reverting versions * reduce memory allocation and intermediate objects in state storage codepath * eliminate allocation/copying through intermediate BeaconStateNoImmutableValidators objects * skip benchmarking initial genesis-validator-heavy state store * always store new-style state and sometimes old-style state * document intent behind BeaconState/Validator type-punnery * more accurate failure message on SQLite in-memory database initialization failure	2021-03-15 14:11:51 +00:00
Jacek Sieka	aabdd34704	e2store: add era format (#2382 ) Era files contain 8192 blocks and a state corresponding to the length of the array holding block roots in the state, meaning that each block is verifiable using the pubkeys and block roots from the state. Of course, one would need to know the root of the state as well, which is available in the first block of the _next_ file - or known from outside. This PR also adds an implementation to write e2s, e2i and era files, as well as a python script to inspect them. All in all, the format is very similar to what goes on in the network requests, meaning it can trivially serve as a backing format for serving said requests. Mainnet, up to the first 671k slots, takes up 3.5gb - in each era file, the BeaconState contributes about 9mb at current validator set sizes, up from ~3mb in the early blocks, for a grand total of ~558mb for the 82 eras tested - this overhead could potentially be calculated but one would lose the ability to verify individual blocks (eras could still be verified using historical roots). ``` -rw-rw-r--. 1 arnetheduck arnetheduck 16 5 mar 11.47 ethereum2-mainnet-00000000-00000001.e2i -rw-rw-r--. 1 arnetheduck arnetheduck 1,8M 5 mar 11.47 ethereum2-mainnet-00000000-00000001.e2s -rw-rw-r--. 1 arnetheduck arnetheduck 65K 5 mar 11.47 ethereum2-mainnet-00000001-00000001.e2i -rw-rw-r--. 1 arnetheduck arnetheduck 18M 5 mar 11.47 ethereum2-mainnet-00000001-00000001.e2s ... -rw-rw-r--. 1 arnetheduck arnetheduck 65K 5 mar 11.52 ethereum2-mainnet-00000051-00000001.e2i -rw-rw-r--. 1 arnetheduck arnetheduck 68M 5 mar 11.52 ethereum2-mainnet-00000051-00000001.e2s -rw-rw-r--. 1 arnetheduck arnetheduck 61K 5 mar 11.11 ethereum2-mainnet-00000052-00000001.e2i -rw-rw-r--. 1 arnetheduck arnetheduck 62M 5 mar 11.11 ethereum2-mainnet-00000052-00000001.e2s ```	2021-03-15 11:31:39 +01:00
Dustin Brody	97504fdb9d	ncli_db pruneDatabase checkpointing; remove onSlotEnd lookaheadTime	2021-03-12 23:15:46 +02:00
tersec	6533999c82	benchmark state loading in ncli_db (#2400 )	2021-03-12 10:02:09 +00:00
Mamy Ratsimbazafy	d47f53cd9d	Reorg (5/5) (#2377 ) * Reorg things left into networking and gossip_processing * time -> beacon_clock * fix builds	2021-03-05 14:12:00 +01:00
Mamy Ratsimbazafy	5d7f9c3a04	Consensus object pools [reorg 4/5] (#2374 ) * Add documentation * make test doesn't try to build the beacon node :/	2021-03-04 10:13:44 +01:00
Mamy Ratsimbazafy	2f17ac7b64	Move SSZ, deposit_contracts & eth1_monitor [reorg files 3/5] (#2371 ) * move deposit_contract * Move SSZ * fix ssz import in tests * move also eth1_monitor * forgot to delete the original * fix comma [skip ci] * Fix "make" & tools imports * Fix import * Fix import again * rename deposit_contract -> eth1 * Revert ssz move to subfolder * path fixes [skip ci]	2021-03-03 07:23:05 +01:00
tersec	5cab17dc1a	database state storage benchmarking via ncli_db (#2312 ) * database state storage benchmarking via ncli_db * more cleanups from immutable validator state branch * unexport some eth2_network constants and remove unused variables/templates * make two PeerScore constants public	2021-02-15 17:40:00 +01:00
Jacek Sieka	5713a3ce4c	performance fixes (#2259 ) * performance fixes * don't mark tree cache as dirty on read-only List accesses * store only blob in memory for keys and signatures, parse blob lazily * compare public keys by blob instead of parsing / converting to raw * compare Eth2Digest using non-constant-time comparison * avoid some unnecessary validator copying This branch will in particular speed up deposit processing which has been slowing down block replay. Pre (mainnet, 1600 blocks): ``` All time are ms Average, StdDev, Min, Max, Samples, Test Validation is turned off meaning that no BLS operations are performed 3450.269, 0.000, 3450.269, 3450.269, 1, Initialize DB 0.417, 0.822, 0.036, 21.098, 1400, Load block from database 16.521, 0.000, 16.521, 16.521, 1, Load state from database 27.906, 50.846, 8.104, 1507.633, 1350, Apply block 52.617, 37.029, 20.640, 135.938, 50, Apply epoch block ``` Post: ``` 3502.715, 0.000, 3502.715, 3502.715, 1, Initialize DB 0.080, 0.560, 0.035, 21.015, 1400, Load block from database 17.595, 0.000, 17.595, 17.595, 1, Load state from database 15.706, 11.028, 8.300, 107.537, 1350, Apply block 33.217, 12.622, 17.331, 60.580, 50, Apply epoch block ``` * more perf fixes * load EpochRef cache into StateCache more aggressively * point out security concern with public key cache * reuse proposer index from state when processing block * avoid genericAssign in a few more places * don't parse key when signature is unparseable * fix `==` overload for Eth2Digest * preallocate validator list when getting active validators * speed up proposer index calculation a little bit * reuse cache when replaying blocks in ncli_db * avoid a few more copying loops ``` Average, StdDev, Min, Max, Samples, Test Validation is turned off meaning that no BLS operations are performed 3279.158, 0.000, 3279.158, 3279.158, 1, Initialize DB 0.072, 0.357, 0.035, 13.400, 1400, Load block from database 17.295, 0.000, 17.295, 17.295, 1, Load state from database 5.918, 9.896, 0.198, 98.028, 1350, Apply block 15.888, 10.951, 7.902, 39.535, 50, Apply epoch block 0.000, 0.000, 0.000, 0.000, 0, Database block store ``` * clear full balance cache before processing rewards and penalties ``` All time are ms Average, StdDev, Min, Max, Samples, Test Validation is turned off meaning that no BLS operations are performed 3947.901, 0.000, 3947.901, 3947.901, 1, Initialize DB 0.124, 0.506, 0.026, 202.370, 363345, Load block from database 97.614, 0.000, 97.614, 97.614, 1, Load state from database 0.186, 0.188, 0.012, 99.561, 357262, Advance slot, non-epoch 14.161, 5.966, 1.099, 395.511, 11524, Advance slot, epoch 1.372, 4.170, 0.017, 276.401, 363345, Apply block, no slot processing 0.000, 0.000, 0.000, 0.000, 0, Database block store ```	2021-01-25 13:04:18 +01:00
tersec	027e272007	fix tools/all build (#2010 )	2020-11-13 13:57:48 +01:00
Zahary Karadjov	5f6bdc6709	Store all deposit-derived data in memory	2020-10-15 20:15:51 +03:00
Zahary Karadjov	e6320e5881	Address #1584 Don't keep all deposits in memory (persist them to disk)	2020-10-15 20:15:51 +03:00
tersec	a1a8da82b4	replace assertion with warning message (#1776 )	2020-09-29 17:15:49 +02:00
tersec	aca1a318f2	cleanly close kvstore databases and bump nim-eth (#1630 ) * cleanly close kvstore databases * close databases for all subcommands and during error conditions	2020-09-12 05:35:58 +00:00
tersec	48893f1c2e	add ncli_db subcommand to prune database of unnecessary blocks and states (#1593 ) * add ncli_db subcommand to prune database of unnecessary blocks, states, and state roots * tweak comments * reduce default aggressiveness in pruning old states * move copyPrunedDatabase() to ncli_db, as it's not generally useful as part of beacon_chain_db and doesn't use any internal interfaces	2020-09-11 15:20:34 +02:00
Viktor Kirilov	67d73c4c60	added the --network=<x> option to the tools for which it matters	2020-09-01 12:02:22 +03:00
tersec	b8da265f89	add setting for benchmarking and profiling of sqlite block storage times (#1575 )	2020-08-27 14:52:22 +02:00
Viktor Kirilov	0a96e5f564	renamed CandidateChains to ChainDagRef and made the Quarantine type a ref type so there is a single instance in the beacon node (#1407 )	2020-07-31 14:49:06 +00:00
Viktor Kirilov	c032366547	removed the BlockPool type and all of the proxy functions around it (#1401 ) * removed the BlockPool type and all of the proxy functions around it - passing the chain DAG and the quarantine explicitly where appropriate - they don't need to be bundled in a type * fixed the build after the rebase	2020-07-30 21:18:17 +02:00
Jacek Sieka	157ddd2ac4	Fork choice fixes 5 (#1381 ) * limit attestations kept in attestation pool With fork choice updated, the attestation pool only needs to keep track of attestations that will eventually end up in blocks - we can thus limit the horizon of attestations that we keep more aggressively. To get here, we expose getEpochRef which gets metadata about a particular epochref, and make sure to populate it when a block is added - this ensures that state rewinds during block addition are minimized. In addition, we'll use the target root/epoch when validating attestations - this helps minimize the number of different states that we need to rewind to, in general. * remove unused CandidateChains.justifiedState * remove BlockPools.Head object * avoid quadratic quarantine loop * fix	2020-07-28 13:54:32 +00:00
Jacek Sieka	fb2f742972	Fork choice fixes 2 (#1356 ) * fork choice cleanup * enable v2 pruning * prefer `get_current_epoch` * fix finalization check to use correct epoch * small cleanups * add `count_active_validators` * remove misleading logs * fix justified checkpoint slot calculation in rpc	2020-07-22 23:01:44 +02:00
tersec	6b77f3dda5	update compute_subnet_for_attestation() to use https://github.com/ethereum/eth2.0-specs/pull/1876 signature, which isn't in v0.12.1, which works with lookahead (#1346 )	2020-07-22 08:04:21 +00:00
Jacek Sieka	8b01284b0e	cache block hash (#1329 ) hash_tree_root was turning up when running beacon_node, which turned out to be due to repeated hash_tree_root invocations - this PR brings them back down to normal. This PR caches the root of a block in the SignedBeaconBlock object - this has the potential downside that even invalid blocks will be hashed (as part of deserialization) - later, one could imagine delaying this until checks have passed. There's also some cleanup of the `cat=` logs, which were applied randomly and haphazardly and to a large degree are duplicated by other information in the log statements - in particular, topics fulfill the same role	2020-07-16 15:16:51 +02:00
Zahary Karadjov	c4af4e2f35	Working test suite with run-time presets	2020-07-08 02:02:14 +03:00
Jacek Sieka	1301600341	Trusted blocks (#1227 ) * cleanups * fix ncli state root check flag * add block dump to ncli_db * limit ncli_db benchmark length * tone down finalization logs * introduce trusted blocks We only store blocks whose signature we've verified in the database - as such, there's no need to check it again, and most importantly, no need to deserialize the signature when loading from database. 50x startup time improvement, 200x block load time improvement. * fix rewinding when deposits have invalid signature * speed up ancestor iteration by avoiding copy * avoid deserializing signatures for trusted data * load blocks lazily when rewinding (less memory used) * chronicles workarounds * document trustedbeaconblock	2020-06-25 12:23:10 +02:00
tersec	807b920c19	state_transition implements the spec fairly directly (#1220 )	2020-06-23 13:54:24 +00:00
Jacek Sieka	49e9167b28	clean up dump feature * don't write blocks that get added to database * don't write states * write to folders * add state dumping feature to `ncli_db` to get any known state from the database	2020-06-16 13:44:37 +00:00
Jacek Sieka	ea8f96d284	ncli_db: allow saving states by root (#1136 ) also dump state+block when validation fails	2020-06-06 13:26:19 +02:00
Jacek Sieka	56ffb696be	reorder ssz (#1099 ) * reorder ssz * split into hash_trees and ssz_serialization, roughly, for hashing and IO * move bitseqs into ssz (from stew) * clean up imports * docs, imports	2020-06-03 15:52:02 +02:00
Jacek Sieka	1fd2cfef62	ncli_db: add validation flag, better ux	2020-06-01 17:53:41 +00:00
Jacek Sieka	061be037d1	ncli_db: database tool includes a benchmark tool for now	2020-05-28 17:43:02 +00:00
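
The era-format commit above (aabdd34704) gives enough numbers to check its own arithmetic. The following is a minimal sketch, assuming only what the message states: 8192 slots per era, roughly 671k mainnet slots covered, and about 558mb of state across 82 eras. The helper names are hypothetical and are not taken from the nimbus-eth2 code base. The sketch also does not model how era files are actually numbered or named.

```python
import math

# Assumption from the commit message: one era spans 8192 slots.
SLOTS_PER_ERA = 8192


def era_group(slot: int) -> int:
    """Hypothetical helper: zero-based index of the 8192-slot group containing `slot`."""
    if slot < 0:
        raise ValueError("slot must be non-negative")
    return slot // SLOTS_PER_ERA


def groups_needed(slot_count: int) -> int:
    """Hypothetical helper: number of 8192-slot groups needed to cover `slot_count` slots."""
    if slot_count < 0:
        raise ValueError("slot_count must be non-negative")
    return math.ceil(slot_count / SLOTS_PER_ERA)


def average_state_mb(total_state_mb: float, eras: int) -> float:
    """Average state size per era, given a total and an era count."""
    if eras <= 0:
        raise ValueError("eras must be positive")
    return total_state_mb / eras


if __name__ == "__main__":
    # Figures quoted in the commit message: ~671k slots, ~558mb over 82 eras.
    print(groups_needed(671_000))
    print(round(average_state_mb(558, 82), 1))
```

Under these assumptions, 671k slots need 82 groups of 8192 slots, which agrees with the "82 eras tested" figure, and the average state size of just under 7mb per era sits between the ~3mb and ~9mb endpoints quoted in the message.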

134 Commits