nimbus-eth2

Commit Graph

Author	SHA1	Message	Date
tersec	2240594ed8	beacon_chain_db: proc -> func (#3931 )	2022-08-01 16:17:06 +00:00
Miran	dfd4afc9f2	compatibility with Nim 1.4+ (#3888 )	2022-07-29 10:53:42 +00:00
Etan Kissling	aff53e962f	merge LC db into main BN db (#3832 ) * merge LC db into main BN db To treat derived LC data similar to derived state caches, merge it into the main beacon node DB. * shorten table names, group with lc prefix	2022-07-04 23:46:32 +03:00
Jacek Sieka	138c40161d	avoid unnecessary recompression in block protocol (#3598 ) Blocks can be sent straight from compressed data sources Co-authored-by: Etan Kissling <etan@status.im>	2022-05-05 11:00:02 +00:00
Jacek Sieka	d0dbc4a8f9	Snappy revamp (#3564 ) This PR makes the necessary adjustments to deal with the revamped snappy API. In practical terms for nimbus-eth2, there are performance increases to gossip processing, database reading and writing as well as era file processing. Exporting `.era` files for example, a snappy-heavy operation, almost halves in total processing time: Pre: ``` Average, StdDev, Min, Max, Samples, Test 39.088, 8.735, 23.619, 53.301, 50, tState 237.079, 46.692, 165.620, 355.481, 49, tBlocks ``` Post: ``` All time are ms Average, StdDev, Min, Max, Samples, Test 25.350, 5.303, 15.351, 41.856, 50, tState 141.238, 24.164, 99.990, 199.329, 49, tBlocks ```	2022-04-15 09:44:06 +02:00
Jacek Sieka	f70ff38b53	enable `styleCheck:usages` (#3573 ) Some upstream repos still need fixes, but this gets us close enough that style hints can be enabled by default. In general, "canonical" spellings are preferred even if they violate nep-1 - this applies in particular to spec-related stuff like `genesis_validators_root` which appears throughout the codebase.	2022-04-08 16:22:49 +00:00
Jacek Sieka	5092fc41c7	use snappy-framed format for compressing bellatrix+ database entries (#3551 ) `.era` files and Req/Resp protocols use framed formats - aligning the database with these makes for less recompression work overall as gossip is sent only once while req/resp repeats (potentially) - this also allows efficient pruning-to-era where snappy-recompression is the major cycle thief.	2022-03-29 11:33:06 +00:00
Jacek Sieka	6983dacc26	fix bellatrix table names (#3544 ) this should/will cause existing nimbus databases to revert to the altair merge and resync with the new table name	2022-03-24 14:36:31 +01:00
Jacek Sieka	4207b127f9	era: load blocks and states (#3394 ) * era: load blocks and states Era files contain finalized history and can be thought of as an alternative source for block and state data that allows clients to avoid syncing this information from the P2P network - the P2P network is then used to "top up" the client with the most recent data. They can be freely shared in the community via whatever means (http, torrent, etc) and serve as a permanent cold store of consensus data (and, after the merge, execution data) for history buffs and bean counters alike. This PR gently introduces support for loading blocks and states in two cases: block requests from rest/p2p and frontfilling when doing checkpoint sync. The era files are used as a secondary source if the information is not found in the database - compared to the database, there are a few key differences: * the database stores the block indexed by block root while the era file indexes by slot - the former is used only in rest, while the latter is used both by p2p and rest. * when loading blocks from era files, the root is no longer trivially available - if it is needed, it must either be computed (slow) or cached (messy) - the good news is that for p2p requests, it is not needed * in era files, "framed" snappy encoding is used while in the database we store unframed snappy - for p2p2 requests, the latter requires recompression while the former could avoid it * front-filling is the process of using era files to replace backfilling - in theory this front-filling could happen from any block and front-fills with gaps could also be entertained, but our backfilling algorithm cannot take advantage of this because there's no (simple) way to tell it to "skip" a range. * front-filling, as implemented, is a bit slow (10s to load mainnet): we load the full BeaconState for every era to grab the roots of the blocks - it would be better to partially load the state - as such, it would also be good to be able to partially decompress snappy blobs * lookups from REST via root are served by first looking up a block summary in the database, then using the slot to load the block data from the era file - however, there needs to be an option to create the summary table from era files to fully support historical queries To test this, `ncli_db` has an era file exporter: the files it creates should be placed in an `era` folder next to `db` in the data directory. What's interesting in particular about this setup is that `db` remains as the source of truth for security purposes - it stores the latest synced head root which in turn determines where a node "starts" its consensus participation - the era directory however can be freely shared between nodes / people without any (significant) security implications, assuming the era files are consistent / not broken. There's lots of future improvements to be had: * we can drop the in-memory `BlockRef` index almost entirely - at this point, resident memory usage of Nimbus should drop to a cool 500-600 mb * we could serve era files via REST trivially: this would drop backfill times to whatever time it takes to download the files - unlike the current implementation that downloads block by block, downloading an era at a time almost entirely cuts out request overhead * we can "reasonably" recreate detailed state history from almost any point in time, turning an O(slot) process into O(1) effectively - we'll still need caches and indices to do this with sufficient efficiency for the rest api, but at least it cuts the whole process down to minutes instead of hours, for arbitrary points in time * CI: ignore failures with Nim-1.6 (temporary) * test fixes Co-authored-by: Ștefan Talpalaru <stefantalpalaru@yahoo.com>	2022-03-23 09:58:17 +01:00
Jacek Sieka	70270eeabe	better error messages on directory creation failure (#3536 )	2022-03-22 17:06:21 +00:00
Jacek Sieka	c64bf045f3	remove StateData (#3507 ) One more step on the journey to reduce `BlockRef` usage across the codebase - this one gets rid of `StateData` whose job was to keep track of which block was last assigned to a state - these duties have now been taken over by `latest_block_root`, a fairly recent addition that computes this block root from state data (at a small cost that should be insignificant) 99% mechanical change.	2022-03-16 08:20:40 +01:00
Jacek Sieka	d0183ccd77	Historical state reindex for trusted node sync (#3452 ) When performing trusted node sync, historical access is limited to states after the checkpoint. Reindexing restores full historical access by replaying historical blocks against the state and storing snapshots in the database. The process can be initiated or resumed at any point in time.	2022-03-11 12:49:47 +00:00
Jacek Sieka	40a4c01086	chaindag: don't keep backfill block table in memory (#3429 ) This PR names and documents the concept of the archive: a range of slots for which we have degraded functionality in terms of historical access - in particular: * we don't support rewinding to states in this range * we don't keep an in-memory representation of the block dag The archive de-facto exists in a trusted-node-synced node, but this PR gives it a name and drops the in-memory digest index. In order to satisfy `GetBlocksByRange` requests, we ensure that we have blocks for the entire archive period via backfill. Future versions may relax this further, adding a "pre-archive" period that is fully pruned. During by-slot searches in the archive (both for libp2p and rest requests), an extra database lookup is used to covert the given `slot` to a `root` - future versions will avoid this using era files which natively are indexed by `slot`. That said, the lookup is quite fast compared to the actual block loading given how trivial the table is - it's hard to measure, even. A collateral benefit of this PR is that checkpoint-synced nodes will see 100-200MB memory usage savings, thanks to the dropped in-memory cache - future pruning work will bring this benefit to full nodes as well. * document chaindag storage architecture and assumptions * look up parent using block id instead of full block in clearance (future-proofing the code against a future in which blocks come from era files) * simplify finalized block init, always writing the backfill portion to db at startup (to ensure lookups work as expected) * preallocate some extra memory for finalized blocks, to avoid immediate realloc	2022-02-26 19:16:19 +01:00
zah	9c1ff78f84	Fix a reward calculation bug affecting Prater epoch 64781 (#3428 ) To calculate the deltas correctly, the `process_inactivity_updates` function must be called before the rewards and penalties processing code in order to update the `inactivity_scores` field in the state. This would have required duplicating more logic from the spec in the ncli modules, so I've decided to pay the price of introducing a run-time copy of the state at each epoch which eliminates the need to duplicate logic (both for this fix and the previous one). Other changes: * Fixes for the read-only mode of the `BeaconChainDb` * Fix an uint64 underflow in the debug output procedure for printing balance deltas * Allow Bellatrix states in the reward computation helpers	2022-02-22 14:14:17 +02:00
tersec	7de3f00f35	generic putCorruptState; {Merge=>Bellatrix}BeaconStateNoImmutableValidators (#3427 )	2022-02-21 12:55:56 +01:00
Jacek Sieka	adfe655b16	db: make block loading generic (#3413 ) Streamline lookup with Forky and BeaconBlockFork (then we can do the same for era) We use type to avoid conditionals, as fork is often already known at a "higher" level. * load blockid before loading block by root - this is needed to map root to slot and will eventually be done via block summary table for "old" blocks Co-authored-by: tersec <tersec@users.noreply.github.com>	2022-02-21 09:48:02 +01:00
Jacek Sieka	a88427bd39	ncli_db: more readonly support (#3411 ) Update several `ncli_db` commands to run in readOnly mode, allowing them to be used with a running instance - in particular era export. * export all eras by default * skip already-exported eras	2022-02-18 07:37:44 +01:00
tersec	5eecb9a21f	rename no{R=>r}eturn, no{I=>i}init, short{l=>L}og, E{T=>t}h2Node, Beacon{c=>C}hainDB (#3403 )	2022-02-16 23:24:44 +01:00
Jacek Sieka	7db5647a6e	clean up / document init (#3387 ) * clean up / document init * drop `immutable_validators` data (pre-altair) * document versions where data is first added * avoid needlessly loading genesis block data on startup * add a few more internal database consistency checks * remove duplicate state root lookup on state load * comment	2022-02-16 16:44:04 +01:00
Jacek Sieka	d583e8e4ac	Store finalized block roots in database (3s startup) (#3320 ) * Store finalized block roots in database (3s startup) When the chain has finalized a checkpoint, the history from that point onwards becomes linear - this is exploited in `.era` files to allow constant-time by-slot lookups. In the database, we can do the same by storing finalized block roots in a simple sparse table indexed by slot, bringing the two representations closer to each other in terms of conceptual layout and performance. Doing so has a number of interesting effects: * mainnet startup time is improved 3-5x (3s on my laptop) * the _first_ startup might take slightly longer as the new index is being built - ~10s on the same laptop * we no longer rely on the beacon block summaries to load the full dag - this is a lot faster because we no longer have to look up each block by parent root * a collateral benefit is that we no longer need to load the full summaries table into memory - we get the RSS benefits of #3164 without the CPU hit. Other random stuff: * simplify forky block generics * fix withManyWrites multiple evaluation * fix validator key cache not being updated properly in chaindag read-only mode * drop pre-altair summaries from `kvstore` * recreate missing summaries from altair+ blocks as well (in case database has lost some to an involuntary restart) * print database startup timings in chaindag load log * avoid allocating superfluos state at startup * use a recursive sql query to load the summaries of the unfinalized blocks	2022-01-30 18:51:04 +02:00
Jacek Sieka	d076e1a11b	ncli_db: import states and blocks from era file (#3313 )	2022-01-25 09:28:26 +01:00
Zahary Karadjov	29aad0241b	Precise per-component ETH-denominated rewards tracking This is an alternative take on https://github.com/status-im/nimbus-eth2/pull/3107 that aims for more minimal interventions in the spec modules at the expense of duplicating more of the spec logic in ncli_db.	2022-01-18 01:56:56 +02:00
Jacek Sieka	ff5b91cd58	Revert "Don't use GC memory for the initial beacon block summaries loading" (#3292 ) This reverts commit `7e2fc2b726`.	2022-01-17 12:07:49 +00:00
Zahary Karadjov	7e2fc2b726	Don't use GC memory for the initial beacon block summaries loading	2022-01-15 10:15:17 +02:00
tersec	bac0eaa92e	update 10 modules from using merge to bellatrix (#3257 )	2022-01-07 18:10:40 +01:00
Jacek Sieka	ba99c8fe4f	update era file documentation / impl (#3226 ) Overhaul of era files, including documentation and reference implementations * store blocks, then state, then slot indices for easy lookup at low cost * document era file rationale * altair+ support in era writer	2022-01-07 11:13:19 +01:00
Jacek Sieka	1021e3324e	Revert writing backfill root to database (#3215 ) Introduced in #3171, it turns out we can just follow the block headers to achieve the same effect * leaves the constant in the code so as to avoid confusion when reading database that had the constant written (such as the fleet nodes and other unstable users)	2021-12-21 11:40:14 +01:00
Jacek Sieka	03005f48e1	Backfill support for ChainDAG (#3171 ) In the ChainDAG, 3 block pointers are kept: genesis, tail and head. This PR adds one more block pointer: the backfill block which represents the block that has been backfilled so far. When doing a checkpoint sync, a random block is given as starting point - this is the tail block, and we require that the tail block has a corresponding state. When backfilling, we end up with blocks without corresponding states, hence we cannot use `tail` as a backfill pointer - there is no state. Nonetheless, we need to keep track of where we are in the backfill process between restarts, such that we can answer GetBeaconBlocksByRange requests. This PR adds the basic support for backfill handling - it needs to be integrated with backfill sync, and the REST API needs to be adjusted to take advantage of the new backfilled blocks when responding to certain requests. Future work will also enable moving the tail in either direction: * pruning means moving the tail forward in time and removing states * backwards means recreating past states from genesis, such that intermediate states are recreated step by step all the way to the tail - at that point, tail, genesis and backfill will match up. * backfilling is done when backfill != genesis - later, this will be the WSS checkpoint instead	2021-12-13 14:36:06 +01:00
Jacek Sieka	f69b272850	Keep cooked pubkeys in cache (#3122 ) Turning uncompressed pubkeys into cooked ones is fast, but unnecessary - this should avoid a little work for every signature validation we do by pre-loading them at startup.	2021-11-25 19:41:54 +01:00
Jacek Sieka	f19a497eec	ncli_db: add putState, putBlock (#3096 ) * ncli_db: add putState, putBlock These tools allow modifying an existing nimbus database for the purpose of recovery or reorg, moving the head, tail and genesis to arbitrary points. * remove potentially expensive `putState` in `BeaconStateDB` * introduce `latest_block_root` which computes the root of the latest applied block from the `latest_block_header` field (instead of passing it in separately) * avoid some unnecessary BeaconState copies during init * discover https://github.com/nim-lang/Nim/issues/19094 * prefer `HashedBeaconState` in a few places to avoid recomputing state root * fetch latest block root from state when creating blocks * harden `get_beacon_proposer_index` against invalid slots and document * move random spec function tests to `test_spec.nim` * avoid unnecessary state root computation before block proposal	2021-11-18 13:02:43 +01:00
Jacek Sieka	a086cf01ac	altair fork handling cleanups (#3050 ) * fix stack overflow crash in REST/debug/getStateV2 * introduce `ForkyXxx` for generic type matching of `Xxx` across branches (SomeHashedBeaconState -> ForkyHashedBeaconState et al) - `Some` is already used for other types of type classes * consolidate function naming in BeaconChainDB, use some generics * import `forks.nim` from other spec modules and move `Forked` helpers around to resolve circular imports remove `ForkedBeaconState`, use `ForkedHashedBeaconState` throughout (less data shuffling between the types) * fix several cases of states being stored on stack in tests, causing random failures on some platforms * remove reading json support from ncli - this should be ported to the rest json reading instead (doesn't currently work because stack sizes)	2021-11-05 08:34:34 +01:00
tersec	6b3bf7eb7b	merge hardfork database support (#2911 ) * merge hardfork database support * working block_sim * recreate state transition changes	2021-09-30 01:07:24 +00:00
tersec	2b2846b468	implement forked merge state/block support (#2890 ) * implement forked state/block support * merge support for containsOrphan; import cleanup; 80-column lines * add merge block header operations and slot sanity fixture * add epoch state transition tests; implement is_valid_gas_limit(), is_merge_block(), is_execution_enabled(), and compute_timestamp_at_slot() * implement process_execution_payload() and add merge deposit operations tests * add merge block sanity tests * add merge case to syncCommitteeParticipants * v1.1.0-beta.5 updates * reduce getTestStates-based memory usage; don't try to REST-serialize ExecutionPayload transactions without underlying support * add execution payload tests; switch var to let in tests/official/	2021-09-27 14:22:58 +00:00
Jacek Sieka	e47a8cbe42	fixes (#2901 ) * export kvstore from beacon_chain_db * fix rest HashList deserialization * fix asTrusted	2021-09-27 11:24:58 +02:00
tersec	5670d58155	test for newest fork first (#2891 )	2021-09-23 06:53:36 +00:00
Dustin Brody	6638476b5f	discard putative blocks from invalid hardforks	2021-09-16 16:18:40 +03:00
Jacek Sieka	a7a65bce42	disentangle eth2 types from the ssz library (#2785 ) * reorganize ssz dependencies This PR continues the work in https://github.com/status-im/nimbus-eth2/pull/2646, https://github.com/status-im/nimbus-eth2/pull/2779 as well as past issues with serialization and type, to disentangle SSZ from eth2 and at the same time simplify imports and exports with a structured approach. The principal idea here is that when a library wants to introduce SSZ support, they do so via 3 files: * `ssz_codecs` which imports and reexports `codecs` - this covers the basic byte conversions and ensures no overloads get lost * `xxx_merkleization` imports and exports `merkleization` to specialize and get access to `hash_tree_root` and friends * `xxx_ssz_serialization` imports and exports `ssz_serialization` to specialize ssz for a specific library Those that need to interact with SSZ always import the `xxx_` versions of the modules and never `ssz` itself so as to keep imports simple and safe. This is similar to how the REST / JSON-RPC serializers are structured in that someone wanting to serialize spec types to REST-JSON will import `eth2_rest_serialization` and nothing else. * split up ssz into a core library that is independendent of eth2 types * rename `bytes_reader` to `codec` to highlight that it contains coding and decoding of bytes and native ssz types * remove tricky List init overload that causes compile issues * get rid of top-level ssz import * reenable merkleization tests * move some "standard" json serializers to spec * remove `ValidatorIndex` serialization for now * remove test_ssz_merkleization * add tests for over/underlong byte sequences * fix broken seq[byte] test - seq[byte] is not an SSZ type There are a few things this PR doesn't solve: * like #2646 this PR is weak on how to handle root and other dontSerialize fields that "sometimes" should be computed - the same problem appears in REST / JSON-RPC etc * Fix a build problem on macOS * Another way to fix the macOS builds Co-authored-by: Zahary Karadjov <zahary@gmail.com>	2021-08-18 20:57:58 +02:00
Jacek Sieka	7a622e8505	rework spec imports (#2779 ) The spec imports are a mess to work with, so this branch cleans them up a bit to ensure that we avoid generic sandwitches and that importing stuff generally becomes easier. * reexport crypto/digest/presets because these are part of the public symbol set of the rest of the spec types * don't export `merge` types from `base` - this causes circular deps * fix circular deps in `ssz/spec_types` - this is the first step in disentangling ssz from spec * be explicit about phase0 vs altair - longer term, `altair` will become the "natural" type set, then merge and so on, so no point in giving `phase0` special preferential treatment	2021-08-12 13:08:20 +00:00
Jacek Sieka	7bb76a6cd1	Merge remote-tracking branch 'origin/stable' into merge-stable	2021-08-09 13:14:28 +02:00
Jacek Sieka	ee79c10a7d	update validator key cache on startup (#2760 ) * update validator key cache on startup Versions prior to 1.1.0 do not write a validator key cache at all. Versions from 1.4.0 and upwards require an immutable validator key cache to verify blocks - normally, block verification fills the cache but that assumes that at least one block was verified by a version that has the key cache. Taken together, this breaks direct upgrades from anything <1.1.0 to 1.4.0. The fix is simply to refresh fill the cache from an existing state on startup. * also log serious block validation failures at info level	2021-08-05 11:26:10 +03:00
Jacek Sieka	3f9c1fdf4e	More RuntimeConfig cleanup (#2716 ) * remove from BeaconChainDB (doesn't depend on runtime config) * eth2-testnets -> eth2-networks * use `cfg` name throughout	2021-07-13 16:27:10 +02:00
Jacek Sieka	23eea197f6	Implement split preset/config support (#2710 ) * Implement split preset/config support This is the initial bulk refactor to introduce runtime config values in a number of places, somewhat replacing the existing mechanism of loading network metadata. It still needs more work, this is the initial refactor that introduces runtime configuration in some of the places that need it. The PR changes the way presets and constants work, to match the spec. In particular, a "preset" now refers to the compile-time configuration while a "cfg" or "RuntimeConfig" is the dynamic part. A single binary can support either mainnet or minimal, but not both. Support for other presets has been removed completely (can be readded, in case there's need). There's a number of outstanding tasks: * `SECONDS_PER_SLOT` still needs fixing * loading custom runtime configs needs redoing * checking constants against YAML file * yeerongpilly support `build/nimbus_beacon_node --network=yeerongpilly --discv5:no --log-level=DEBUG` * load fork epoch from config * fix fork digest sent in status * nicer error string for request failures * fix tools * one more * fixup * fixup * fixup * use "standard" network definition folder in local testnet Files are loaded from their standard locations, including genesis etc, to conform to the format used in the `eth2-networks` repo. * fix launch scripts, allow unknown config values * fix base config of rest test * cleanups * bundle mainnet config using common loader * fix spec links and names * only include supported preset in binary * drop yeerongpilly, add altair-devnet-0, support boot_enr.yaml	2021-07-12 15:01:38 +02:00
tersec	7577f8c2ef	add blockchain_dag altair database reading; add rollback tests (#2683 ) * add blockchain_dag altair database reading; add rollback tests; fix some unnecessary type conversions * remove debugging scaffolding * proposeSignedBlock() will need to be async for merge; introduce altair types to VC	2021-06-29 15:09:29 +00:00
tersec	41e0a7abc0	introduce database support for Altair (#2667 ) * introduce immutable Altair BeaconState * add database support for Altair blocks and states * add tests for Altair get/put/contains/delete state * enable blockchain_dag Altair state database storing * properly return error on getting missing altair block	2021-06-24 07:11:47 +00:00
tersec	146fa48454	use ForkedHashedBeaconState in StateData (#2634 ) * use ForkedHashedBeaconState in StateData * fix FAR_FUTURE_EPOCH -> slot overflow; almost always use assign() * avoid stack allocation in maybeUpgradeStateToAltair() * create and use dispatch functions for check_attester_slashing(), check_proposer_slashing(), and check_voluntary_exit() * use getStateRoot() instead of various state.data.hbsPhase0.root * remove withStateVars.hashedState(), which doesn't work as a design anymore * introduce spec/datatypes/altair into beacon_chain_db * fix inefficient codegen for getStateField(largeStateField) * state_transition_slots() doesn't either need/use blocks or runtime presets * combine process_slots(HBS)/state_transition_slots(HBS) which differ only in last-slot htr optimization * getStateField(StateData, ...) was replaced by getStateField(ForkedHashedBeaconState, ...) * fix rollback * switch some state_transition(), process_slots, makeTestBlocks(), etc to use ForkedHashedBeaconState * remove state_transition(phase0.HashedBeaconState) * remove process_slots(phase0.HashedBeaconState) * remove state_transition_block(phase0.HashedBeaconState) * remove unused callWithBS(); separate case expression from if statement * switch back from nested-ref-object construction to (ref Foo)(Bar())	2021-06-11 20:51:46 +03:00
Jacek Sieka	d859bc12f0	write uncompressed validator keys to database (#2639 ) * write uncompressed validator keys to database Loading 150k+ validator keys on startup in compressed format takes a lot of time - better store them in uncompressed format which makes behaviour just after startup faster / more predictable. * refactor cached validator key access * fix isomorphic cast to work with non-var instances * remove cooked pubkey cache - directly use database cache in chaindag as well (one less cache to keep in sync) * bump blscurve, introduce loadValid for known-to-be-valid keys	2021-06-10 10:37:02 +03:00
Jacek Sieka	5974fb0e7d	speed up initial migration an isolated transaction when loading the database helps keep migration time down on first start after upgrade	2021-06-09 20:04:20 +03:00
Jacek Sieka	60df17786e	avoid reading legacy db on write * don't consider legacy database when writing state - this read is slow on kvstore * avoid epoch transition when there's an exact match in cache already * simplify init to only consider checkpoint states	2021-05-30 12:32:51 +03:00
tersec	46c5a0110a	log doppelganger attestation signature; rm withState.HashedBeaconState uses (#2608 )	2021-05-28 15:51:15 +03:00
Jacek Sieka	5be1f8bf93	Revert "increase sqlite cache size (#2607 )" This reverts commit `f55c4bc402`.	2021-05-28 11:35:23 +02:00
Jacek Sieka	f55c4bc402	increase sqlite cache size (#2607 ) This is a test to see how the prater nodes react	2021-05-27 21:16:04 +02:00
Jacek Sieka	ab70f371e1	Revert "create new database in separate file (#2596 )" (#2604 ) This reverts commit `eebc828778`. Adding a separate file turns out not to be enough. This PR reverts the separate file change. Another theory is that the large kvstore table causes cache thrashing - all database connections share a common page cache which would explain the poor performance of the separate file solution.	2021-05-27 12:59:42 +02:00
Jacek Sieka	18d26071d8	hotfix migration should have just kept the full copy-pasted dbseq init	2021-05-26 09:56:21 +02:00
Jacek Sieka	eebc828778	create new database in separate file (#2596 ) The V1 table structure shows great improvements in performance, but if there's an old `kvstore` without rowid:s, these benefits are nullified: reorgs during writes and deletes remain expensive (even if the degradation is reduced somewhat). This PR creates the tables in a new file instead, and uses the old file as a read-only store - this has several interesting properties: * the old database is left completely untouched - this guarantees that downgrades work smooth (they'll only need to resync their missing portions) * starting sync after this PR means only a v1 database is created * v0 databases stick around - no migration is performed (for now) Future PR:s can introduce migration of the data from one database to another - a simply copy will take hours which is downtime we want to avoid - at that point, it might make sense to migrate straight to era files instead.	2021-05-26 09:07:18 +02:00
Jacek Sieka	8dbd796401	prune `validatorIndexFromPubKey` table	2021-05-20 14:10:23 +03:00
Jacek Sieka	97f4e1fffe	Db1 cont (#2573 ) * Revert "Revert "Upgrade database schema" (#2570)" This reverts commit `6057c2ffb4`. * ssz: fix loading empty lists into existing instances Not a problem earlier because we didn't reuse instances * bump nim-eth * bump nim-web3	2021-05-17 18:37:26 +02:00
tersec	6057c2ffb4	Revert "Upgrade database schema" (#2570 ) This reverts commit `22ddf74752`.	2021-05-17 06:34:44 +00:00
Jacek Sieka	22ddf74752	Upgrade database schema The `kvstore` design we're using now turns out to not be the best way to use `sqlite` - in particular, there are some significant benefits to using rowid in certain situations and to keep data in separate tables. With this branch, there are massive improvements in startup time (seconds instead of minutes) and state/block storage and pruning times (milliseconds instead of seconds) - these improvements can in particular be seen on slow drives and translate directly into better attestation performance. * update kvstore to new keyspace design * remove `DirStoreRef` and the hidden `--state-db-kind` option - this was an experiment to store large blobs in files, but with the new kvstore, there's no compelling reason to do so * remove `DbMap` - unused and would need updating for new keyspace design * introduce separate tables for each data type (blocks, states etc) * remove "WITHOUT ROWID" pessimization for tables with large blobs * close DbSeq statements explicitly (and earlier) * store beacon block summaries in separate table, without SSZ compression and load them all with single query on startup * stop storing backwards compat full states * mark genesis beacon block as trusted * avoid faststreams when loading SSZ data * remove `DisagreementBehavior` (unused)	2021-05-14 20:05:23 +03:00
Jacek Sieka	54d6884c89	fix sync issue when upgrading from 1.1.0-inited db This patch writes a full genesis state to `kvstore` if one was missing, which fixes 1.2.0 restarting sync when upgrading from 1.1.0, or when downgrading to a pre-1.1.0 release.	2021-04-20 16:55:18 +03:00
Zahary Karadjov	9776fbfe17	Merge branch 'version-1.1.0' into unstable	2021-04-08 20:50:06 +03:00
Jacek Sieka	7165e0ac31	Reset cached indices when resetting cache on SSZ read (#2480 ) * Reset cached indices when resetting cache on SSZ read When deserializing into an existing structure, the cache should be cleared - goes for json also. Also improve error messages.	2021-04-08 13:11:04 +03:00
Jacek Sieka	beceb060c4	Write state diffs to separate table (and experimentally, files instead of db) (#2460 )	2021-04-06 21:56:45 +03:00
Jacek Sieka	2695cfa864	EH cleanup (#2455 ) almost 100% raises in nimbus-eth2 now! * fix some rare exception-related crashes in json-rpc	2021-03-26 07:52:01 +01:00
Jacek Sieka	8b76ceed52	Fix minor exception effect issues (#2448 ) Makes code compatible with https://github.com/status-im/nim-chronos/pull/166 without requiring it.	2021-03-24 17:20:55 +01:00
tersec	baa43ae7f7	add stricter ABI-compatibility checks for punned types (#2434 ) * add stricter ABI-compatibility checks for punned types * enforce type isomorphism for each pair of cast types to eliminate potential safety hole	2021-03-19 12:30:20 +00:00
tersec	8def2486b0	immutable validator database factoring (#2297 ) * initial immutable validator database factoring * remove changes from chain_dag: this abstraction properly belongs in beacon_chain_db * add merging mutable/immutable validator portions; individually test database roundtripping of immutable validators and states-sans-immutable-validators * update test summaries * use stew/assign2 instead of Nim assignment * add reading/writing of immutable validators in chaindag * remove unused import * replace chunked k/v store of immutable validators with per-row SQL table storage * use List instead of HashList * un-stub some ncli_db code so that it uses * switch HashArray to array; move BeaconStateNoImmutableValidators from datatypes to beacon_chain_db * begin only-mutable-part state storage * uncomment some assigns * work around https://github.com/nim-lang/Nim/issues/17253 * fix most of the issues/oversights; local sim runs again * fix test suite by adding missing beaconstate field to copy function * have ncli bench also store immutable validators * extract some immutable-validator-specific code from the beacon chain db module * add more rigorous database state roundtripping, with changing validator sets * adjust ncli_db to use new schema * simplify putState/getState by moving all immutable validator accounting into beacon state DB * remove redundant test case and move code to immutable-beacon-chain module * more efficient, but still brute-force, mutable+immutable validator merging * reuse BeaconState in getState * ensure HashList/HashArray caches are cleared when reusing getState buffers; add ncli_db and a unit test to verify this * HashList.clear() -> HashList.clearCache() * only copy incrementally necessary immutable validators * increase strictness of test cases and fix/work around resulting HashList cache invalidation issues * remove explanatory scaffolding * allow for storage of full (with all validators) states for backwards/forwards-compatibility * adjust DbSeq type usage * store full, with-validators, state every 64 epochs to enable reverting versions * reduce memory allocation and intermediate objects in state storage codepath * eliminate allocation/copying through intermediate BeaconStateNoImmutableValidators objects * skip benchmarking initial genesis-validator-heavy state store * always store new-style state and sometimes old-style state * document intent behind BeaconState/Validator type-punnery * more accurate failure message on SQLite in-memory database initialization failure	2021-03-15 14:11:51 +00:00
tersec	f0eb45af44	avoid int64 -> uint64 -> int64 conversions in DbSeq (#2398 )	2021-03-10 18:01:43 +00:00
Mamy Ratsimbazafy	d47f53cd9d	Reorg (5/5) (#2377 ) * Reorg things left into networking and gossip_processing * time -> beacon_clock * fix builds	2021-03-05 14:12:00 +01:00
Mamy Ratsimbazafy	2f17ac7b64	Move SSZ, deposit_contracts & eth1_monitor [reorg files 3/5] (#2371 ) * move deposit_contract * Move SSZ * fix ssz import in tests * move also eth1_monitor * forgot to delete the original * fix comma [skip ci] * Fix "make" & tools imports * Fix import * Fix import again * rename deposit_contract -> eth1 * Revert ssz move to subfolder * path fixes [skip ci]	2021-03-03 07:23:05 +01:00
tersec	5cab17dc1a	database state storage benchmarking via ncli_db (#2312 ) * database state storage benchmarking via ncli_db * more cleanups from immutable validator state branch * unexport some eth2_network constants and remove unused variables/templates * make two PeerScore constants public	2021-02-15 17:40:00 +01:00
Jacek Sieka	f012d7060b	better error message on disk / database issues (#2307 ) bumps stew for better result defects as well	2021-02-10 13:21:06 +01:00
Zahary Karadjov	fa99c3b417	Fix #2261 Also bumps Confutils to allow setting the hidden --web3-mode param (to allow testing the eth1 syncing without validators)	2021-01-30 01:32:20 +02:00
Mamy Ratsimbazafy	70a03658e3	Block validation flow v2 + Batch (serial) sig verification (#2250 ) * bump nim-blscurve * Outline the block validation flow * introduce the SigVerified types, pass the tests * Split clearance/quarantine to prepare for batch crypto verif * Add a batch signature collector * Make clearance use SigVerified block and split verification between crypto and state transition * Always use signedBeaconBlock for the onBlockAdded callback * RANDAO signing_root is the epoch instead of the full block * Support skipping BLS for testing * Fix compilation of the validator client * Try to fix strange errors MacOS and Jenkins (Clang, unknown type name br_hmac_drbg_context in stdlib_assertions.nim.c) * address https://github.com/status-im/nimbus-eth2/pull/2250#discussion_r561819858 * address https://github.com/status-im/nimbus-eth2/pull/2250#discussion_r561828025 * onBlockAdded callback should use TrustedSignedBeaconBlock https://github.com/status-im/nimbus-eth2/pull/2250#discussion_r561837261 * address https://github.com/status-im/nimbus-eth2/pull/2250#discussion_r561828946 * Use the application RNG: https://github.com/status-im/nimbus-eth2/pull/2250#discussion_r561815336 * Improve codegen of conversion zero-cost) * Quick fixes with loadWithCache after #2259 (TODO: graceful error since pubkey validations is now done first in signatures_batch) * Graceful handle rogue pubkeys and signatures now that those are lazy-loaded	2021-01-25 20:45:48 +02:00
tersec	921fe5a68f	initial infrastructure for state diffs (#2087 ) Initial infrastructure for state diffs	2021-01-18 22:34:41 +02:00
Jacek Sieka	0f8a3a5ae8	checkpoint database at end of each slot (#2195 ) * checkpoint database at end of each slot To avoid spending time on synchronizing with the file system while doing processing, the manual checkpointing mode turns off fsync during processing and instead checkpoints the database when the slot has ended. From an sqlite perspecitve, in WAL mode this guaranees database consistency but may lead to data loss which is fine - anything missing from the beacon chain database can be recovered on the next startup. * log sync status and delay in slot start message * bump	2020-12-18 22:01:24 +01:00
zah	372c9b798c	Fix the corrupted database state on Pyrmont nodes; Add mainnet genesis (#2056 ) * Handle some web3 timeouts better * Add support for developer .env files * Eth1 improvements; Mainnet genesis state Notable changes: * The deposits table have been removed from the database. The client will no longer process all deposits on start-up. * The network metadata now includes a "state snapshot" of the deposit contract. This allows the client to skip syncing deposits made prior to the snapshot (i.e. genesis). Suitable metadata added for Pyrmont and Mainnet. * The Eth1 monitor won't be started unless there are validators attached to the node. * The genesis detection code is now optional and disabled by default * Bugfix: The client should not produce blocks that will fail validation when it hasn't downloaded the latest deposits yet * Bugfix: Work around the database corruption affecting Pyrmont nodes * Remove metadata for Toledo and Medalla	2020-11-24 22:21:47 +01:00
Jacek Sieka	672915e170	work around long pyrmont startup time (#2060 ) * also fixes unnecessary copy/memory alloc when loading DbSeq entries	2020-11-21 18:53:40 +01:00
Jacek Sieka	5644cfae8f	increase max db object size (fixes #1996 )	2020-11-12 08:42:45 +01:00
Zahary Karadjov	389c11743a	Review TODO items and self-assign the most important ones	2020-11-10 20:41:04 +02:00
Jacek Sieka	fc7885b27e	Store block summary in database This introcudes a cache for block summaries, useful for instantiating the block dag on startup, bringing medalla startup times down from minutes to seconds. This is something of a temporary band-aid that would be obsoleted by a finalized block store.	2020-11-04 11:28:55 +02:00
Zahary Karadjov	14b2d4324d	openarray -> openArray	2020-11-03 23:23:10 +02:00
Zahary Karadjov	18639c3eff	Don't require requests that might fail on non-archive Geth nodes	2020-11-03 23:23:10 +02:00
Eugene Kabanov	eee01a32d5	Regression fix of eth2_network_simulation on Windows. (#1900 ) * Concentrate all sensitive writeFile/createPath calls in one place. Fix eth2_network_simulation for Windows. * Remove artifacts. * fix import Co-authored-by: Jacek Sieka <jacek@status.im>	2020-10-27 12:04:17 +01:00
Zahary Karadjov	5f6bdc6709	Store all deposit-derived data in memory	2020-10-15 20:15:51 +03:00
Zahary Karadjov	2152dc6136	Simplify the mainchain monitor	2020-10-15 20:15:51 +03:00
Zahary Karadjov	ce1fda1195	Store the deposits and the immutable validator data in Sqlite	2020-10-15 20:15:51 +03:00
Zahary Karadjov	4d66914f5a	Fix the test suite	2020-10-15 20:15:51 +03:00
Zahary Karadjov	e6320e5881	Address #1584 Don't keep all deposits in memory (persist them to disk)	2020-10-15 20:15:51 +03:00
Zahary Karadjov	aed291128a	Add support for starting from weak subjectivity checkpoints Also removes the `genesis.ssz` file stored in the data folder. The `medalla-fast-sync` target has been adapted to use the new features.	2020-10-07 09:32:03 +03:00
tersec	aca1a318f2	cleanly close kvstore databases and bump nim-eth (#1630 ) * cleanly close kvstore databases * close databases for all subcommands and during error conditions	2020-09-12 05:35:58 +00:00
Jacek Sieka	aed57df957	avoid hash tree root calculation when loading blocks from database (#1572 )	2020-09-04 08:35:10 +02:00
Zahary Karadjov	3433c77c35	Prevent Snappy decompression bombs	2020-08-19 10:13:04 +03:00
Jacek Sieka	58d77153fc	fix invalid state root being written to database (#1493 ) * fix invalid state root being written to database When rewinding state data, the wrong block reference would be used when saving the state root - this would cause state loading to fail by loading a different state than expected, preventing blocks to be applied. * refactor state loading and saving to consistently use and set StateData block * avoid rollback when state is missing from database (as opposed to being partially overwritten and therefore in need of rollback) * don't store state roots for empty slots - previously, these were used as a cache to avoid recalculating them in state transition, but this has been superceded by hash tree root caching * don't attempt loading states / state roots for non-epoch slots, these are not saved to the database * simplify rewinder and clean up funcitions after caches have been reworked * fix chaindag logscope * add database reload metric * re-enable clearance epoch tests * names	2020-08-13 11:50:05 +02:00
Jacek Sieka	8b01284b0e	cache block hash (#1329 ) hash_tree_root was turning up when running beacon_node, turns out to be repeated hash_tree_root invocations - this pr brings them back down to normal. this PR caches the root of a block in the SignedBeaconBlock object - this has the potential downside that even invalid blocks will be hashed (as part of deserialization) - later, one could imagine delaying this until checks have passed there's also some cleanup of the `cat=` logs which were applied randomly and haphazardly, and to a large degree are duplicated by other information in the log statements - in particular, topics fulfill the same role	2020-07-16 15:16:51 +02:00
Jacek Sieka	1301600341	Trusted blocks (#1227 ) * cleanups * fix ncli state root check flag * add block dump to ncli_db * limit ncli_db benchmark length * tone down finalization logs * introduce trusted blocks We only store blocks whose signature we've verified in the database - as such, there's no need to check it again, and most importantly, no need to deserialize the signature when loading from database. 50x startup time improvement, 200x block load time improvement. * fix rewinding when deposits have invalid signature * speed up ancestor iteration by avoiding copy * avoid deserializing signatures for trusted data * load blocks lazily when rewinding (less memory used) * chronicles workarounds * document trustedbeaconblock	2020-06-25 12:23:10 +02:00
tersec	807b920c19	state_transition implements the spec fairly directly (#1220 )	2020-06-23 13:54:24 +00:00
Jacek Sieka	360ebd705f	db: compress with snappy * 3gb vs 12gb for 4000 epochs of witti * 3-4x sync blocks/sec performance improvement on my FDE SSD drive * generate less quirky code for primitive types	2020-06-14 11:33:00 +03:00
Jacek Sieka	78b767f645	avoid genericAssign for beacon node types (#1166 ) * avoid genericAssign for beacon node types ok, I got fed up of this function messing up cpu measurements - it's so ridiculously slow, it's sad. before, while syncing: ``` 40,65% beacon_node_shared_witti_0 [.] genericAssignAux__U5DxFPRpHCCZDKWQzM9adaw 9,02% libc-2.31.so [.] __memmove_avx_unaligned_erms 7,07% beacon_node_shared_witti_0 [.] BIG_384_58_monty 5,19% beacon_node_shared_witti_0 [.] BIG_384_58_mul 2,72% beacon_node_shared_witti_0 [.] memcpy@plt 1,18% [kernel] [k] rb_next 1,17% beacon_node_shared_witti_0 [.] genericReset 1,06% [kernel] [k] map_private_extent_buffer ``` after: ``` 24,88% beacon_node_shared_witti_0 [.] BIG_384_58_monty 20,29% beacon_node_shared_witti_0 [.] BIG_384_58_mul 3,15% beacon_node_shared_witti_0 [.] BIG_384_58_norm 2,93% beacon_node_shared_witti_0 [.] BIG_384_58_add 2,55% beacon_node_shared_witti_0 [.] BIG_384_58_sqr 1,64% beacon_node_shared_witti_0 [.] BIG_384_58_mod 1,63% beacon_node_shared_witti_0 [.] sha256Transform__BJNBQtWr9bJwzqbyfKXd38Q 1,48% beacon_node_shared_witti_0 [.] FP_BLS381_add 1,39% beacon_node_shared_witti_0 [.] BIG_384_58_sub 1,33% beacon_node_shared_witti_0 [.] BIG_384_58_dnorm 1,14% beacon_node_shared_witti_0 [.] FP2_BLS381_mul 1,05% beacon_node_shared_witti_0 [.] BIG_384_58_cmove 1,05% beacon_node_shared_witti_0 [.] get_shuffled_seq__4uncAHNsSG3Pndo5H11U9aQ ``` * better field iteration	2020-06-12 21:10:22 +02:00
Jacek Sieka	56ffb696be	reorder ssz (#1099 ) * reorder ssz * split into hash_trees and ssz_serialization, roughly, for hashing and IO * move bitseqs into ssz (from stew) * clean up imports * docs, imports	2020-06-03 15:52:02 +02:00
Jacek Sieka	23daa966be	better deserialization log	2020-05-20 15:41:02 +02:00

1 2 3 4

193 Commits