nimbus-eth2

Commit Graph

Author	SHA1	Message	Date
Jacek Sieka	bc80ac3be1	harden REST API `atSlot` against non-finalized blocks (#3538 ) * harden validator API against pre-finalized slot requests * check `syncHorizon` when responding to validator api requests too far from `head` * limit state-id based requests to one epoch ahead of `head` * put historic data bounds on block/attestation/etc validator production API, preventing them from being used with already-finalized slots * add validator block smoke tests * make rest test create a new genesis with the tests running roughly in the first epoch to allow testing a few more boundary conditions	2022-03-23 12:42:16 +01:00
Jacek Sieka	4207b127f9	era: load blocks and states (#3394 ) * era: load blocks and states Era files contain finalized history and can be thought of as an alternative source for block and state data that allows clients to avoid syncing this information from the P2P network - the P2P network is then used to "top up" the client with the most recent data. They can be freely shared in the community via whatever means (http, torrent, etc) and serve as a permanent cold store of consensus data (and, after the merge, execution data) for history buffs and bean counters alike. This PR gently introduces support for loading blocks and states in two cases: block requests from rest/p2p and frontfilling when doing checkpoint sync. The era files are used as a secondary source if the information is not found in the database - compared to the database, there are a few key differences: * the database stores the block indexed by block root while the era file indexes by slot - the former is used only in rest, while the latter is used both by p2p and rest. * when loading blocks from era files, the root is no longer trivially available - if it is needed, it must either be computed (slow) or cached (messy) - the good news is that for p2p requests, it is not needed * in era files, "framed" snappy encoding is used while in the database we store unframed snappy - for p2p requests, the latter requires recompression while the former could avoid it * front-filling is the process of using era files to replace backfilling - in theory this front-filling could happen from any block and front-fills with gaps could also be entertained, but our backfilling algorithm cannot take advantage of this because there's no (simple) way to tell it to "skip" a range. 
* front-filling, as implemented, is a bit slow (10s to load mainnet): we load the full BeaconState for every era to grab the roots of the blocks - it would be better to partially load the state - as such, it would also be good to be able to partially decompress snappy blobs * lookups from REST via root are served by first looking up a block summary in the database, then using the slot to load the block data from the era file - however, there needs to be an option to create the summary table from era files to fully support historical queries To test this, `ncli_db` has an era file exporter: the files it creates should be placed in an `era` folder next to `db` in the data directory. What's interesting in particular about this setup is that `db` remains as the source of truth for security purposes - it stores the latest synced head root which in turn determines where a node "starts" its consensus participation - the era directory however can be freely shared between nodes / people without any (significant) security implications, assuming the era files are consistent / not broken. 
There's lots of future improvements to be had: * we can drop the in-memory `BlockRef` index almost entirely - at this point, resident memory usage of Nimbus should drop to a cool 500-600 mb * we could serve era files via REST trivially: this would drop backfill times to whatever time it takes to download the files - unlike the current implementation that downloads block by block, downloading an era at a time almost entirely cuts out request overhead * we can "reasonably" recreate detailed state history from almost any point in time, turning an O(slot) process into O(1) effectively - we'll still need caches and indices to do this with sufficient efficiency for the rest api, but at least it cuts the whole process down to minutes instead of hours, for arbitrary points in time * CI: ignore failures with Nim-1.6 (temporary) * test fixes Co-authored-by: Ștefan Talpalaru <stefantalpalaru@yahoo.com>	2022-03-23 09:58:17 +01:00
Etan Kissling	49673c4410	refer to `syncCommitteeMsgPool` consistently (#3537 ) Updates a `sync_committee_msg_pool` reference to camelCase.	2022-03-23 07:46:48 +01:00
Etan Kissling	b2b7b0bd56	serve libp2p protocol for light client sync (#3341 ) This extends the `--serve-light-client-data` launch option to serve locally collected light client data via libp2p. Backfill of historic best `LightClientUpdate` is not yet implemented. See https://github.com/ethereum/consensus-specs/pull/2802	2022-03-22 21:23:36 +01:00
Jacek Sieka	70270eeabe	better error messages on directory creation failure (#3536 )	2022-03-22 17:06:21 +00:00
tersec	e7d017b50c	remove AppVeyor, Travis, and Azure CI pipeline definitions (#3535 )	2022-03-22 11:16:35 +00:00
Jacek Sieka	4a237cb908	enable chronosStrictException (#3533 ) * bump nim-json-rpc	2022-03-22 09:42:28 +01:00
Etan Kissling	8cc394ba49	extract DAG dependent init to own function (#3530 ) During operation as a light client, the chain DAG is not available. As a preparation, the beacon node initialization logic is divided into parts depending on the presence of the chain DAG, and parts that are always available (including a future light client mode). This is a pure code move without semantic changes.	2022-03-21 17:52:15 +01:00
Jacek Sieka	361596c719	harden head update against missing parent (#3529 ) in case BlockRef ends up in some lifetime leak * fix duplicate head logging	2022-03-21 15:18:05 +01:00
Jacek Sieka	13fafe3a40	simplify unviable head pruning (#3528 ) Also note bug that exists that potentially prevents states from being pruned correctly	2022-03-21 09:20:26 +00:00
Etan Kissling	fd1ffd62dd	update light client server for DAG failure modes (#3514 ) Gracefully handles the new failure modes recently introduced to the DAG as part of https://github.com/status-im/nimbus-eth2/pull/3513 Data that is deemed to exist but fails to load leads to an error log to avoid suppressing logic errors accidentally. In `verifyFinalization` mode, the assertions remain active.	2022-03-20 11:58:59 +01:00
Etan Kissling	04b851f775	fix light client data pruning (#3523 ) When eliminating orphaned forks, light client data about blocks was also deleted when the orphaned fork was referring to a state several slots after the block. Linking light client data pruning with block deletion instead of state deletion fixes this problem. Light client data always refers to blocks and their immediate post-state.	2022-03-20 10:09:43 +01:00
Etan Kissling	ca045900c8	extract chain DAG loading to separate function (#3527 ) When transitioning from light client to full node the chain DAG will be loaded separately from the rest of the beacon node initialization. Extracting chain DAG loading to a separate function will allow reusing a lot of the existing code. This code move doesn't change semantics.	2022-03-19 17:48:24 +01:00
Jacek Sieka	ea1acd7397	fix loading when finalized checkpoint slot is missing block (#3525 ) ref loop would stop one block early in this case - trying to load everything in one loop ends up being pretty confusing.. * simplify finalizedBlocks topup by splitting it from the head loop / query	2022-03-19 11:02:17 +00:00
Jacek Sieka	e418497bb2	make attestation duty minimum offset relative to slot length (#3522 )	2022-03-19 09:59:13 +01:00
Etan Kissling	637f1e2be6	simplify `computeEarliestLightClientSlot` (#3524 ) Combine DAG and LC import tails in `computeEarliestLightClientSlot`.	2022-03-19 09:58:55 +01:00
Ștefan Talpalaru	ea5c052016	enable multithreading by default (10-20% faster sync) (#3493 )	2022-03-19 08:59:10 +01:00
Ștefan Talpalaru	a1f3adc3e2	bump vendor/nimbus-build-system (#3526 ) * bump vendor/nimbus-build-system	2022-03-19 08:58:05 +01:00
Etan Kissling	18bd6df1b4	fix light client data collection for checkpoint sync (#3498 ) When doing checkpoint sync, collecting light client data of known blocks and states incorrectly assumes that `finalized_checkpoint` information is also known. Hardens collection to only collect finalized checkpoint data after `dag.computeEarliestLightClientSlot`.	2022-03-18 15:47:53 +01:00
Jacek Sieka	d0223d1f28	fix finalized epoch ref loading on checkpoint start (#3517 ) regression from #3513 that did not take tail into consideration when loading epoch ancestor	2022-03-18 13:13:57 +01:00
tersec	d11d61c745	engine API alpha.7 -> alpha.8 and a few remaining v1.1.9 to v1.1.0 CL spec URL updates (#3519 )	2022-03-18 11:46:39 +00:00
Jacek Sieka	0db1e768e4	don't write `node-metadata.json` on startup (#3515 ) This file is not actually used / useful - should metadata persistence support be added in the future, it needs to be done with a new file such that downgrades, that have the TODO logic unimplemented, don't break.	2022-03-18 12:36:50 +01:00
Jacek Sieka	b3d80827fb	tns: checkpoint wal periodically while backfilling (#3516 ) Without this, we end up with a massive .wal file that needs to be checkpointed on first startup (which takes a few minutes) - it's much more efficient to do smaller checkpoints, it turns out.	2022-03-18 12:32:20 +01:00
Jacek Sieka	8395f7de8c	increase after-block attestation delay (#3518 ) Recently, block processing times have been going up as the network grows making early attestation riskier. Since blocks are big and attestations are small (though numerous and therefore bandwidth-intense), it seems better to wait a little bit longer after receiving a block, before we publish the attestation.	2022-03-18 11:02:32 +00:00
Etan Kissling	12dc427535	introduce light client processor (#3509 ) Adds `LightClientProcessor` as the counterpart to `BlockProcessor` while operating in light client mode. Note that a similar mechanism based on async futures is used for interoperability with existing infrastructure, despite light client object validation being done synchronously.	2022-03-17 23:26:56 +01:00
Etan Kissling	9f8894fb43	broadcast optimistic light client updates (#3499 ) After proposing a new block, broadcasts a `OptimisticLightClientUpdate`. Works for both locally proposed blocks as well as VC submitted ones.	2022-03-17 21:11:29 +01:00
Jacek Sieka	05ffe7b2bf	Prune `BlockRef` on finalization (#3513 ) Up until now, the block dag has been using `BlockRef`, a structure adapted for a full DAG, to represent all of chain history. This is a correct and simple design, but does not exploit the linearity of the chain once parts of it finalize. By pruning the in-memory `BlockRef` structure at finalization, we save, at the time of writing, a cool ~250mb (or 25%:ish) chunk of memory landing us at a steady state of ~750mb normal memory usage for a validating node. Above all though, we prevent memory usage from growing proportionally with the length of the chain, something that would not be sustainable over time - instead, the steady state memory usage is roughly determined by the validator set size which grows much more slowly. With these changes, the core should remain sustainable memory-wise post-merge all the way to withdrawals (when the validator set is expected to grow). In-memory indices are still used for the "hot" unfinalized portion of the chain - this ensures that consensus performance remains unchanged. What changes is that for historical access, we use a db-based linear slot index which is cache-and-disk-friendly, keeping the cost for accessing historical data at a similar level as before, achieving the savings at no perceivable cost to functionality or performance. A nice collateral benefit is the almost-instant startup since we no longer load any large indices at dag init. The cost of this functionality instead can be found in the complexity of having to deal with two ways of traversing the chain - by `BlockRef` and by slot. 
* use `BlockId` instead of `BlockRef` where finalized / historical data may be required * simplify clearance pre-advancement * remove dag.finalizedBlocks (~50:ish mb) * remove `getBlockAtSlot` - use `getBlockIdAtSlot` instead * `parent` and `atSlot` for `BlockId` now require a `ChainDAGRef` instance, unlike `BlockRef` traversal * prune `BlockRef` parents on finality (~200:ish mb) * speed up ChainDAG init by not loading finalized history index * mess up light client server error handling - this needs revisiting :)	2022-03-17 17:42:56 +00:00
Etan Kissling	9a2b50d2c6	allow tagging light client specific libp2p messages (#3485 ) The pre-release light client sync protocol defines additional Req/Resp messages to be made available when `--serve-light-client-data` is set. This patch extends the `{.libp2pProtocol.}` pragma with an optional parameter to tag such light client sync protocol specific messages. The corresponding protocols are only selectively registered with libp2p.	2022-03-17 16:09:18 +02:00
Jacek Sieka	8a63efc413	move `BlockId` to `spec` (#3511 ) The spec implicitly talks about the slot of a block in several places, and keeping it readily available is useful in a number of contexts - might as well put this implicitly referenced helper in the spec code directly	2022-03-16 16:00:18 +01:00
Etan Kissling	88af3f2797	update to latest light client spec (#3508 ) Adds the additional check to ensure `optimistic_header` is always after `finalized_header` in `LightClientStore`, as introduced to the spec in https://github.com/ethereum/consensus-specs/pull/2814	2022-03-16 12:56:38 +01:00
tersec	8fbcf29775	update unchanged specs/phase0/p2p-interface.md URL references from v1.1.9 to v1.1.10 (#3510 )	2022-03-16 10:40:35 +00:00
Jacek Sieka	c64bf045f3	remove StateData (#3507 ) One more step on the journey to reduce `BlockRef` usage across the codebase - this one gets rid of `StateData` whose job was to keep track of which block was last assigned to a state - these duties have now been taken over by `latest_block_root`, a fairly recent addition that computes this block root from state data (at a small cost that should be insignificant) 99% mechanical change.	2022-03-16 08:20:40 +01:00
Etan Kissling	6d1d31dd01	avoid re-requesting finalized blocks during sync (#3461 ) When a `beaconBlocksByRange` response advances the `safeSlot`, but later has errors, the sync queue keeps repeating that same request until it is fulfilled without errors. Data up through `safeSlot` is considered to be immutable, i.e., finalized, so re-requesting that data is not useful. By advancing the sync progress in that scenario, those redundant query portions can be avoided. Note, the finalized block _itself_ is always requested, even in the initial request. This behaviour is kept same.	2022-03-15 18:56:56 +01:00
Ștefan Talpalaru	725692544e	Makefile: fix "gnosis-chain-build" (#3503 )	2022-03-15 14:56:01 +01:00
tersec	3f0a5026a4	bump nim-web3 for request header callbacks for JWT (#3496 )	2022-03-15 09:40:04 +00:00
Jacek Sieka	a3bd01b58d	move dependent root computations to `BeaconState` / `EpochRef` (#3478 ) * fewer deps on `BlockRef` traversal in anticipation of pruning * allows identifying EpochRef:s by their shuffling as a first step of * tighten error handling around missing blocks using the zero hash for signalling "missing block" is fragile and easy to miss - with checkpoint sync now, and pruning in the future, missing blocks become "normal".	2022-03-15 09:24:55 +01:00
tersec	a92b175bcc	increase Jenkins timeout (#3497 )	2022-03-14 15:49:47 +00:00
tersec	aace7086d3	bump nim-stew (#3492 )	2022-03-14 15:08:02 +00:00
Etan Kissling	a08114e996	libp2p light client gossip validation (#3486 ) When `--serve-light-client-data` is specified, provides stability on the `optimistic_light_client_update` GossipSub topic.	2022-03-14 14:05:38 +01:00
tersec	f550eb2f17	fix two typos (#3491 )	2022-03-14 12:50:23 +00:00
Etan Kissling	29e5a4a752	error and progress codes for light client sync (#3490 ) When syncing as a light client, different behaviour is needed to handle the various ways how errors may occur. The existing logic for blocks can also be applied to light client objects: - `Invalid`: Malformed object that is clearly an error by its producer. - `MissingParent`: More data is needed to decide applicability. - `UnviableFork`: Object may be valid but will never apply on this fork. - `Duplicate`: No errors were encountered but the object was not useful.	2022-03-14 10:25:54 +01:00
Ștefan Talpalaru	276762958e	Windows: disable status bar (#3484 ) It can randomly lock inside Windows terminal emulators. Better play it safe.	2022-03-14 10:19:50 +01:00
Dustin Brody	346407ef1c	running Nimbus on Kiln	2022-03-13 19:39:11 +00:00
Mamy Ratsimbazafy	9fd7305e26	Cleanup RPC pubkey handling (#3489 )	2022-03-13 08:12:45 +01:00
Etan Kissling	89ac586bd4	serve light client data in CI / dev builds (#3487 ) Adjust config for CI / dev builds to serve light client data by default: `--serve-light-client-data=1 --import-light-client-data=only-new`	2022-03-12 22:12:18 +01:00
dependabot[bot]	280492873f	Bump pillow from 9.0.0 to 9.0.1 in /ncli (#3488 ) Bumps [pillow](https://github.com/python-pillow/Pillow) from 9.0.0 to 9.0.1. - [Release notes](https://github.com/python-pillow/Pillow/releases) - [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst) - [Commits](https://github.com/python-pillow/Pillow/compare/9.0.0...9.0.1) --- updated-dependencies: - dependency-name: pillow dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-03-12 11:02:52 +01:00
Etan Kissling	ae408c279a	add option to collect light client data (#3474 ) Light clients require full nodes to serve additional data so that they can stay in sync with the network. This patch adds a new launch option `--import-light-client-data` to configure what data to make available. For now, data is only kept in memory; it is not persisted at this time. Note that data is only locally collected; a separate patch is needed to actually make it available over the network. `--serve-light-client-data` will be used for serving data, but is not functional yet outside tests.	2022-03-11 21:28:10 +01:00
tersec	21b71bd29c	update URL and document Nim bug blocking further genericizing cleanups (#3483 )	2022-03-11 15:03:47 +00:00
Jacek Sieka	d0183ccd77	Historical state reindex for trusted node sync (#3452 ) When performing trusted node sync, historical access is limited to states after the checkpoint. Reindexing restores full historical access by replaying historical blocks against the state and storing snapshots in the database. The process can be initiated or resumed at any point in time.	2022-03-11 12:49:47 +00:00
Ștefan Talpalaru	857a71be6c	launch_local_testnet.sh: Lighthouse VC nodes (#3477 ) * launch_local_testnet.sh: Lighthouse VC nodes	2022-03-11 13:44:56 +01:00

4336 Commits

All Branches