2018-11-14 16:01:57 -05:00
# Ethereum 2.0 Phase 1 -- Shard Data Chains
2019-05-06 10:30:32 -05:00
**Notice**: This document is a work-in-progress for researchers and implementers.
2018-11-14 16:01:57 -05:00
2019-05-06 10:30:32 -05:00
## Table of contents
2019-02-19 05:26:35 -06:00
<!-- TOC -->
2019-05-06 10:30:32 -05:00
- [Ethereum 2.0 Phase 1 -- Shard Data Chains ](#ethereum-20-phase-1----shard-data-chains )
- [Table of contents ](#table-of-contents )
2019-03-28 17:56:43 -05:00
- [Introduction ](#introduction )
- [Constants ](#constants )
- [Misc ](#misc )
- [Time parameters ](#time-parameters )
- [Signature domains ](#signature-domains )
2019-03-17 06:44:19 -05:00
- [Data structures ](#data-structures )
2019-03-28 17:56:43 -05:00
- [`ShardBlockBody` ](#shardblockbody )
2019-05-07 13:23:28 +01:00
- [`ShardAttestation` ](#shardattestation )
2019-03-28 17:56:43 -05:00
- [`ShardBlock` ](#shardblock )
- [`ShardBlockHeader` ](#shardblockheader )
- [Helper functions ](#helper-functions )
- [`get_period_committee` ](#get_period_committee )
2019-03-31 17:49:02 -05:00
- [`get_switchover_epoch` ](#get_switchover_epoch )
2019-03-28 17:56:43 -05:00
- [`get_persistent_committee` ](#get_persistent_committee )
- [`get_shard_proposer_index` ](#get_shard_proposer_index )
- [`get_shard_header` ](#get_shard_header )
- [`verify_shard_attestation_signature` ](#verify_shard_attestation_signature )
- [`compute_crosslink_data_root` ](#compute_crosslink_data_root )
- [Object validity ](#object-validity )
- [Shard blocks ](#shard-blocks )
- [Shard attestations ](#shard-attestations )
- [Beacon attestations ](#beacon-attestations )
- [Shard fork choice rule ](#shard-fork-choice-rule )
2019-02-19 05:26:35 -06:00
<!-- /TOC -->
2019-03-28 17:56:43 -05:00
## Introduction
2018-11-14 16:01:57 -05:00
2019-03-28 17:56:43 -05:00
This document describes the shard data layer and the shard fork choice rule in Phase 1 of Ethereum 2.0.
2018-11-14 16:01:57 -05:00
2019-03-28 17:56:43 -05:00
## Constants
2018-11-14 16:01:57 -05:00
2019-03-28 17:56:43 -05:00
### Misc
2018-11-14 16:01:57 -05:00
2019-03-28 17:56:43 -05:00
| Name | Value |
| - | - |
| `BYTES_PER_SHARD_BLOCK_BODY` | `2**14` (= 16,384) |
| `MAX_SHARD_ATTESTIONS` | `2**4` (= 16) |
| `PHASE_1_GENESIS_EPOCH` | **TBD** |
| `PHASE_1_GENESIS_SLOT` | get_epoch_start_slot(PHASE_1_GENESIS_EPOCH) |
2018-11-14 16:01:57 -05:00
2019-03-28 17:56:43 -05:00
### Time parameters
2018-11-14 16:01:57 -05:00
2019-03-28 17:56:43 -05:00
| Name | Value | Unit | Duration |
| - | - | :-: | :-: |
2019-05-07 12:13:22 +01:00
| `CROSSLINK_LOOKBACK` | `2**0` (= 1) | epochs | 6.2 minutes |
| `PERSISTENT_COMMITTEE_PERIOD` | `2**11` (= 2,048) | epochs | ~9 days |
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
### Signature domains
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
| Name | Value |
| - | - |
| `DOMAIN_SHARD_PROPOSER` | `128` |
| `DOMAIN_SHARD_ATTESTER` | `129` |
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
## Data structures
### `ShardBlockBody`
```python
2019-05-07 13:23:28 +01:00
{
'data': ['byte', BYTES_PER_SHARD_BLOCK_BODY],
}
```
### `ShardAttestation`
```python
{
'data': {
'slot': Slot,
'shard': Shard,
'shard_block_root': 'bytes32',
},
'aggregation_bitfield': 'bytes',
'aggregate_signature': BLSSignature,
}
2019-03-28 17:56:43 -05:00
```
### `ShardBlock`
```python
{
'slot': Slot,
'shard': Shard,
2019-05-07 13:23:28 +01:00
'beacon_chain_root': 'bytes32',
2019-05-09 01:00:25 +01:00
'parent_root': 'bytes32',
2019-03-28 17:56:43 -05:00
'data': ShardBlockBody,
2019-05-07 13:23:28 +01:00
'state_root': 'bytes32',
2019-03-28 17:56:43 -05:00
'attestations': [ShardAttestation],
'signature': BLSSignature,
}
```
### `ShardBlockHeader`
```python
{
'slot': Slot,
'shard': Shard,
2019-05-07 13:23:28 +01:00
'beacon_chain_root': 'bytes32',
2019-05-09 01:00:25 +01:00
'parent_root': 'bytes32',
2019-05-07 13:23:28 +01:00
'body_root': 'bytes32',
'state_root': 'bytes32',
2019-03-28 17:56:43 -05:00
'attestations': [ShardAttestation],
'signature': BLSSignature,
}
```
2019-02-08 03:54:02 -06:00
## Helper functions
2019-03-28 17:56:43 -05:00
### `get_period_committee`
2019-02-12 19:11:45 +08:00
```python
2019-05-01 09:09:24 +01:00
def get_period_committee(state: BeaconState, epoch: Epoch, shard: Shard, index: int, count: int) -> List[ValidatorIndex]:
2019-02-12 19:11:45 +08:00
"""
2019-03-28 17:56:43 -05:00
Return committee for a period. Used to construct persistent committees.
2019-02-12 19:11:45 +08:00
"""
2019-05-01 15:21:38 +01:00
return compute_committee(
indices=get_active_validator_indices(state, epoch),
seed=generate_seed(state, epoch),
index=shard * count + index,
count=SHARD_COUNT * count,
)
2019-02-12 19:11:45 +08:00
```
2019-03-31 17:49:02 -05:00
### `get_switchover_epoch`
```python
def get_switchover_epoch(state: BeaconState, epoch: Epoch, index: ValidatorIndex):
earlier_start_epoch = epoch - (epoch % PERSISTENT_COMMITTEE_PERIOD) - PERSISTENT_COMMITTEE_PERIOD * 2
return bytes_to_int(hash(generate_seed(state, earlier_start_epoch) + bytes3(index))[0:8]) % PERSISTENT_COMMITTEE_PERIOD
```
2019-03-28 17:56:43 -05:00
### `get_persistent_committee`
2019-02-08 03:54:02 -06:00
```python
2019-02-12 19:11:45 +08:00
def get_persistent_committee(state: BeaconState,
2019-02-14 16:02:01 -07:00
shard: Shard,
2019-03-17 06:44:19 -05:00
slot: Slot) -> List[ValidatorIndex]:
2019-02-08 22:10:54 -06:00
"""
2019-03-17 06:44:19 -05:00
Return the persistent committee for the given ``shard`` at the given ``slot` `.
2019-02-08 22:10:54 -06:00
"""
2019-04-02 22:17:55 +04:00
epoch = slot_to_epoch(slot)
2019-03-17 06:44:19 -05:00
earlier_start_epoch = epoch - (epoch % PERSISTENT_COMMITTEE_PERIOD) - PERSISTENT_COMMITTEE_PERIOD * 2
later_start_epoch = epoch - (epoch % PERSISTENT_COMMITTEE_PERIOD) - PERSISTENT_COMMITTEE_PERIOD
committee_count = max(
len(get_active_validator_indices(state.validator_registry, earlier_start_epoch)) //
(SHARD_COUNT * TARGET_COMMITTEE_SIZE),
len(get_active_validator_indices(state.validator_registry, later_start_epoch)) //
(SHARD_COUNT * TARGET_COMMITTEE_SIZE),
) + 1
2019-05-01 09:09:24 +01:00
2019-03-17 06:44:19 -05:00
index = slot % committee_count
2019-03-28 17:56:43 -05:00
earlier_committee = get_period_committee(state, shard, earlier_start_epoch, index, committee_count)
later_committee = get_period_committee(state, shard, later_start_epoch, index, committee_count)
2019-02-12 19:11:45 +08:00
2019-02-10 00:09:34 -06:00
# Take not-yet-cycled-out validators from earlier committee and already-cycled-in validators from
# later committee; return a sorted list of the union of the two, deduplicated
return sorted(list(set(
2019-03-31 17:49:02 -05:00
[i for i in earlier_committee if epoch % PERSISTENT_COMMITTEE_PERIOD < get_switchover_epoch ( state , epoch , i ) ] +
[i for i in later_committee if epoch % PERSISTENT_COMMITTEE_PERIOD >= get_switchover_epoch(state, epoch, i)]
2019-02-10 00:09:34 -06:00
)))
2019-02-08 03:54:02 -06:00
```
2019-03-17 06:44:19 -05:00
2019-03-28 17:56:43 -05:00
### `get_shard_proposer_index`
2019-02-08 03:54:02 -06:00
2019-02-10 15:44:58 -06:00
```python
def get_shard_proposer_index(state: BeaconState,
2019-02-14 16:02:01 -07:00
shard: Shard,
slot: Slot) -> ValidatorIndex:
2019-03-28 17:56:43 -05:00
# Randomly shift persistent committee
2019-03-17 06:44:19 -05:00
persistent_committee = get_persistent_committee(state, shard, slot)
2019-05-07 10:57:41 +01:00
seed = hash(state.current_shuffling_seed + int_to_bytes(shard, length=8) + int_to_bytes(slot, length=8))
2019-03-28 17:56:43 -05:00
random_index = bytes_to_int(seed[0:8]) % len(persistent_committee)
persistent_committee = persistent_committee[random_index:] + persistent_committee[:random_index]
# Search for an active proposer
for index in persistent_committee:
2019-03-17 06:44:19 -05:00
if is_active_validator(state.validator_registry[index], get_current_epoch(state)):
2019-02-10 15:44:58 -06:00
return index
2018-11-25 08:06:37 -05:00
2019-03-28 17:56:43 -05:00
# No block can be proposed if no validator is active
return None
2018-11-14 16:01:57 -05:00
```
2019-03-28 17:56:43 -05:00
### `get_shard_header`
2018-11-14 16:01:57 -05:00
```python
2019-03-28 17:56:43 -05:00
def get_shard_header(block: ShardBlock) -> ShardBlockHeader:
return ShardBlockHeader(
2019-05-07 12:13:22 +01:00
slot=block.slot,
shard=block.shard,
beacon_chain_root=block.beacon_chain_root,
2019-05-09 01:00:25 +01:00
parent_root=block.parent_root,
2019-05-07 12:13:22 +01:00
body_root=hash_tree_root(block.body),
state_root=block.state_root,
attestations=block.attestations,
signature=block.signature,
2019-02-10 10:17:21 -06:00
)
2018-11-14 16:01:57 -05:00
```
2019-03-28 17:56:43 -05:00
### `verify_shard_attestation_signature`
```python
def verify_shard_attestation_signature(state: BeaconState,
attestation: ShardAttestation) -> None:
data = attestation.data
2019-05-05 12:10:39 +01:00
persistent_committee = get_persistent_committee(state, data.crosslink.shard, data.slot)
2019-03-28 17:56:43 -05:00
assert verify_bitfield(attestation.aggregation_bitfield, len(persistent_committee))
pubkeys = []
for i, index in enumerate(persistent_committee):
2019-05-07 12:13:22 +01:00
if get_bitfield_bit(attestation.aggregation_bitfield, i) == 0b1:
2019-03-28 17:56:43 -05:00
validator = state.validator_registry[index]
assert is_active_validator(validator, get_current_epoch(state))
pubkeys.append(validator.pubkey)
assert bls_verify(
pubkey=bls_aggregate_pubkeys(pubkeys),
2019-05-05 12:10:39 +01:00
message_hash=data.crosslink.shard_block_root,
2019-03-28 17:56:43 -05:00
signature=attestation.aggregate_signature,
domain=get_domain(state, slot_to_epoch(data.slot), DOMAIN_SHARD_ATTESTER)
)
2019-02-19 05:26:35 -06:00
```
2019-03-28 17:56:43 -05:00
### `compute_crosslink_data_root`
2019-02-19 05:26:35 -06:00
```python
2019-05-21 12:41:24 +02:00
def compute_crosslink_data_root(blocks: List[ShardBlock]) -> Bytes32:
2019-03-28 17:56:43 -05:00
def is_power_of_two(value: int) -> bool:
return (value > 0) and (value & (value - 1) == 0)
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
def pad_to_power_of_2(values: List[bytes]) -> List[bytes]:
while not is_power_of_two(len(values)):
values += [b'\x00' * BYTES_PER_SHARD_BLOCK_BODY]
return values
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
def merkle_root_of_bytes(data: bytes) -> bytes:
return merkle_root([data[i:i + 32] for i in range(0, len(data), 32)])
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
return hash(
merkle_root(pad_to_power_of_2([
merkle_root_of_bytes(zpad(serialize(get_shard_header(block)), BYTES_PER_SHARD_BLOCK_BODY)) for block in blocks
])) +
merkle_root(pad_to_power_of_2([
merkle_root_of_bytes(block.body) for block in blocks
]))
2019-02-19 05:26:35 -06:00
)
```
2019-03-28 17:56:43 -05:00
## Object validity
2019-03-02 20:36:04 -06:00
2019-03-28 17:56:43 -05:00
### Shard blocks
2019-03-02 20:36:04 -06:00
2019-03-28 17:56:43 -05:00
Let:
2019-03-02 20:36:04 -06:00
2019-03-28 17:56:43 -05:00
* `beacon_blocks` be the `BeaconBlock` list such that `beacon_blocks[slot]` is the canonical `BeaconBlock` at slot `slot`
* `beacon_state` be the canonical `BeaconState` after processing `beacon_blocks[-1]`
* `valid_shard_blocks` be the list of valid `ShardBlock` , recursively defined
* `unix_time` be the current unix time
* `candidate` be a candidate `ShardBlock` for which validity is to be determined by running `is_valid_shard_block`
2019-02-19 05:26:35 -06:00
```python
2019-03-28 17:56:43 -05:00
def is_valid_shard_block(beacon_blocks: List[BeaconBlock],
beacon_state: BeaconState,
valid_shard_blocks: List[ShardBlock],
2019-05-07 13:23:28 +01:00
unix_time: int,
2019-05-07 12:13:22 +01:00
candidate: ShardBlock) -> bool:
2019-03-28 17:56:43 -05:00
# Check if block is already determined valid
for _, block in enumerate(valid_shard_blocks):
if candidate == block:
return True
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
# Check slot number
assert block.slot >= PHASE_1_GENESIS_SLOT
assert unix_time >= beacon_state.genesis_time + (block.slot - GENESIS_SLOT) * SECONDS_PER_SLOT
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
# Check shard number
assert block.shard < = SHARD_COUNT
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
# Check beacon block
beacon_block = beacon_blocks[block.slot]
2019-04-08 09:51:13 +08:00
assert block.beacon_block_root == signing_root(beacon_block)
2019-05-07 12:13:22 +01:00
assert beacon_block.slot < = block.slot
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
# Check state root
assert block.state_root == ZERO_HASH # [to be removed in phase 2]
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
# Check parent block
if block.slot == PHASE_1_GENESIS_SLOT:
2019-05-06 20:49:46 +01:00
assert candidate.parent_root == ZERO_HASH
2019-03-28 17:56:43 -05:00
else:
parent_block = next(
2019-05-07 12:13:22 +01:00
(block for block in valid_shard_blocks if
2019-05-09 01:00:25 +01:00
signing_root(block) == candidate.parent_root)
2019-03-28 17:56:43 -05:00
, None)
assert parent_block != None
assert parent_block.shard == block.shard
assert parent_block.slot < block.slot
2019-04-08 09:51:13 +08:00
assert signing_root(beacon_blocks[parent_block.slot]) == parent_block.beacon_chain_root
2019-03-28 17:56:43 -05:00
# Check attestations
assert len(block.attestations) < = MAX_SHARD_ATTESTIONS
for _, attestation in enumerate(block.attestations):
assert max(GENESIS_SHARD_SLOT, block.slot - SLOTS_PER_EPOCH) < = attestation.data.slot
2019-04-14 09:54:35 +10:00
assert attestation.data.slot < = block.slot - MIN_ATTESTATION_INCLUSION_DELAY
2019-05-05 12:10:39 +01:00
assert attestation.data.crosslink.shard == block.shard
2019-03-28 17:56:43 -05:00
verify_shard_attestation_signature(beacon_state, attestation)
# Check signature
proposer_index = get_shard_proposer_index(beacon_state, block.shard, block.slot)
assert proposer_index is not None
assert bls_verify(
pubkey=validators[proposer_index].pubkey,
2019-04-08 09:51:13 +08:00
message_hash=signing_root(block),
2019-03-28 17:56:43 -05:00
signature=block.signature,
domain=get_domain(beacon_state, slot_to_epoch(block.slot), DOMAIN_SHARD_PROPOSER)
2019-03-02 20:36:04 -06:00
)
2019-03-28 17:56:43 -05:00
return True
2019-03-02 20:36:04 -06:00
```
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
### Shard attestations
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
Let:
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
* `valid_shard_blocks` be the list of valid `ShardBlock`
* `beacon_state` be the canonical `BeaconState`
* `candidate` be a candidate `ShardAttestation` for which validity is to be determined by running `is_valid_shard_attestation`
2019-03-02 20:36:04 -06:00
```python
2019-03-28 17:56:43 -05:00
def is_valid_shard_attestation(valid_shard_blocks: List[ShardBlock],
beacon_state: BeaconState,
candidate: Attestation) -> bool:
# Check shard block
shard_block = next(
2019-05-07 12:13:22 +01:00
(block for block in valid_shard_blocks if
2019-05-09 01:00:25 +01:00
signing_root(block) == candidate.attestation.data.crosslink.shard_block_root)
2019-03-28 17:56:43 -05:00
, None)
assert shard_block != None
assert shard_block.slot == attestation.data.slot
2019-05-05 12:10:39 +01:00
assert shard_block.shard == attestation.data.crosslink.shard
2019-03-02 20:36:04 -06:00
2019-03-28 17:56:43 -05:00
# Check signature
verify_shard_attestation_signature(beacon_state, attestation)
2019-03-02 20:36:04 -06:00
2019-03-28 17:56:43 -05:00
return True
2019-03-02 20:36:04 -06:00
```
2019-03-28 17:56:43 -05:00
### Beacon attestations
2019-03-02 20:36:04 -06:00
2019-03-28 17:56:43 -05:00
Let:
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
* `shard` be a valid `Shard`
* `shard_blocks` be the `ShardBlock` list such that `shard_blocks[slot]` is the canonical `ShardBlock` for shard `shard` at slot `slot`
* `beacon_state` be the canonical `BeaconState`
* `valid_attestations` be the list of valid `Attestation` , recursively defined
2019-05-06 10:30:32 -05:00
* `candidate` be a candidate `Attestation` which is valid under Phase 0 rules, and for which validity is to be determined under Phase 1 rules by running `is_valid_beacon_attestation`
2019-02-19 05:26:35 -06:00
```python
2019-03-28 17:56:43 -05:00
def is_valid_beacon_attestation(shard: Shard,
shard_blocks: List[ShardBlock],
beacon_state: BeaconState,
valid_attestations: List[Attestation],
candidate: Attestation) -> bool:
# Check if attestation is already determined valid
for _, attestation in enumerate(valid_attestations):
if candidate == attestation:
return True
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
# Check previous attestation
if candidate.data.previous_crosslink.epoch < = PHASE_1_GENESIS_EPOCH:
2019-05-06 18:26:14 +01:00
assert candidate.data.previous_crosslink.data_root == ZERO_HASH
2019-02-19 05:26:35 -06:00
else:
2019-03-28 17:56:43 -05:00
previous_attestation = next(
2019-05-07 12:13:22 +01:00
(attestation for attestation in valid_attestations if
2019-05-09 01:00:25 +01:00
attestation.data.crosslink.data_root == candidate.data.previous_crosslink.data_root)
2019-03-28 17:56:43 -05:00
, None)
assert previous_attestation != None
assert candidate.data.previous_attestation.epoch < slot_to_epoch ( candidate . data . slot )
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
# Check crosslink data root
start_epoch = state.latest_crosslinks[shard].epoch
2019-05-05 12:10:39 +01:00
end_epoch = min(slot_to_epoch(candidate.data.slot) - CROSSLINK_LOOKBACK, start_epoch + MAX_EPOCHS_PER_CROSSLINK)
2019-03-28 17:56:43 -05:00
blocks = []
for slot in range(start_epoch * SLOTS_PER_EPOCH, end_epoch * SLOTS_PER_EPOCH):
blocks.append(shard_blocks[slot])
2019-05-06 18:26:14 +01:00
assert candidate.data.crosslink.data_root == compute_crosslink_data_root(blocks)
2019-02-19 05:26:35 -06:00
2019-03-28 17:56:43 -05:00
return True
2019-02-19 05:26:35 -06:00
```
2019-03-02 20:36:04 -06:00
2019-03-28 17:56:43 -05:00
## Shard fork choice rule
2019-03-02 20:36:04 -06:00
2019-04-16 12:03:22 -05:00
The fork choice rule for any shard is LMD GHOST using the shard attestations of the persistent committee and the beacon chain attestations of the crosslink committee currently assigned to that shard, but instead of being rooted in the genesis it is rooted in the block referenced in the most recent accepted crosslink (i.e. `state.crosslinks[shard].shard_block_root` ). Only blocks whose `beacon_chain_root` is the block in the main beacon chain at the specified `slot` should be considered. (If the beacon chain skips a slot, then the block at that slot is considered to be the block in the beacon chain at the highest slot lower than that slot.)