logos-blockchain-pocs/da/da_calculators/README_da_sampling.md

3.3 KiB
Raw Blame History

DA Sampling Calculator

Live: da_calculator.html

Interactive calculator for the Logos Blockchain data availability sampling protocol. Models sampling as a binary hypothesis test and allows interactive exploration of how protocol parameters affect Type I error, Type II error, grey zone width, wasted slots, and blockchain security horizon.


Background

Logos Blockchain DA uses column sampling to verify data availability without downloading the entire dataset. A node draws S columns uniformly at random and applies a threshold decision rule: declare the blob available if at least τ out of S samples succeed.

This produces two error types that cannot be minimised independently:

  • Type I error α(τ) — data is unrecoverable but sampling concludes it is recoverable. Threatens chain safety.
  • Type II error β(τ, Δ) — data is recoverable but sampling concludes it is unrecoverable. Wastes slots, threatens liveness.

The calculator computes exact hypergeometric expressions for both errors and derives all downstream quantities from them.


Parameters

Parameter Description Default
N Total columns = total subnetworks (N = r·K) 2048
r RS expansion factor 2
K Reconstruction threshold = N/r 1024
S Sample size (columns drawn per round) 20
τ Acceptance threshold (declare available if ≥ τ successes) 20
Δ Grey zone width (derived from ε target) 500
ε Error bound target 10⁻⁴
N_B Blobs per block 1024
n Number of validator nodes 50
f Slot fill rate 1/30
T Slots per epoch 388800

Tabs

Detection Probability

Shows P(data detected as recoverable) as a function of N_A/N (fraction of available columns). Illustrates the grey zone between unrecoverable (N_A ≤ K) and certified recoverable (N_A ≥ K+Δ) regions.

α & β Curves

Plots α(τ) and β(τ, Δ) on a log₁₀ scale as functions of the threshold τ. The intersection of the two curves shows the jointly optimal τ* that minimises max{α, β}.

Multi-Δ Overlay

Compares α and β curves for multiple Δ values simultaneously, showing how the grey zone width affects the tradeoff.

τ* vs ε Sweep

Plots the jointly optimal τ*(ε) and Δ*(ε) as functions of the error bound ε for multiple sample sizes S. Shows how tightening ε requires increasing Δ.

Network Bounds

Shows the network-level error bounds as a function of validator count n:

  • Left panel: P(majority accepts unrecoverable block) = 2ⁿ · ε^{N_B·⌈n/2⌉}
  • Right panel: P(majority rejects recoverable block) = 2ⁿ · [1(1ε)^{N_B}]^{⌈n/2⌉}

Chernoff Bound

Plots the Chernoff-based bounds for the grey zone regime.

Block Builder

Shows the q-quantile of the hitting time τ_{N_B,q} — the number of blobs a block builder must consider to fill a block with N_B valid blobs, as a function of ε and N_B.

Wasted Slots

Shows the average number of wasted slots per epoch as a function of validator count n, across three regimes (Code 2 basic upper bound, Code 3 tight Chernoff upper bound, Code 4 lower bound).

Blockchain

Shows the median time T₁/₂ to the first invalid block accepted by a majority of validators, as a function of validator count n and error bound ε.