logos-blockchain-testing/book/src/scenario-lifecycle.md

# Scenario Lifecycle

A scenario progresses through six distinct phases, each with a specific responsibility:

```mermaid
flowchart TB
    subgraph Phase1["1. Build Phase"]
        Build[Define Scenario]
        BuildDetails["• Declare topology<br/>• Attach workloads<br/>• Add expectations<br/>• Set run duration"]
        Build --> BuildDetails
    end
    
    subgraph Phase2["2. Deploy Phase"]
        Deploy[Provision Environment]
        DeployDetails["• Launch nodes<br/>• Wait for readiness<br/>• Establish connectivity<br/>• Return Runner"]
        Deploy --> DeployDetails
    end
    
    subgraph Phase3["3. Capture Phase"]
        Capture[Baseline Metrics]
        CaptureDetails["• Snapshot initial state<br/>• Start BlockFeed<br/>• Initialize expectations"]
        Capture --> CaptureDetails
    end
    
    subgraph Phase4["4. Execution Phase"]
        Execute[Drive Workloads]
        ExecuteDetails["• Submit transactions<br/>• Disperse DA blobs<br/>• Trigger chaos events<br/>• Run for duration"]
        Execute --> ExecuteDetails
    end
    
    subgraph Phase5["5. Evaluation Phase"]
        Evaluate[Check Expectations]
        EvaluateDetails["• Verify liveness<br/>• Check inclusion<br/>• Validate outcomes<br/>• Aggregate results"]
        Evaluate --> EvaluateDetails
    end
    
    subgraph Phase6["6. Cleanup Phase"]
        Cleanup[Teardown]
        CleanupDetails["• Stop nodes<br/>• Remove containers<br/>• Collect logs<br/>• Release resources"]
        Cleanup --> CleanupDetails
    end
    
    Phase1 --> Phase2
    Phase2 --> Phase3
    Phase3 --> Phase4
    Phase4 --> Phase5
    Phase5 --> Phase6
    
    style Phase1 fill:#e1f5ff
    style Phase2 fill:#fff4e1
    style Phase3 fill:#f0ffe1
    style Phase4 fill:#ffe1f5
    style Phase5 fill:#e1ffe1
    style Phase6 fill:#ffe1e1
```

## Phase Details

### 1. Build the Plan

Declare a topology, attach workloads and expectations, and set the run window. The plan is the single source of truth for what will happen.

**Key actions:**
- Define cluster shape (validators, executors, network topology)
- Configure workloads (transaction rate, DA traffic, chaos patterns)
- Attach expectations (liveness, inclusion, custom checks)
- Set timing parameters (run duration, cooldown period)

**Output:** Immutable `Scenario` plan

### 2. Deploy

Hand the plan to a deployer. It provisions the environment on the chosen backend, waits for nodes to signal readiness, and returns a runner.

**Key actions:**
- Provision infrastructure (processes, containers, or pods)
- Launch validator and executor nodes
- Wait for readiness probes (HTTP endpoints respond)
- Establish node connectivity and metrics endpoints
- Spawn BlockFeed for real-time block observation

**Output:** `Runner` + `RunContext` (with node clients, metrics, control handles)

### 3. Capture Baseline

Expectations snapshot initial state before workloads begin.

**Key actions:**
- Record starting block height
- Initialize counters and trackers
- Subscribe to BlockFeed
- Capture baseline metrics

**Output:** Captured state for later comparison

### 4. Drive Workloads

The runner starts traffic and behaviors for the planned duration.

**Key actions:**
- Submit transactions at configured rates
- Disperse and sample DA blobs
- Trigger chaos events (node restarts, network partitions)
- Run concurrently for the specified duration
- Observe blocks and metrics in real-time

**Duration:** Controlled by `with_run_duration()`

### 5. Evaluate Expectations

Once activity stops (and optional cooldown completes), the runner checks liveness and workload-specific outcomes.

**Key actions:**
- Verify consensus liveness (minimum block production)
- Check transaction inclusion rates
- Validate DA dispersal and sampling
- Assess system recovery after chaos events
- Aggregate pass/fail results

**Output:** Success or detailed failure report

### 6. Cleanup

Tear down resources so successive runs start fresh and do not inherit leaked state.

**Key actions:**
- Stop all node processes/containers/pods
- Remove temporary directories and volumes
- Collect and archive logs (if `NOMOS_TESTS_KEEP_LOGS=1`)
- Release ports and network resources
- Cleanup observability stack (if spawned)

**Guarantee:** Runs even on panic via `CleanupGuard`
Initial import of Nomos testing framework 2025-12-01 12:48:39 +01:00			`# Scenario Lifecycle`

docs(book): update docs 2025-12-18 18:36:38 +01:00			`A scenario progresses through six distinct phases, each with a specific responsibility:`
Initial import of Nomos testing framework 2025-12-01 12:48:39 +01:00
			```mermaid
docs(book): update docs 2025-12-18 18:36:38 +01:00			`flowchart TB`
			`subgraph Phase1["1. Build Phase"]`
			`Build[Define Scenario]`
			`BuildDetails["• Declare topology<br/>• Attach workloads<br/>• Add expectations<br/>• Set run duration"]`
			`Build --> BuildDetails`
			`end`

			`subgraph Phase2["2. Deploy Phase"]`
			`Deploy[Provision Environment]`
			`DeployDetails["• Launch nodes<br/>• Wait for readiness<br/>• Establish connectivity<br/>• Return Runner"]`
			`Deploy --> DeployDetails`
			`end`

			`subgraph Phase3["3. Capture Phase"]`
			`Capture[Baseline Metrics]`
			`CaptureDetails["• Snapshot initial state<br/>• Start BlockFeed<br/>• Initialize expectations"]`
			`Capture --> CaptureDetails`
			`end`

			`subgraph Phase4["4. Execution Phase"]`
			`Execute[Drive Workloads]`
			`ExecuteDetails["• Submit transactions<br/>• Disperse DA blobs<br/>• Trigger chaos events<br/>• Run for duration"]`
			`Execute --> ExecuteDetails`
			`end`

			`subgraph Phase5["5. Evaluation Phase"]`
			`Evaluate[Check Expectations]`
			`EvaluateDetails["• Verify liveness<br/>• Check inclusion<br/>• Validate outcomes<br/>• Aggregate results"]`
			`Evaluate --> EvaluateDetails`
			`end`

			`subgraph Phase6["6. Cleanup Phase"]`
			`Cleanup[Teardown]`
			`CleanupDetails["• Stop nodes<br/>• Remove containers<br/>• Collect logs<br/>• Release resources"]`
			`Cleanup --> CleanupDetails`
			`end`

			`Phase1 --> Phase2`
			`Phase2 --> Phase3`
			`Phase3 --> Phase4`
			`Phase4 --> Phase5`
			`Phase5 --> Phase6`

			`style Phase1 fill:#e1f5ff`
			`style Phase2 fill:#fff4e1`
			`style Phase3 fill:#f0ffe1`
			`style Phase4 fill:#ffe1f5`
			`style Phase5 fill:#e1ffe1`
			`style Phase6 fill:#ffe1e1`
Initial import of Nomos testing framework 2025-12-01 12:48:39 +01:00			```
docs(book): update docs 2025-12-18 18:36:38 +01:00
			`## Phase Details`

			`### 1. Build the Plan`

			`Declare a topology, attach workloads and expectations, and set the run window. The plan is the single source of truth for what will happen.`

			`Key actions:`
			`- Define cluster shape (validators, executors, network topology)`
			`- Configure workloads (transaction rate, DA traffic, chaos patterns)`
			`- Attach expectations (liveness, inclusion, custom checks)`
			`- Set timing parameters (run duration, cooldown period)`

			Output: Immutable `Scenario` plan

			`### 2. Deploy`

			`Hand the plan to a deployer. It provisions the environment on the chosen backend, waits for nodes to signal readiness, and returns a runner.`

			`Key actions:`
			`- Provision infrastructure (processes, containers, or pods)`
			`- Launch validator and executor nodes`
			`- Wait for readiness probes (HTTP endpoints respond)`
			`- Establish node connectivity and metrics endpoints`
			`- Spawn BlockFeed for real-time block observation`

			Output: `Runner` + `RunContext` (with node clients, metrics, control handles)

			`### 3. Capture Baseline`

			`Expectations snapshot initial state before workloads begin.`

			`Key actions:`
			`- Record starting block height`
			`- Initialize counters and trackers`
			`- Subscribe to BlockFeed`
			`- Capture baseline metrics`

			`Output: Captured state for later comparison`

			`### 4. Drive Workloads`

			`The runner starts traffic and behaviors for the planned duration.`

			`Key actions:`
			`- Submit transactions at configured rates`
			`- Disperse and sample DA blobs`
			`- Trigger chaos events (node restarts, network partitions)`
			`- Run concurrently for the specified duration`
			`- Observe blocks and metrics in real-time`

			Duration: Controlled by `with_run_duration()`

			`### 5. Evaluate Expectations`

			`Once activity stops (and optional cooldown completes), the runner checks liveness and workload-specific outcomes.`

			`Key actions:`
			`- Verify consensus liveness (minimum block production)`
			`- Check transaction inclusion rates`
			`- Validate DA dispersal and sampling`
			`- Assess system recovery after chaos events`
			`- Aggregate pass/fail results`

			`Output: Success or detailed failure report`

			`### 6. Cleanup`

			`Tear down resources so successive runs start fresh and do not inherit leaked state.`

			`Key actions:`
			`- Stop all node processes/containers/pods`
			`- Remove temporary directories and volumes`
			- Collect and archive logs (if `NOMOS_TESTS_KEEP_LOGS=1`)
			`- Release ports and network resources`
			`- Cleanup observability stack (if spawned)`

			Guarantee: Runs even on panic via `CleanupGuard`