# Operations & Deployment Overview

Operational readiness comes down to three things: prerequisites in place, a deployment target that fits your environment, and clear readiness signals. Together they keep test scenarios reliable across deployment targets.

## Core Principles

- **Prerequisites First**: Ensure all required files, binaries, and assets are in place before attempting to run scenarios
- **Environment Fit**: Choose the right deployment target (host, compose, k8s) based on your isolation, reproducibility, and resource needs
- **Clear Signals**: Confirm that the runner reports node readiness before workloads start, so that a node still booting is not misread as a scenario failure
- **Failure Triage**: Map failures to specific causes—missing prerequisites, platform issues, or unmet expectations

## Key Operational Concerns

**Prerequisites:**
- `versions.env` file at repository root (required by helper scripts)
- Node binaries (`logos-blockchain-node`) available or built on demand
- Platform requirements met (Docker for compose, cluster access for k8s)
- Circuit assets for proof generation
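
The prerequisites above lend themselves to a quick preflight check before any scenario runs. The following is a minimal sketch, not part of the framework: the function name `check_prerequisites` is hypothetical, and treating `docker` and `kubectl` on `PATH` as a stand-in for "Docker for compose, cluster access for k8s" is an assumption.

```python
import shutil
from pathlib import Path


def check_prerequisites(repo_root: Path, runner: str) -> list[str]:
    """Return a list of human-readable problems; an empty list means ready."""
    problems = []

    # versions.env is expected at the repository root by the helper scripts.
    if not (repo_root / "versions.env").is_file():
        problems.append(f"versions.env not found in {repo_root}")

    # Assumption: one CLI tool on PATH approximates each platform requirement.
    required_tool = {"compose": "docker", "k8s": "kubectl"}.get(runner)
    if required_tool and shutil.which(required_tool) is None:
        problems.append(f"{required_tool} not found on PATH (needed for {runner})")

    return problems
```

Returning a list rather than raising on the first problem lets a single run report everything that is missing.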

**Artifacts:**
- Circuit parameters required by the node binary
- Docker images for compose/k8s deployments
- Binary bundles for reproducible builds

**Environment Configuration:**
- Logging configured via `LOGOS_BLOCKCHAIN_LOG_*` variables
- Observability endpoints (Prometheus, Grafana) optional but useful
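
When a run logs differently than expected, it can help to list which logging variables are actually set. This is a small sketch under assumptions: the helper `logging_overrides` is hypothetical, and it only filters by the `LOGOS_BLOCKCHAIN_LOG_` prefix without knowing which specific variables the framework reads.

```python
import os


def logging_overrides(environ=os.environ, prefix="LOGOS_BLOCKCHAIN_LOG_"):
    """Collect every variable in `environ` whose name starts with `prefix`."""
    return {
        name: value
        for name, value in sorted(environ.items())
        if name.startswith(prefix)
    }


if __name__ == "__main__":
    # Print the overrides visible to this process, one per line.
    for name, value in logging_overrides().items():
        print(f"{name}={value}")
```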

**Readiness & Health:**
- Runners verify node readiness before starting workloads
- Health checks prevent premature workload execution
- Consensus liveness expectations validate basic operation
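
The readiness gate described above can be pictured as a polling loop with a deadline. The sketch below is illustrative only and does not reflect how the runners implement it: `wait_until_ready`, the `is_ready` probe, and the timeout defaults are all assumptions.

```python
import time
from typing import Callable, Iterable


def wait_until_ready(
    nodes: Iterable[str],
    is_ready: Callable[[str], bool],
    timeout_s: float = 60.0,
    poll_interval_s: float = 1.0,
) -> None:
    """Block until every node reports ready, or raise TimeoutError."""
    pending = set(nodes)
    deadline = time.monotonic() + timeout_s
    while pending:
        # Drop nodes that now report ready; keep polling the rest.
        pending = {node for node in pending if not is_ready(node)}
        if not pending:
            return
        if time.monotonic() >= deadline:
            raise TimeoutError(
                f"nodes not ready after {timeout_s}s: {sorted(pending)}"
            )
        time.sleep(poll_interval_s)
```

Workloads would start only after this call returns, so a timeout surfaces as a readiness failure rather than as a failed expectation.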

## Runner-Agnostic Design

The framework is intentionally **runner-agnostic**: the same scenario plan runs across all deployment targets. Understanding which operational concerns apply to each runner helps you choose the right fit.

| Concern | Host | Compose | Kubernetes |
|---------|------|---------|------------|
| **Topology** | Full support | Full support | Full support |
| **Workloads** | All workloads | All workloads | All workloads |
| **Expectations** | All expectations | All expectations | All expectations |
| **Chaos / Node Control** | Not supported | Supported | Not yet supported |
| **Metrics / Observability** | Manual setup | External stack | Cluster-wide |
| **Log Collection** | Temp files | Container logs | Pod logs |
| **Isolation** | Process-level | Container | Pod + namespace |
| **Setup Time** | < 1 min | 2-5 min | 5-10 min |
| **CI Recommended?** | Smoke tests | Primary | Large-scale only |

**Key insight:** Operational concerns (prerequisites, environment variables) are largely **consistent** across runners, while deployment-specific concerns (isolation, chaos support) vary by backend.
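
One way to use the comparison table is to encode it as data and filter by requirement. The following sketch transcribes only the chaos and isolation rows; the `RUNNERS` dictionary and `runners_supporting` function are hypothetical names, not framework APIs.

```python
# Capabilities transcribed from the comparison table; update if the table changes.
RUNNERS = {
    "host": {"chaos": False, "isolation": "process"},
    "compose": {"chaos": True, "isolation": "container"},
    "k8s": {"chaos": False, "isolation": "pod + namespace"},
}


def runners_supporting(needs_chaos: bool) -> list[str]:
    """Return runner names that satisfy the chaos requirement, in table order."""
    return [
        name
        for name, caps in RUNNERS.items()
        if caps["chaos"] or not needs_chaos
    ]


if __name__ == "__main__":
    print(runners_supporting(needs_chaos=True))
```

With the table as it stands, a scenario that needs chaos or node control is limited to the compose runner.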

## Operational Workflow

```mermaid
flowchart LR
    Setup[Prerequisites & Setup] --> Run[Run Scenarios]
    Run --> Monitor[Monitor & Observe]
    Monitor --> Debug{Success?}
    Debug -->|No| Triage[Failure Triage]
    Triage --> Setup
    Debug -->|Yes| Done[Complete]
```

1. **Setup**: Verify prerequisites, configure environment, prepare assets
2. **Run**: Execute scenarios using the appropriate runner (host/compose/k8s)
3. **Monitor**: Collect logs, metrics, and observability signals
4. **Triage**: When a run fails, map the failure to its root cause, fix it, and return to setup
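
The loop in the flowchart can be sketched as a bounded retry. This is an illustration under assumptions rather than framework code: `run_until_success` is a hypothetical helper, `run` is assumed to cover both running and monitoring and to return `True` on success, and the attempt limit is arbitrary.

```python
from typing import Callable


def run_until_success(
    setup: Callable[[], None],
    run: Callable[[], bool],
    triage: Callable[[], None],
    max_attempts: int = 3,
) -> int:
    """Repeat setup -> run -> triage until `run` succeeds; return the attempt count."""
    for attempt in range(1, max_attempts + 1):
        setup()
        if run():
            return attempt
        # A failed run feeds back into setup through triage, as in the flowchart.
        triage()
    raise RuntimeError(f"scenario still failing after {max_attempts} attempts")
```

Bounding the attempts keeps a persistent failure visible instead of retrying indefinitely.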

## Documentation Structure

This Operations & Deployment section covers:

- [Prerequisites & Setup](prerequisites.md) — Required files, binaries, and environment setup
- [Running Examples](running-examples.md) — How to run scenarios across different runners
- [CI Integration](ci-integration.md) — Automating tests in continuous integration pipelines
- [Environment Variables](environment-variables.md) — Complete reference of configuration variables
- [Logging & Observability](logging-observability.md) — Log collection, metrics, and debugging

**Philosophy:** Treat operational hygiene—assets present, prerequisites satisfied, observability reachable—as the first step to reliable scenario outcomes.