Reproducible experiments for the Push0 paper evaluation section. These experiments measure:

- **Latency Overhead**: Dispatcher and collector latency at varying load
- **Scalability**: Throughput vs dispatcher count, queue depth impact
- **Fault Tolerance**: Recovery from crashes, task loss verification
## Quick Start

```bash
# Setup (one-time)
make setup

# Run all experiments for paper
make run-all-experiments

# Or run individual experiments
make latency
make scalability
make fault-tolerance
```

## Prerequisites

- Docker and Docker Compose
- Python 3.9+
- GitHub credentials (for building orchestrator image)
```bash
export GITHUB_USER=your-username
export GITHUB_TOKEN=your-token
```

## Latency Experiments

Measures orchestration overhead by timing:
- Dispatch latency: Task enqueue → dispatcher completion
- Collection latency: Result publication → aggregation
- End-to-end latency: Full pipeline
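As a sketch of how raw per-task timings become the reported statistics, the helper below computes the same fields that appear in the results JSON (`p50_ms`, `p95_ms`, `p99_ms`, `mean_ms`, `stddev_ms`). It is illustrative only and is not the actual code in `scripts/utils.py`:

```python
import statistics


def latency_stats(samples_ms):
    """Summarize raw per-task latencies (in ms) in the shape used by
    the results JSON. Percentiles use the nearest-rank method."""
    ordered = sorted(samples_ms)

    def pct(p):
        # Nearest-rank percentile over the sorted samples.
        idx = min(len(ordered) - 1, max(0, round(p / 100 * len(ordered)) - 1))
        return ordered[idx]

    return {
        "count": len(ordered),
        "p50_ms": pct(50),
        "p95_ms": pct(95),
        "p99_ms": pct(99),
        "mean_ms": statistics.mean(ordered),
        "stddev_ms": statistics.stdev(ordered) if len(ordered) > 1 else 0.0,
    }


stats = latency_stats([4.0, 5.0, 6.0, 7.0])
```

The same summary is produced independently for dispatch, collection, and end-to-end samples.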
```bash
# Basic latency test
make latency NUM_TASKS=1000

# Test at varying injection rates (for CDF plot)
make latency-vary-rate NUM_TASKS=500
```

Output:

- `results/latency_*.json` - Raw latency data with P50/P95/P99 stats
- CDF data for plotting latency distributions
## Scalability Experiments

Measures throughput scaling with dispatcher instances.
```bash
# Test 1, 2, 4, 8 dispatchers
make scalability DISPATCHER_COUNTS=1,2,4,8 NUM_TASKS=10000

# Test queue depth impact on NATS
make scalability-queue-depth
```

Output:

- `results/scalability_*.json` - Throughput and memory per dispatcher count
- Scaling efficiency calculations
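The scaling-efficiency figure can be derived from the measured throughputs as the ratio of observed throughput to perfect linear scaling from the single-dispatcher baseline. This is a sketch with hypothetical numbers; the actual computation lives in `scripts/scalability_experiment.py` and may differ:

```python
def scaling_efficiency(throughputs):
    """Map {dispatcher_count: throughput (tasks/s)} to efficiency
    relative to perfect linear scaling from the 1-dispatcher baseline."""
    baseline = throughputs[1]  # single-dispatcher throughput
    return {
        n: tput / (n * baseline)
        for n, tput in sorted(throughputs.items())
    }


# Hypothetical measurements for 1, 2, 4, 8 dispatchers:
eff = scaling_efficiency({1: 150.0, 2: 290.0, 4: 540.0, 8: 960.0})
```

An efficiency of 1.0 means perfect linear scaling; values below 1.0 quantify coordination overhead as dispatchers are added.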
## Fault Tolerance Experiments

Validates zero task loss under failure conditions.
```bash
# Dispatcher crash at 1%, 5%, 10% completion
make fault-dispatcher CRASH_RATES=1,5,10 NUM_TASKS=1000

# Collector crash mid-aggregation
make fault-collector NUM_TASKS=1000

# NATS network partition (30s)
make fault-partition

# Compare ACK timeout settings (10s, 30s, 60s)
make fault-ack-timeout
```

Output:

- `results/fault_tolerance_*.json` - Recovery times, task loss counts
- Baseline comparison for overhead calculation
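Task-loss verification amounts to set-comparing submitted and collected task IDs after the fault-injection run. The helper below is a minimal hypothetical sketch, not the actual logic in `scripts/fault_tolerance_experiment.py`:

```python
def verify_no_task_loss(submitted_ids, collected_ids):
    """Return (lost, duplicated) task IDs after a fault-injection run.
    Zero loss means every submitted task was collected at least once;
    tasks redelivered after an ACK timeout show up as duplicates."""
    lost = set(submitted_ids) - set(collected_ids)
    seen, duplicated = set(), set()
    for task_id in collected_ids:
        if task_id in seen:
            duplicated.add(task_id)
        seen.add(task_id)
    return lost, duplicated


# A crashed dispatcher's task redelivered once, nothing lost:
lost, dup = verify_no_task_loss(["t1", "t2", "t3"], ["t1", "t2", "t2", "t3"])
```

Distinguishing losses from duplicates matters here: at-least-once delivery trades duplicates for zero loss, so duplicates are expected after a crash while any lost ID is a failure.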
## Results Format

All results are saved as JSON in `results/`:
```json
{
  "experiment_type": "latency",
  "config": {...},
  "dispatch_stats": {
    "count": 1000,
    "p50_ms": 4.5,
    "p95_ms": 8.2,
    "p99_ms": 12.1,
    "mean_ms": 5.1,
    "stddev_ms": 2.3
  },
  "throughput_tasks_per_sec": 150.0,
  "timestamp": "2024-01-15T10:30:00"
}
```

## Directory Structure

```
experiments/
├── docker-compose.experiments.yml    # Experiment infrastructure
├── Makefile                          # Easy experiment execution
├── configs/
│   └── prometheus.yml                # Metrics collection
├── scripts/
│   ├── utils.py                      # Shared utilities
│   ├── latency_experiment.py         # Latency measurement
│   ├── scalability_experiment.py     # Scalability testing
│   └── fault_tolerance_experiment.py # Fault injection
└── results/                          # Experiment outputs (JSON)
```
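The JSON files in `results/` can be aggregated for plotting with a few lines of Python. This is a sketch; the field names follow the results format shown above, and `summarize_results` is a hypothetical helper, not part of `scripts/utils.py`:

```python
import glob
import json


def summarize_results(results_dir="results"):
    """Collect headline numbers from every experiment JSON in results/."""
    rows = []
    for path in sorted(glob.glob(f"{results_dir}/*.json")):
        with open(path) as f:
            data = json.load(f)
        stats = data.get("dispatch_stats", {})
        rows.append((
            data.get("experiment_type", "?"),
            stats.get("p99_ms"),
            data.get("throughput_tasks_per_sec"),
        ))
    return rows
```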
## Implementation Notes

- **Echo Executor Mode**: Uses `--features scroll-executor-echo` to simulate prover execution without real proving, which isolates orchestration overhead.
- **Memory-backed NATS**: Uses in-memory storage for faster experiments; production uses file-backed storage.
- **Reproducibility**: All experiments can be run with `make run-all-experiments` for consistent paper results.
- **Prometheus Integration**: Metrics are scraped at 500ms intervals for fine-grained latency data.
## Monitoring

```bash
# View live logs
make logs

# Container status
make status

# NATS metrics
make nats-info

# Prometheus UI (http://localhost:9091)
make prometheus
```

## Cleanup

```bash
# Stop containers
make down

# Full cleanup (containers, volumes)
make clean

# Clear results only
make clean-results
```

## Troubleshooting

NATS not ready:

```bash
make setup-infra
```

Build failures:

```bash
# Ensure GitHub credentials are set
export GITHUB_USER=xxx
export GITHUB_TOKEN=xxx
make build
```

Python dependencies:

```bash
make setup-venv
```