WormValidator Component Design

Purpose & Responsibilities

WormValidator is a standalone binary that provides integration testing capabilities for WormFS by embedding a single-node storage cluster and acting as a simulated FUSE client. Its responsibilities include:

  • Bootstrapping an embedded single-node storage cluster for testing
  • Acting as a FUSE client simulator (without requiring the kernel FUSE module)
  • Exercising all FilesystemService APIs through gRPC
  • Validating end-to-end system behavior from client perspective
  • Providing a reproducible integration-testing environment
  • Generating detailed test reports for debugging
  • Supporting progressive test scenario development
  • Enabling manual testing without multi-node VM setup

Architecture & Design

High-Level Architecture

┌─────────────────────────────────────────────────────────┐
│              WormValidator Binary                        │
├─────────────────────────────────────────────────────────┤
│                                                           │
│  ┌─────────────────────────────────────────────────┐   │
│  │     Embedded Storage Cluster                     │   │
│  │  ┌─────────────────────────────────────────┐   │   │
│  │  │         StorageNode                      │   │   │
│  │  │  - StorageRaftMember (single-node)      │   │   │
│  │  │  - FileStore (temp storage)             │   │   │
│  │  │  - MetadataStore (temp DB)              │   │   │
│  │  │  - StorageEndpoint (localhost gRPC)     │   │   │
│  │  │  - All other components                 │   │   │
│  │  └─────────────────────────────────────────┘   │   │
│  └─────────────────────────────────────────────────┘   │
│                         ↑                                │
│                         │ gRPC                           │
│                         ↓                                │
│  ┌─────────────────────────────────────────────────┐   │
│  │     FUSE Client Simulator                       │   │
│  │  - gRPC client to FilesystemService             │   │
│  │  - Simulates FUSE operations                    │   │
│  │  - Validates responses                          │   │
│  └─────────────────────────────────────────────────┘   │
│                         ↓                                │
│  ┌─────────────────────────────────────────────────┐   │
│  │     Test Scenario Runner                        │   │
│  │  - Orchestrates test scenarios                  │   │
│  │  - Collects metrics and results                 │   │
│  │  - Generates reports                            │   │
│  └─────────────────────────────────────────────────┘   │
└─────────────────────────────────────────────────────────┘

Component Breakdown

1. ClusterManager

Responsible for bootstrapping and managing the embedded storage cluster:

  • Initializes temporary directories for storage
  • Configures single-node Raft cluster
  • Starts StorageNode with all components
  • Manages cluster lifecycle (startup/shutdown)
  • Handles cleanup of temporary resources
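
The lifecycle above can be sketched with std only; in practice the tempfile crate (a listed dependency) would replace the hand-rolled paths. TempWorkspace and its directory layout are illustrative assumptions, not WormFS's real names:

```rust
use std::fs;
use std::path::PathBuf;

/// Hypothetical sketch of the workspace lifecycle ClusterManager owns.
struct TempWorkspace {
    root: PathBuf,
    keep_data: bool,
}

impl TempWorkspace {
    /// Create the directory tree the embedded cluster expects.
    fn create(root: PathBuf, keep_data: bool) -> std::io::Result<Self> {
        for sub in ["metadata", "filestore", "snapshots", "txlog"] {
            fs::create_dir_all(root.join(sub))?;
        }
        Ok(TempWorkspace { root, keep_data })
    }

    /// Honor --keep-data by skipping removal so failures can be inspected.
    fn cleanup(self) -> std::io::Result<()> {
        if !self.keep_data {
            fs::remove_dir_all(&self.root)?;
        }
        Ok(())
    }
}

fn main() -> std::io::Result<()> {
    let root = std::env::temp_dir().join("wormfs-validator-demo");
    let ws = TempWorkspace::create(root.clone(), false)?;
    assert!(root.join("filestore").is_dir());
    ws.cleanup()?;
    assert!(!root.exists());
    Ok(())
}
```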

2. FuseClientSimulator

Acts as a gRPC client that mimics FUSE filesystem operations:

  • Connects to localhost StorageEndpoint
  • Implements FUSE-like operation wrappers
  • Translates filesystem operations to gRPC calls
  • Validates response correctness
  • Maintains client-side state (open files, locks, etc.)
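
A minimal sketch of that client-side state, assuming illustrative FileHandle/FileId aliases (the real WormFS types may differ): every read/write/getattr wrapper must resolve its handle to the server-side id before issuing a gRPC call.

```rust
use std::collections::HashMap;

// Assumed aliases for illustration; real WormFS types may differ.
type FileHandle = u64;
type FileId = u128;

/// Sketch of the simulator's handle table: each open() hands back a
/// fresh handle mapped to the server-side file id.
#[derive(Default)]
struct HandleTable {
    next: FileHandle,
    open_files: HashMap<FileHandle, FileId>,
}

impl HandleTable {
    fn open(&mut self, id: FileId) -> FileHandle {
        self.next += 1;
        self.open_files.insert(self.next, id);
        self.next
    }

    fn resolve(&self, fh: FileHandle) -> Option<FileId> {
        self.open_files.get(&fh).copied()
    }

    fn close(&mut self, fh: FileHandle) -> bool {
        self.open_files.remove(&fh).is_some()
    }
}

fn main() {
    let mut table = HandleTable::default();
    let fh = table.open(42);
    assert_eq!(table.resolve(fh), Some(42));
    assert!(table.close(fh));
    assert_eq!(table.resolve(fh), None); // stale handles must fail cleanly
}
```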

3. TestScenarioRunner

Orchestrates execution of test scenarios:

  • Loads and executes test scenarios
  • Manages test dependencies and ordering
  • Collects timing and performance metrics
  • Handles test failures and retries
  • Generates structured test results
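
The failure-and-retry handling can be sketched as a small helper; run_with_retries and its return convention are assumptions for illustration, not the runner's actual API:

```rust
/// Sketch of the runner's retry policy: re-run a failed scenario up to
/// max_retries extra times, reporting how many attempts it took, or None
/// if every attempt failed.
fn run_with_retries(mut attempt: impl FnMut() -> bool, max_retries: u32) -> Option<u32> {
    for n in 1..=max_retries + 1 {
        if attempt() {
            return Some(n); // succeeded on attempt n
        }
    }
    None // exhausted all attempts
}

fn main() {
    let mut failures_left = 2;
    let attempts = run_with_retries(
        || {
            if failures_left > 0 {
                failures_left -= 1;
                false // simulate a flaky scenario
            } else {
                true
            }
        },
        3,
    );
    assert_eq!(attempts, Some(3));
}
```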

4. ValidationEngine

Verifies expected outcomes:

  • Compares actual vs expected results
  • Validates data integrity (checksums, content)
  • Checks metadata consistency
  • Verifies system state after operations
  • Reports validation errors with context
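
A sketch of the data-integrity step; FNV-1a stands in here for whatever checksum WormFS actually uses, and the point is the compare-and-report shape:

```rust
/// FNV-1a hash as an illustrative stand-in checksum.
fn fnv1a(data: &[u8]) -> u64 {
    let mut h: u64 = 0xcbf2_9ce4_8422_2325;
    for &b in data {
        h ^= b as u64;
        h = h.wrapping_mul(0x0000_0100_0000_01b3);
    }
    h
}

/// Compare what was read back against what was written, with enough
/// context in the error to debug a mismatch.
fn validate_content(expected: &[u8], actual: &[u8]) -> Result<(), String> {
    if fnv1a(expected) != fnv1a(actual) {
        return Err(format!(
            "content mismatch: wrote {} bytes, read {} bytes",
            expected.len(),
            actual.len()
        ));
    }
    Ok(())
}

fn main() {
    assert!(validate_content(b"hello", b"hello").is_ok());
    assert!(validate_content(b"hello", b"hell0").is_err());
}
```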

5. ReportGenerator

Creates detailed test reports:

  • Summarizes test execution results
  • Generates performance metrics
  • Creates detailed failure diagnostics
  • Supports multiple output formats (JSON, HTML, text)
  • Includes timing breakdowns and resource usage
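
A hand-rolled sketch of the JSON summary shape; the real implementation would derive serde::Serialize on TestResults (serde is already a listed dependency) rather than format strings by hand:

```rust
/// Minimal sketch of the JSON report's summary object.
fn json_summary(total: usize, passed: usize, failed: usize, skipped: usize) -> String {
    format!(
        "{{\"total\":{},\"passed\":{},\"failed\":{},\"skipped\":{}}}",
        total, passed, failed, skipped
    )
}

fn main() {
    let report = json_summary(10, 8, 1, 1);
    assert_eq!(report, "{\"total\":10,\"passed\":8,\"failed\":1,\"skipped\":1}");
}
```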

Test Scenarios

Category 1: Basic File Operations

Scenario: Create-Read-Write-Delete

  1. Create a new file
  2. Write data to file
  3. Read data back and verify
  4. Delete file
  5. Verify file no longer exists

Scenario: Large File Handling

  1. Create file and write 1GB of data
  2. Verify stripe creation and distribution
  3. Read entire file back
  4. Verify data integrity with checksums
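
The stripe-count verification in step 2 reduces to ceiling division; the 64 MiB stripe size below is an assumed value for illustration, not WormFS's configured size:

```rust
/// Expected number of stripes for a file of the given size.
fn expected_stripes(file_size: u64, stripe_size: u64) -> u64 {
    (file_size + stripe_size - 1) / stripe_size // ceiling division
}

fn main() {
    const MIB: u64 = 1024 * 1024;
    // A 1 GiB file at 64 MiB per stripe should produce exactly 16 stripes.
    assert_eq!(expected_stripes(1024 * MIB, 64 * MIB), 16);
    // A partial final stripe still counts as a stripe.
    assert_eq!(expected_stripes(65 * MIB, 64 * MIB), 2);
}
```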

Scenario: Concurrent File Access

  1. Create file
  2. Simulate multiple concurrent reads
  3. Verify all reads succeed
  4. Verify data consistency across reads

Category 2: Directory Operations

Scenario: Directory Tree Creation

  1. Create nested directory structure
  2. List directories at each level
  3. Verify hierarchy integrity

Scenario: Directory Listing

  1. Create directory with multiple files
  2. List directory contents
  3. Verify all entries present
  4. Verify metadata accuracy

Category 3: Metadata Operations

Scenario: Metadata Updates

  1. Create file with initial metadata
  2. Update permissions (chmod)
  3. Update ownership (chown)
  4. Verify metadata changes persist

Scenario: File Stats

  1. Create file and write data
  2. Get file stats (getattr)
  3. Verify size, timestamps, permissions
  4. Modify file and verify stat updates

Category 4: Lock Operations

Scenario: Exclusive Lock

  1. Create file
  2. Acquire write lock
  3. Attempt second lock (should fail)
  4. Release lock
  5. Verify second lock now succeeds

Scenario: Shared Locks

  1. Create file
  2. Acquire multiple read locks
  3. Verify all succeed
  4. Attempt write lock (should fail)
  5. Release read locks
  6. Verify write lock succeeds
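
Both lock scenarios above exercise a single compatibility rule, sketched here; LockType mirrors the interface section below, and can_grant is a hypothetical helper, not the server's actual lock manager:

```rust
#[derive(Clone, Copy, PartialEq)]
enum LockType {
    Read,
    Write,
}

/// Compatibility rule: any number of readers may coexist, but a writer
/// requires exclusivity.
fn can_grant(held: &[LockType], requested: LockType) -> bool {
    match requested {
        LockType::Read => !held.contains(&LockType::Write),
        LockType::Write => held.is_empty(),
    }
}

fn main() {
    // Multiple read locks coexist.
    assert!(can_grant(&[LockType::Read, LockType::Read], LockType::Read));
    // A write lock is refused while readers hold the file.
    assert!(!can_grant(&[LockType::Read], LockType::Write));
    // Once all locks are released, the write lock succeeds.
    assert!(can_grant(&[], LockType::Write));
}
```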

Scenario: Lock Expiration

  1. Acquire lock with short timeout
  2. Wait for expiration
  3. Verify lock automatically released
  4. Acquire new lock successfully
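
The expiration check can be sketched with std::time; Lease and its fields are illustrative, not WormFS's actual lock records:

```rust
use std::time::{Duration, Instant};

/// Illustrative server-side lease record.
struct Lease {
    acquired: Instant,
    ttl: Duration,
}

/// A lease past its TTL should be treated as released.
fn is_expired(lease: &Lease, now: Instant) -> bool {
    now.duration_since(lease.acquired) >= lease.ttl
}

fn main() {
    let acquired = Instant::now();
    let lease = Lease { acquired, ttl: Duration::from_millis(50) };
    // Fresh lease: still held.
    assert!(!is_expired(&lease, acquired));
    // Past the TTL: a new lock acquisition should now succeed.
    assert!(is_expired(&lease, acquired + Duration::from_millis(60)));
}
```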

Category 5: Stripe Operations

Scenario: Direct Stripe Read/Write

  1. Write data to specific stripe
  2. Read stripe back
  3. Verify data integrity
  4. Test stripe-level erasure coding

Scenario: Stripe Distribution

  1. Write large file
  2. Verify stripe creation
  3. Check chunk distribution across disks
  4. Verify redundancy requirements met
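
The redundancy check in step 4 is simple arithmetic for a k-data + m-parity erasure code; the 4+2 scheme below is an assumption for illustration, not WormFS's configured scheme:

```rust
/// Any m chunks of a (k, m) stripe can be lost and reconstructed,
/// provided each chunk lands on a distinct disk.
fn max_tolerable_failures(parity_chunks: usize) -> usize {
    parity_chunks
}

/// Raw-storage multiplier paid for that redundancy.
fn storage_overhead(data_chunks: usize, parity_chunks: usize) -> f64 {
    (data_chunks + parity_chunks) as f64 / data_chunks as f64
}

fn main() {
    // A 4+2 stripe survives any 2 chunk losses at 1.5x raw storage.
    assert_eq!(max_tolerable_failures(2), 2);
    assert!((storage_overhead(4, 2) - 1.5).abs() < 1e-9);
}
```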

Category 6: Error Handling

Scenario: Invalid Operations

  1. Attempt to read non-existent file
  2. Attempt to delete open file
  3. Attempt to write to read-locked file
  4. Verify appropriate error responses

Scenario: Resource Exhaustion

  1. Fill disk to capacity
  2. Attempt write operations
  3. Verify graceful failure
  4. Cleanup and verify recovery

Category 7: Consistency Validation

Scenario: Metadata Consistency

  1. Perform various operations
  2. Query metadata store directly
  3. Verify consistency with filesystem view

Scenario: Snapshot and Recovery

  1. Perform file operations
  2. Trigger snapshot
  3. Verify snapshot contents
  4. Simulate recovery from snapshot

Interfaces

Public API (CLI)

# Run all test scenarios
wormfs-validator

# Run specific scenarios
wormfs-validator --scenarios basic,locks

# Use custom temp directory
wormfs-validator --temp-dir /tmp/wormfs-test

# Keep data after tests for inspection
wormfs-validator --keep-data

# Generate detailed report
wormfs-validator --report /tmp/report.html

# Verbose logging
wormfs-validator --verbose

# Run performance benchmarks
wormfs-validator --benchmark

Rust Implementation

pub struct WormValidator {
    config: ValidatorConfig,
    cluster_manager: ClusterManager,
    client_simulator: FuseClientSimulator,
    scenario_runner: TestScenarioRunner,
}

impl WormValidator {
    pub fn new(config: ValidatorConfig) -> Result<Self, ValidatorError>;
    
    pub async fn run_all_tests(&mut self) -> TestResults;
    
    pub async fn run_scenarios(&mut self, scenarios: &[String]) -> TestResults;
    
    pub async fn cleanup(&mut self) -> Result<(), ValidatorError>;
}

pub struct ClusterManager {
    temp_dir: PathBuf,
    storage_node: Option<Arc<StorageNode>>,
    endpoint_address: SocketAddr,
}

impl ClusterManager {
    pub async fn start(&mut self) -> Result<(), ValidatorError>;
    
    pub async fn stop(&mut self) -> Result<(), ValidatorError>;
    
    pub fn endpoint_address(&self) -> SocketAddr;
}

pub struct FuseClientSimulator {
    grpc_client: FilesystemServiceClient<Channel>,
    open_files: HashMap<FileHandle, FileId>,
    locks: HashMap<LockId, FileId>,
}

impl FuseClientSimulator {
    pub async fn connect(endpoint: SocketAddr) -> Result<Self, ValidatorError>;
    
    // FUSE-like operations
    pub async fn create_file(&mut self, path: &str, mode: u32) -> Result<FileHandle, ValidatorError>;
    
    pub async fn read_file(&mut self, fh: FileHandle, offset: u64, size: u32) -> Result<Vec<u8>, ValidatorError>;
    
    pub async fn write_file(&mut self, fh: FileHandle, offset: u64, data: &[u8]) -> Result<u64, ValidatorError>;
    
    pub async fn delete_file(&mut self, fh: FileHandle) -> Result<(), ValidatorError>;
    
    pub async fn get_attr(&mut self, fh: FileHandle) -> Result<FileAttr, ValidatorError>;
    
    pub async fn set_attr(&mut self, fh: FileHandle, attr: FileAttr) -> Result<(), ValidatorError>;
    
    pub async fn mkdir(&mut self, path: &str, mode: u32) -> Result<(), ValidatorError>;
    
    pub async fn readdir(&mut self, dir: FileHandle) -> Result<Vec<DirEntry>, ValidatorError>;
    
    pub async fn acquire_lock(&mut self, fh: FileHandle, lock_type: LockType) -> Result<LockId, ValidatorError>;
    
    pub async fn release_lock(&mut self, lock_id: LockId) -> Result<(), ValidatorError>;
}

pub struct TestScenarioRunner {
    scenarios: Vec<Box<dyn TestScenario>>,
    results: Vec<ScenarioResult>,
}

impl TestScenarioRunner {
    pub fn load_scenarios(&mut self, filter: Option<&[String]>);
    
    pub async fn run_scenarios(&mut self, client: &mut FuseClientSimulator) -> TestResults;
}

pub trait TestScenario: Send + Sync {
    fn name(&self) -> &str;
    
    fn category(&self) -> &str;
    
    async fn execute(&self, client: &mut FuseClientSimulator) -> ScenarioResult;
}

Data Structures

pub struct ValidatorConfig {
    pub temp_dir: PathBuf,
    pub verbose: bool,
    pub keep_data: bool,
    pub scenarios: Option<Vec<String>>,
    pub report_path: Option<PathBuf>,
    pub benchmark_mode: bool,
}

pub struct TestResults {
    pub total_scenarios: usize,
    pub passed: usize,
    pub failed: usize,
    pub skipped: usize,
    pub duration: Duration,
    pub scenario_results: Vec<ScenarioResult>,
}

pub struct ScenarioResult {
    pub name: String,
    pub category: String,
    pub status: TestStatus,
    pub duration: Duration,
    pub error: Option<String>,
    pub metrics: HashMap<String, f64>,
}

pub enum TestStatus {
    Passed,
    Failed,
    Skipped,
}
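
Aggregating scenario outcomes into the TestResults counters is a straightforward fold, sketched here over the enum above; tally is a hypothetical helper:

```rust
// Mirrors the TestStatus enum from the design above.
enum TestStatus {
    Passed,
    Failed,
    Skipped,
}

/// Fold individual scenario outcomes into (passed, failed, skipped).
fn tally(statuses: &[TestStatus]) -> (usize, usize, usize) {
    let (mut passed, mut failed, mut skipped) = (0, 0, 0);
    for status in statuses {
        match status {
            TestStatus::Passed => passed += 1,
            TestStatus::Failed => failed += 1,
            TestStatus::Skipped => skipped += 1,
        }
    }
    (passed, failed, skipped)
}

fn main() {
    use TestStatus::*;
    let results = [Passed, Passed, Failed, Skipped];
    assert_eq!(tally(&results), (2, 1, 1));
}
```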

#[derive(Debug, thiserror::Error)]
pub enum ValidatorError {
    #[error("Cluster startup failed: {0}")]
    ClusterStartupFailed(String),
    
    #[error("Client connection failed: {0}")]
    ClientConnectionFailed(String),
    
    #[error("Test scenario failed: {0}")]
    TestScenarioFailed(String),
    
    #[error("Configuration error: {0}")]
    ConfigError(String),
    
    #[error("I/O error: {0}")]
    IoError(#[from] std::io::Error),
}

Dependencies

Direct Dependencies

  • StorageNode: Full embedded storage cluster
  • FilesystemService gRPC client: For client simulation
  • tonic: gRPC framework
  • tokio: Async runtime

External Dependencies

  • clap: CLI argument parsing
  • serde: Configuration serialization
  • tracing: Logging framework
  • tempfile: Temporary directory management
  • uuid: Test data generation

Configuration

Default Configuration

[validator]
temp_dir = "/tmp/wormfs-validator"
verbose = false
keep_data = false
benchmark_mode = false

[validator.cluster]
raft_heartbeat_ms = 100
metadata_store_path = "{temp_dir}/metadata.db"
file_store_path = "{temp_dir}/filestore"
snapshot_store_path = "{temp_dir}/snapshots"
transaction_log_path = "{temp_dir}/txlog"

[validator.client]
endpoint = "127.0.0.1:7000"
timeout_secs = 30
max_retries = 3

[validator.scenarios]
# Enable/disable scenario categories
basic_file_ops = true
directory_ops = true
metadata_ops = true
lock_ops = true
stripe_ops = true
error_handling = true
consistency = true
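
A sketch of how these toggles combine with the --scenarios CLI filter: with no filter, every enabled category runs; with a filter, only the listed categories run. should_run is a hypothetical helper:

```rust
/// Decide whether a scenario category runs, given its config toggle and
/// the optional --scenarios filter from the CLI.
fn should_run(category: &str, enabled: bool, filter: Option<&[&str]>) -> bool {
    if !enabled {
        return false; // disabled in config, never runs
    }
    match filter {
        None => true,
        Some(list) => list.contains(&category),
    }
}

fn main() {
    assert!(should_run("basic_file_ops", true, None));
    assert!(should_run("lock_ops", true, Some(&["basic_file_ops", "lock_ops"])));
    assert!(!should_run("stripe_ops", true, Some(&["basic_file_ops"])));
    assert!(!should_run("consistency", false, None));
}
```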

Testing Strategy

Validator Self-Testing

While WormValidator is primarily a testing tool, it should have its own test suite:

Unit Tests

  • Test scenario parsing and loading
  • Test result aggregation logic
  • Test report generation
  • Test configuration parsing

Integration Tests

  • Test that validator can start/stop cluster
  • Test that client simulator can connect
  • Test basic scenario execution
  • Test cleanup functionality

Usage Examples

Basic Usage

# Run all tests
wormfs-validator

# Run with verbose output
wormfs-validator --verbose

# Keep test data for inspection
wormfs-validator --keep-data --temp-dir /tmp/wormfs-debug

Selective Testing

# Run only basic file operation tests
wormfs-validator --scenarios basic

# Run multiple specific categories
wormfs-validator --scenarios basic,locks,metadata

Development Workflow

# During feature development, run relevant tests
cargo build && wormfs-validator --scenarios stripe_ops --verbose

# After changes, run full test suite
cargo build && wormfs-validator --report /tmp/report.html

# Debug a specific failure
wormfs-validator --scenarios consistency --keep-data --verbose

CI/CD Integration

# In CI pipeline
cargo build --release
./target/release/wormfs-validator --report /tmp/validator-report.json
if [ $? -ne 0 ]; then
    echo "WormValidator tests failed"
    exit 1
fi

Open Questions

  1. Scenario Definition Format: Should test scenarios be defined in code only, or should we support external scenario definitions (e.g., YAML/JSON)? Answer: Code only for now.

  2. Performance Benchmarking: Should we include performance benchmarks as part of the standard test suite, or keep them separate? Answer: Not at the moment. We will handle benchmarks using native Rust bench tooling (e.g., cargo bench).

  3. Multi-Node Testing: Should we eventually support multi-node embedded clusters for more comprehensive testing? Answer: Yes, we will eventually expand this to support multi-node clusters.

  4. Failure Injection: Should we add capabilities to inject failures (network, disk, etc.) for chaos testing? Answer: Not at the moment.

  5. Test Data Generation: Should we include utilities for generating realistic test data (file trees, workload patterns)? Answer: Not at the moment.

  6. Continuous Validation: Should validator support long-running modes that continuously exercise the system? Answer: Not at the moment.

  7. Scenario Prioritization: Should scenarios have priority levels to enable quick smoke tests vs full validation? Answer: Not at the moment.

  8. Result Comparison: Should we support comparing test results across runs to detect regressions? Answer: Not at the moment.