USAGE GUIDE

Voice Manipulation Detection Pipeline

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Quick Start

Installation

cd /home/john/voice
pip install -r requirements.txt

Basic Usage

Option 1: Text User Interface (TUI) - Recommended

# Interactive mode with menu
python tui.py interactive

# Analyze single file
python tui.py analyze sample.wav

# Batch analysis
python tui.py batch ./audio_samples/ -o ./results

# With custom options
python tui.py analyze sample.wav --output-dir ./my_results --no-viz

Option 2: Python API

from pipeline import VoiceManipulationDetector

detector = VoiceManipulationDetector()
report = detector.analyze('sample.wav', output_dir='results/')

print(f"Manipulation Detected: {report['ALTERATION_DETECTED']}")
print(f"Confidence: {report['CONFIDENCE']}")

Option 3: Command Line

# Single file
python pipeline.py sample.wav

# Batch mode
python pipeline.py ./audio_directory --batch

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Features

1. Multi-Phase Analysis

The pipeline executes 4 phases:

PHASE 1: Baseline F0 Analysis (isolates presented pitch)
PHASE 2: Vocal Tract Analysis (extracts formants - physical characteristics)
PHASE 3: Artifact Detection (3 independent methods)
- Pitch-Formant Incoherence
- Mel Spectrogram Artifacts
- Phase Decoherence / Transient Smearing
PHASE 4: Report Synthesis (generates verified output)

2. Verifiable Outputs

All reports include:

SHA-256 checksums of audio file
Cryptographic signatures for tamper detection
Chain of custody metadata
Timestamp and pipeline version

3. Multiple Output Formats

JSON: Machine-readable detailed report
Markdown: Human-readable formatted report
Visualizations: PNG plots showing analysis results

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Output Structure

For each analyzed file sample.wav:

results/sample/
├── sample_report.json              # Detailed JSON report with verification
├── sample_report.md                # Markdown formatted report
├── sample_overview.png             # Comprehensive overview plot
├── sample_mel_spectrogram.png      # Mel spectrogram artifact analysis
├── sample_phase_analysis.png       # Phase coherence plot
└── sample_pitch_formant_comparison.png  # Pitch-formant comparison

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Understanding Reports

Key Metrics

{
  "ASSET_ID": "sample_001",
  "ALTERATION_DETECTED": true,
  "CONFIDENCE": "99% (Very High)",

  "PRESENTED_AS": "Female",          // Based on F0 (pitch)
  "PROBABLE_SEX": "Male",            // Based on formants (physical)

  "DECEPTION_BASELINE_F0": "221.5 Hz (Median)",
  "PHYSICAL_BASELINE_FORMANTS": "F1: 498 Hz, F2: 1510 Hz, F3: 2490 Hz"
}

Evidence Vectors

Three independent detection methods:

Pitch-Formant Incoherence: Mismatch between presented pitch and physical characteristics
Time Manipulation: Phase artifacts from time-stretching
Spectral Artifacts: Unnatural harmonics or consistent noise floor

Confidence Levels

99% (Very High): All 3 detection methods triggered
85% (High): 2 detection methods triggered
60-75% (Medium): 1 detection method triggered
0% (Low): No manipulation detected

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Advanced Usage

Verification System

Verify report integrity:

from verification import OutputVerifier

verifier = OutputVerifier()
result = verifier.verify_report('results/sample_report.json')

if result['valid']:
    print(f"✓ Report verified - created {result['timestamp']}")
else:
    print(f"✗ Verification failed: {result['error']}")

Batch Processing

detector = VoiceManipulationDetector()
reports = detector.batch_analyze(
    audio_dir='./samples',
    output_dir='./batch_results',
    pattern='*.wav'
)

# Summary statistics
manipulated = sum(1 for r in reports if r['ALTERATION_DETECTED'])
print(f"Detected manipulation in {manipulated}/{len(reports)} files")

Export to CSV

from verification import ReportExporter

exporter = ReportExporter()
exporter.export_csv_summary(reports, 'summary.csv')

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Testing

Run comprehensive test suite:

python test_pipeline.py

This will:

Generate 6 synthetic test samples (clean + manipulated)
Analyze each sample
Verify detection accuracy
Test verification system
Generate full reports and visualizations

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Examples

Example 1: Quick Analysis

python tui.py analyze suspicious_call.wav

Example 2: Batch Analysis with Custom Pattern

python tui.py batch /audio/evidence/ -p "*.mp3" -o /results/case_001/

Example 3: Programmatic Access

from pipeline import VoiceManipulationDetector

detector = VoiceManipulationDetector()
report = detector.analyze('evidence.wav')

# Access specific findings
findings = report['DETAILED_FINDINGS']
phase3 = findings['phase3_artifacts']

if phase3['pitch_formant_incoherence']['detected']:
    print(f"Incoherence confidence: {phase3['pitch_formant_incoherence']['confidence']}")

Example 4: Create Test Sample

from example import example_create_test_sample

# Creates a known-manipulated sample for testing
example_create_test_sample()

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Troubleshooting

Issue: "No module named 'librosa'"

Solution: Run pip install -r requirements.txt

Issue: Parselmouth errors

Solution: Ensure audio file is valid WAV/MP3 format

Issue: False positives on synthetic audio

Expected: Synthetic audio has unnatural characteristics that trigger detection

Issue: Memory errors on large files

Solution: Use duration parameter:

y, sr = librosa.load('large.wav', duration=30.0)  # First 30 seconds only

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Technical Details

Supported Formats

WAV, MP3, FLAC, OGG, M4A (via librosa)

System Requirements

Python >= 3.10
4GB RAM minimum
Linux/macOS/Windows

Detection Methods

F0 Extraction: librosa.piptrack for robust pitch detection
Formant Analysis: Praat-Parselmouth Burg algorithm
Phase Analysis: STFT with phase coherence metrics
Spectral Analysis: Mel spectrogram with artifact detection

Security Features

Sandboxed execution recommended
Read-only file permissions
No network access required
Cryptographic verification of all outputs

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

References

Tactical Implementation Specification (TIS): See project README
Source Code: /home/john/voice/
Test Suite: test_pipeline.py
Examples: example.py

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

FilesExpand file tree

usage.md

Latest commit

History

usage.md

File metadata and controls

USAGE GUIDE

Voice Manipulation Detection Pipeline

Quick Start

Installation

Basic Usage

Option 1: Text User Interface (TUI) - Recommended

Option 2: Python API

Option 3: Command Line

Features

1. Multi-Phase Analysis

2. Verifiable Outputs

3. Multiple Output Formats

Output Structure

Understanding Reports

Key Metrics

Evidence Vectors

Confidence Levels

Advanced Usage

Verification System

Batch Processing

Export to CSV

Testing

Examples

Example 1: Quick Analysis

Example 2: Batch Analysis with Custom Pattern

Example 3: Programmatic Access

Example 4: Create Test Sample

Troubleshooting

Issue: "No module named 'librosa'"

Issue: Parselmouth errors

Issue: False positives on synthetic audio

Issue: Memory errors on large files

Technical Details

Supported Formats

System Requirements

Detection Methods

Security Features

References