FFT Pipeline

Automated processing of NHS Friends and Family Test (FFT) data into publishable reports via CLI or web interface.

What This Does

The Friends and Family Test (FFT) is the UK's largest patient feedback programme, collecting ~2 million responses monthly. This pipeline transforms raw monthly FFT Excel data into the formatted, suppression-compliant reports that NHS England publishes each month on its Friends and Family Test data page.

Available interfaces:

Command line interface for automated/scripted processing
Web interface for interactive use via browser

Supports multiple service types:

Inpatient services (Ward → Site → Trust → ICB)
A&E services (Site → Trust → ICB), coming soon
Ambulance services (Trust → ICB), coming soon

Key Features

Privacy-First Suppression

Implements cascading suppression rules to prevent patient identification:

Any organisation with 1-4 responses gets suppressed (replaced with *)
Second-level suppression prevents reverse calculation
Cascade suppression flows from parent to child levels (ICB → Trust → Site → Ward)

Multi-Level Aggregation

Processes data at multiple geographic levels:

Aggregates responses by Likert scale (Very Good → Very Poor)
Calculates percentage positive/negative at each level
Maintains organisational hierarchy throughout
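
As a rough illustration of the percentage step, the sketch below computes percentage positive and negative from one row of Likert counts. The category names, the choice of "Very Good" plus "Good" as positive, and the use of all responses as the denominator are assumptions made for this example; the pipeline's real column names and definitions live in config.py and processors.py and may differ.

```python
from typing import Dict

# Assumed category names; the real column names may differ.
POSITIVE = ("Very Good", "Good")
NEGATIVE = ("Poor", "Very Poor")


def percent_scores(counts: Dict[str, int]) -> Dict[str, float]:
    """Return percentage positive and negative for one row of Likert counts.

    The denominator is every response in the row, including neutral and
    "Don't Know" answers (an assumption for this sketch).
    """
    total = sum(counts.values())
    if total == 0:
        # No responses: report zeros rather than dividing by zero.
        return {"percent_positive": 0.0, "percent_negative": 0.0}
    positive = sum(counts.get(name, 0) for name in POSITIVE)
    negative = sum(counts.get(name, 0) for name in NEGATIVE)
    return {
        "percent_positive": round(100 * positive / total, 1),
        "percent_negative": round(100 * negative / total, 1),
    }


# Hypothetical ward with 100 responses in total.
ward = {
    "Very Good": 60,
    "Good": 25,
    "Neither Good nor Poor": 5,
    "Poor": 4,
    "Very Poor": 4,
    "Don't Know": 2,
}
scores = percent_scores(ward)
print(scores)  # {'percent_positive': 85.0, 'percent_negative': 8.0}
```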

Web Interface

Simple browser-based interface for interactive pipeline operations:

Service type selection with dynamic file discovery
Month filtering with "All months" option
Real-time status updates and log output
One-click output folder access
Accessible design with dark mode support

Quick Start

# Create virtual environment and install dependencies
uv venv
uv sync

Option 1: Web Interface

# Start web interface (opens automatically in browser)
uv run python src/fft/app/server.py

Option 2: Command Line

# Run for inpatient data (default: latest 2 months)
uv run python -m fft --ip

# Run for A&E data
uv run python -m fft --ae

# Run for ambulance data
uv run python -m fft --amb

Validation

Validation compares pipeline outputs against the VBA ground truth files.

# Validate all months
uv run python -m fft --validate

# Validate specific month
uv run python -m fft --validate --month Aug-25

Results: 75% of sheets validate perfectly. The remaining differences are isolated to tie-breaking when organisations have equal response counts. To our understanding, privacy protection remains identical.
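
To make the idea of a sheet-level comparison concrete, here is a minimal sketch that compares in-memory sheets and reports the share that match exactly. It is not the logic in validation.py: the function names, the sheet names and the sample rows are all hypothetical, and the real validator reads Excel workbooks rather than Python lists.

```python
from typing import Dict, List

Sheet = List[List[object]]  # one sheet represented as a list of rows


def compare_sheets(output: Dict[str, Sheet], truth: Dict[str, Sheet]) -> Dict[str, bool]:
    """Map each ground-truth sheet name to True when the output sheet equals it."""
    return {name: output.get(name) == rows for name, rows in truth.items()}


def match_rate(results: Dict[str, bool]) -> float:
    """Percentage of sheets that matched exactly."""
    if not results:
        return 0.0
    return 100 * sum(results.values()) / len(results)


# Hypothetical sheets, invented purely to exercise the functions.
truth = {
    "Trusts": [["Trust A", 150], ["Trust B", "*"], ["Trust C", "*"]],
    "Sites": [["Site X", 40], ["Site Y", "*"]],
}
output = {
    "Trusts": [["Trust A", 150], ["Trust B", "*"], ["Trust C", "*"]],
    "Sites": [["Site X", "*"], ["Site Y", "*"]],  # differs from the ground truth
}
results = compare_sheets(output, truth)
rate = match_rate(results)
print(results, rate)  # {'Trusts': True, 'Sites': False} 50.0
```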

Installation Requirements

This project uses uv for dependency management. All required packages are listed in pyproject.toml. To install them:

uv venv
uv sync

Project Structure

fft_pipeline/
├── src/
│   └── fft/
│       ├── app/            # FastHTML web interface
│       │   ├── __init__.py
│       │   ├── __main__.py
│       │   └── server.py   # Main web application
│       ├── config.py       # Centralised configuration (paths, mappings, constants)
│       ├── __init__.py
│       ├── loaders.py      # Data loading from Excel files
│       ├── __main__.py     # CLI entry point
│       ├── processors.py   # Transformation pipeline (rename, aggregate, calculate)
│       ├── suppression.py  # Privacy suppression logic (first/second/cascade)
│       ├── validation.py   # Ground truth comparison and output verification
│       └── writers.py      # Excel output generation
├── data/
│   ├── inputs/
│   │   ├── collections_overview/  # Collections metadata files
│   │   ├── raw/                   # Monthly raw Excel files (FFT_IP_V1 Aug-25.xlsx)
│   │   ├── suppression_files/     # VBA suppression reference files
│   │   └── templates/             # Excel templates (FFT-inpatient-data-template.xlsm)
│   └── outputs/
│       └── ground_truth/          # Reference validation files

Data Flow

graph LR
    A[Raw Excel Files] --> B[loaders.py]
    B --> C[processors.py]
    C --> D[suppression.py]
    D --> E[writers.py]
    E --> F[Output .xlsm]
    
    G[config.py] --> B
    G --> C
    G --> D
    G --> E
    
    H[Template .xlsm] --> E

How Suppression Works

The Problem: Small response counts (< 5) could identify individual patients.

The Solution: Three-level suppression cascade

First-level: Any row with 1-4 responses gets all Likert responses replaced with *
Second-level: The next-lowest responding organisation also gets suppressed (prevents "Total - Known = Suppressed" calculation)
Cascade: If a parent level (e.g., ICB) requires suppression, the two lowest-responding children (e.g., Trusts) also get suppressed

Example:

ICB North (232 responses - suppressed due to having a Trust with 2 responses)
├─ Trust A: 150 responses → Shown
├─ Trust B: 80 responses → * (cascade suppression - Rank 2)
└─ Trust C: 2 responses → * (first-level suppression - Rank 1)

Without also suppressing Trust B, someone could calculate 232 - 150 - 80 = 2, revealing Trust C's count. With both suppressed, only their combined total (232 - 150 = 82) can be derived.
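
The first-level and second-level rules can be sketched for a single group of sibling organisations as follows. This is a simplified illustration rather than the code in suppression.py: the function names are hypothetical, ties are broken alphabetically, zero-response organisations are ignored, and the second-level rule is always applied once anything is suppressed. All of these are assumptions that the real rules may not share.

```python
from typing import Dict, Set, Union


def suppressed_orgs(responses: Dict[str, int]) -> Set[str]:
    """Return the organisations to mask within one sibling group."""
    # First level: any organisation with 1-4 responses.
    first = {org for org, n in responses.items() if 1 <= n <= 4}
    if not first:
        return set()
    # Second level: the lowest-responding organisation not already masked.
    # Ties are broken by name here; zero-response rows are skipped (assumption).
    remaining = sorted(
        (n, org) for org, n in responses.items() if org not in first and n > 0
    )
    second = {remaining[0][1]} if remaining else set()
    return first | second


def mask(responses: Dict[str, int]) -> Dict[str, Union[int, str]]:
    """Replace suppressed counts with '*' and leave the rest unchanged."""
    hidden = suppressed_orgs(responses)
    return {org: "*" if org in hidden else n for org, n in responses.items()}


trusts = {"Trust A": 150, "Trust B": 80, "Trust C": 2}
print(mask(trusts))  # {'Trust A': 150, 'Trust B': '*', 'Trust C': '*'}
```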

Geographic Level Processing Pattern

graph TD
    A[Raw Excel Data] --> B[Ward Level]
    B --> C[Site Level]
    C --> D[Trust/Organisation Level]
    D --> E[ICB Level]
    E --> F[National Level]
    
    B --> B1[Steps:<br/>1. Standardise columns<br/>2. Mark IS1 providers<br/>3. Remove unwanted columns]
    
    C --> C1[Same steps as Ward]
    
    D --> D1[Same steps +<br/>Merge collection modes]
    
    E --> E1[Aggregate from Trust level<br/>Group by ICB Code/Name<br/>Sum responses<br/>Recalculate percentages]
    
    F --> F1[Aggregate from Trust level<br/>Group by Submitter Type<br/>Total + NHS + IS1 rows]
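
The ICB step in the diagram (group, sum, recalculate) might look roughly like the sketch below. The row layout, field names and ICB codes are hypothetical, and grouping is by code only for brevity. The point it illustrates is that percentages are recalculated from the summed counts rather than averaged across Trusts.

```python
from collections import defaultdict
from typing import Dict, List


def aggregate_by_icb(trust_rows: List[dict]) -> Dict[str, Dict[str, int]]:
    """Sum Likert counts over all Trusts that share an ICB code."""
    totals: Dict[str, Dict[str, int]] = defaultdict(lambda: defaultdict(int))
    for row in trust_rows:
        for column, value in row["counts"].items():
            totals[row["icb_code"]][column] += value
    return {code: dict(columns) for code, columns in totals.items()}


def percent_positive(counts: Dict[str, int]) -> float:
    """Percentage of 'Very Good' and 'Good' responses (assumed definition)."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    positive = counts.get("Very Good", 0) + counts.get("Good", 0)
    return round(100 * positive / total, 1)


# Hypothetical Trust-level rows with made-up ICB codes.
trusts = [
    {"icb_code": "QAA", "trust": "Trust A", "counts": {"Very Good": 90, "Poor": 10}},
    {"icb_code": "QAA", "trust": "Trust B", "counts": {"Very Good": 5, "Poor": 5}},
    {"icb_code": "QBB", "trust": "Trust C", "counts": {"Very Good": 30, "Poor": 0}},
]
icbs = aggregate_by_icb(trusts)
print(icbs["QAA"])                    # {'Very Good': 95, 'Poor': 15}
print(percent_positive(icbs["QAA"]))  # 86.4
```

Averaging the two Trust percentages for QAA (90% and 50%) would give 70%, which overstates the influence of the smaller Trust. Recalculating from the summed counts avoids this.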

Suppression Cascade Logic

graph TD
    A[ICB Level] -->|Apply suppression| B[Flag ICBs with 1-4 responses]
    B --> C[Second-level: Flag next lowest ICB]
    
    D[Trust Level] -->|Cascade from ICB| E[If parent ICB suppressed<br/>Suppress Rank 1 & 2 Trusts]
    E --> F[Also apply first/second level<br/>for Trusts own data]
    
    G[Site Level] -->|Cascade from Trust| H[If parent Trust suppressed<br/>Suppress Rank 1 & 2 Sites]
    H --> I[Also apply first/second level<br/>for Sites own data]
    
    J[Ward Level] -->|Cascade from Site| K[If parent Site suppressed<br/>Suppress Rank 1 & 2 Wards]
    K --> L[Also apply first/second level<br/>for Wards own data]
    
    style B fill:#E74C3C,color:#fff
    style C fill:#E74C3C,color:#fff
    style E fill:#F39C12,color:#000
    style H fill:#F39C12,color:#000
    style K fill:#F39C12,color:#000
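
The cascade step shown in the diagram, taken on its own, can be sketched as below. The function name is hypothetical, ties are broken alphabetically, and zero-response children are ranked like any other. These are assumptions for illustration, not confirmed behaviour of suppression.py.

```python
from typing import Dict, Set


def cascade_children(parent_suppressed: bool, children: Dict[str, int]) -> Set[str]:
    """Return the children to suppress because their parent was suppressed.

    When the parent is suppressed, the two lowest-responding children
    (Rank 1 and Rank 2) are suppressed as well.
    """
    if not parent_suppressed:
        return set()
    # Rank by response count, then by name so ties resolve predictably.
    ranked = sorted(children, key=lambda name: (children[name], name))
    return set(ranked[:2])


# Hypothetical Sites under a Trust that has been suppressed.
sites = {"Site X": 40, "Site Y": 3, "Site Z": 12}
hidden = cascade_children(True, sites)
print(sorted(hidden))  # ['Site Y', 'Site Z']
```

In the full flow, each level's own first-level and second-level checks would be applied in addition to this result.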

Example File Locations

data/inputs/raw/
├── FFT_IP_V1 Aug-25.xlsx       # Current month inpatient
├── FFT_IP_V1 Jul-25.xlsx       # Previous month inpatient
└── FFT_AE_V1 Aug-25.xlsx       # Current month A&E

data/inputs/templates/
├── FFT_IP_template.xlsm
├── FFT_AE_template.xlsm
└── FFT_Amb_template.xlsm

data/outputs/
├── FFT-inpatient-data-Aug-25.xlsm
└── FFT-ambulance-data-Aug-25.xlsm
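
The raw file names above encode both the service type and the month, so file discovery can work from the name alone. The sketch below shows one way to parse them and pick the latest months. The regular expression and the function names are assumptions based only on the example names shown here, not the pipeline's actual loader code.

```python
import re
from datetime import date, datetime
from typing import List, NamedTuple, Optional

# Pattern inferred from names such as "FFT_IP_V1 Aug-25.xlsx" (an assumption).
RAW_NAME = re.compile(
    r"^FFT_(?P<service>[A-Za-z]+)_V\d+ (?P<month>[A-Za-z]{3}-\d{2})\.xlsx$"
)


class RawFile(NamedTuple):
    service: str
    month: date


def parse_raw_name(filename: str) -> Optional[RawFile]:
    """Extract service code and month from a raw file name, or return None."""
    match = RAW_NAME.match(filename)
    if match is None:
        return None
    try:
        # %b matches English month abbreviations in the default C locale.
        month = datetime.strptime(match["month"], "%b-%y").date()
    except ValueError:
        return None
    return RawFile(match["service"], month)


def latest_files(filenames: List[str], service: str, count: int = 2) -> List[str]:
    """Return up to `count` file names for one service, newest month first."""
    dated = []
    for name in filenames:
        info = parse_raw_name(name)
        if info is not None and info.service == service:
            dated.append((info.month, name))
    dated.sort(reverse=True)
    return [name for _, name in dated[:count]]


names = [
    "FFT_IP_V1 Jul-25.xlsx",
    "FFT_AE_V1 Aug-25.xlsx",
    "FFT_IP_V1 Aug-25.xlsx",
    "notes.txt",
]
print(latest_files(names, "IP"))  # ['FFT_IP_V1 Aug-25.xlsx', 'FFT_IP_V1 Jul-25.xlsx']
```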

Testing

Functions use doctests for inline testing:

# Run doctests quietly (only shows failures)
uv run python -m doctest $(find src/fft/ -name "*.py" -not -name "__main__.py")
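
For readers unfamiliar with doctests, the example below shows the general shape: expected results sit in the docstring and are checked when the module is run through doctest. The function is a hypothetical illustration and is not taken from the pipeline source.

```python
def percent_positive(positive: int, total: int) -> float:
    """Return the share of positive responses as a percentage.

    >>> percent_positive(85, 100)
    85.0
    >>> percent_positive(0, 0)
    0.0
    """
    if total == 0:
        # Avoid dividing by zero when there are no responses.
        return 0.0
    return round(100 * positive / total, 1)


if __name__ == "__main__":
    import doctest

    # Prints nothing when every example in the docstrings passes.
    doctest.testmod()
```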

Utilities

BS Sheet Population

The pipeline includes a utility to populate template 'BS' sheets with content from suppression files:

# Populate all template BS sheets from corresponding suppression files
uv run python src/fft/utils.py

What it does:

Automatically discovers templates and matches them with suppression files
Validates that both files contain 'BS' sheets before copying
Handles flexible naming conventions:
- FFT_IP_template.xlsm ← IP_Suppression_V3.5.xlsm
- FFT_Amb_template.xlsm ← Aug25_Amb_Suppression.xlsm
- FFT_AE_template.xlsm ← AE_Suppression_V3.5.xlsm

Safe to run multiple times - the script is idempotent and will overwrite BS sheet content to match suppression files without creating duplicates or corruption.
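
One way to pair templates with suppression files despite the varied naming is to look for the service code as a token in each file name. The sketch below does this for the example names listed above. The function names and the token-splitting rule are assumptions, and the real utility may match files differently.

```python
import re
from typing import Dict, Iterable, Optional

SERVICE_CODES = ("IP", "AE", "Amb")


def service_code(filename: str) -> Optional[str]:
    """Return the service code found as a whole token in the file name."""
    tokens = re.split(r"[_\-. ]+", filename)
    for token in tokens:
        for code in SERVICE_CODES:
            if token.lower() == code.lower():
                return code
    return None


def match_templates(
    templates: Iterable[str], suppression_files: Iterable[str]
) -> Dict[str, str]:
    """Map each template to the first suppression file with the same code."""
    by_code: Dict[str, str] = {}
    for name in suppression_files:
        code = service_code(name)
        if code is not None:
            by_code.setdefault(code, name)
    pairs: Dict[str, str] = {}
    for template in templates:
        code = service_code(template)
        if code in by_code:
            pairs[template] = by_code[code]
    return pairs


templates = ["FFT_IP_template.xlsm", "FFT_Amb_template.xlsm", "FFT_AE_template.xlsm"]
suppression = [
    "IP_Suppression_V3.5.xlsm",
    "Aug25_Amb_Suppression.xlsm",
    "AE_Suppression_V3.5.xlsm",
]
pairs = match_templates(templates, suppression)
print(pairs["FFT_Amb_template.xlsm"])  # Aug25_Amb_Suppression.xlsm
```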

Development Status

Current: Inpatient pipeline (Ward → Site → Trust → ICB)
Next: A&E pipeline (Site → Trust → ICB)
Future: Ambulance pipeline (Trust → ICB)

This pipeline processes official NHS England data. Handle with care and ensure GDPR compliance.
