EPUB Safety Scanner

Detect and fix malicious content embedded in EPUB files. Scans entirely in-memory — no temp files, no disk extraction.

Download

Tip

For security, always download from this GitHub repository directly. Do not download from third-party sources.

Download scan-epub-skill.zip from the latest release.

Features

JavaScript detection — <script> tags, inline event handlers (onclick, onerror, etc.), javascript: URIs, eval(), fetch(), WebSocket, and more
Malicious HTML — <iframe>, <object>, <embed>, <form>, <applet>, <meta refresh>, <base>, data: URIs
Malicious CSS — expression(), -moz-binding, behavior, external url() / @import (tracking pixels), javascript: in CSS
Suspicious files — executables (.exe, .bat, .sh, .dll, .ps1), nested archives (.zip, .rar, .7z), disguised executables (MZ header mismatch)
ZIP security — path traversal (../), zip bomb detection (size & compression ratio), file integrity checks
SVG scanning — scripts and event handlers inside SVG images
External URL detection — passive tracking via src, action, CSS url() flagged as WARNING; safe <a href> links kept as INFO
Auto-fix mode — remove threats and repack as [fixed] filename.epub
Markdown report — export detailed scan results to .md file
Claude.ai Skill — package as a skill for use in Claude.ai

Quick Start

git clone https://github.com/stanwu/epub-safety-scanner.git
cd epub-safety-scanner

# Scan all EPUBs on your Desktop
python3 epub_safety_scanner.py --path ~/Desktop/

# Fix any threats found
python3 epub_safety_scanner.py --path ~/Desktop/ --fix

No external dependencies required — Python 3.9+ stdlib only.

Requirements

Python 3.9+
No external dependencies (stdlib only)

Installation

git clone https://github.com/stanwu/epub-safety-scanner.git
cd epub-safety-scanner

For development (linting, testing):

python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements-dev.txt

CLI Reference

python3 epub_safety_scanner.py --path PATH [OPTIONS]

Flag	Description
`--path PATH`	(Required) EPUB file, directory, or glob pattern. Supports `~` expansion.
`--fix`	Remove threats and save as `[fixed] filename.epub` in the same directory.
`--report FILE`	Write a detailed Markdown report to the specified file.
`-v, --verbose`	Show INFO-level findings (external hyperlinks, hidden by default).
`--no-color`	Disable colored terminal output.

Exit codes: 1 if any CRITICAL findings, 0 otherwise.

Usage Scenarios

Scenario 1: Scan a Single EPUB

python3 epub_safety_scanner.py --path ~/Desktop/book.epub

Outputs a severity summary per file. CLEAN means no threats detected.

Scenario 2: Batch Scan a Directory

python3 epub_safety_scanner.py --path ~/Desktop/

Automatically finds all *.epub files in the directory. Displays per-file results and a final summary showing how many files have issues.

Scenario 3: Fix Threats and Verify

# Step 1: Fix all threats
python3 epub_safety_scanner.py --path ~/Desktop/ --fix

# Step 2: Verify the fixed files are clean
python3 epub_safety_scanner.py --path ~/Desktop/"[fixed]*"

Each fixed EPUB is saved as [fixed] original.epub in the same directory. The original file is left untouched.

Scenario 4: Generate a Report for Review

python3 epub_safety_scanner.py --path ~/Desktop/ --report report.md

Creates a Markdown report with:

Scan date and file count
Summary table (status per file)
Per-file details grouped by threat category
Evidence snippets for each finding

Scenario 5: Full Workflow (Scan + Fix + Report)

python3 epub_safety_scanner.py --path ~/Desktop/ --fix --report report.md

Scans all EPUBs, fixes threats, and exports a report — all in one command.

Scenario 6: Verbose Mode — Inspect External Links

python3 epub_safety_scanner.py --path ~/Desktop/ -v

Shows INFO-level findings (external <a href> links) that are hidden by default. Useful for auditing what external URLs an EPUB references. URLs are color-coded: green for safe <a href> links, red for suspicious external resources.

Understanding the Output

Severity	Meaning	Default
CRITICAL	High risk — JavaScript, executables, disguised files, `<iframe>`, `<applet>`	Shown
WARNING	Medium risk — external resource loading (`src`, `action`), suspicious CSS, nested archives	Shown
INFO	Low risk — external hyperlinks (`<a href>`)	Hidden (use `-v`)

Files with only INFO findings display as CLEAN by default
URLs in the output are color-coded: green = safe <a href>, red = suspicious
Each finding includes the internal file path and an evidence snippet

Fix Mode

--fix removes threats and saves a clean copy as [fixed] original.epub:

Threat	Action
`.js` files	Removed
Executables (`.exe`, `.dll`, etc.)	Removed
Nested archives (`.zip`, `.rar`, etc.)	Removed
Path traversal entries (`../`)	Removed
`<script>`, `<iframe>`, `<applet>`, `<object>`, `<embed>`	Stripped from HTML
Event handlers (`onclick`, `onerror`, etc.)	Stripped from attributes
`javascript:` / `data:text/html` URIs	Neutralized to `#`
`<meta refresh>`, `<base>`	Stripped
External `src`, `action`, `poster`, `data` URLs	Removed
CSS `expression()`, `-moz-binding`, `behavior`	Removed
CSS `url(https://...)`, `@import url(https://...)`	Removed
`<a href="https://...">`	Preserved (normal for ebooks)

If no threats are found, no fixed file is created.

Claude.ai Skill

Use the downloaded scan-epub-skill.zip as a Claude.ai Skill — no installation required.

Go to claude.ai
Navigate to Customize > Skills
Click + and select Upload a skill
Upload scan-epub-skill.zip

Once uploaded, Claude will use the scanner when you ask it to check EPUB files for security threats.

Or build the skill ZIP locally:

make skill

Development

make test          # Run unit tests (111 tests)
make lint          # Run linters (ruff, bandit, mypy)
make check         # Run all checks (lint + test)
make format        # Auto-format code
make skill         # Package claude.ai skill ZIP

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/workflows		.github/workflows
skills/scan-epub		skills/scan-epub
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
epub_safety_scanner.py		epub_safety_scanner.py
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
test_epub_safety_scanner.py		test_epub_safety_scanner.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EPUB Safety Scanner

Download

Features

Quick Start

Requirements

Installation

CLI Reference

Usage Scenarios

Scenario 1: Scan a Single EPUB

Scenario 2: Batch Scan a Directory

Scenario 3: Fix Threats and Verify

Scenario 4: Generate a Report for Review

Scenario 5: Full Workflow (Scan + Fix + Report)

Scenario 6: Verbose Mode — Inspect External Links

Understanding the Output

Fix Mode

Claude.ai Skill

Development

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

EPUB Safety Scanner

Download

Features

Quick Start

Requirements

Installation

CLI Reference

Usage Scenarios

Scenario 1: Scan a Single EPUB

Scenario 2: Batch Scan a Directory

Scenario 3: Fix Threats and Verify

Scenario 4: Generate a Report for Review

Scenario 5: Full Workflow (Scan + Fix + Report)

Scenario 6: Verbose Mode — Inspect External Links

Understanding the Output

Fix Mode

Claude.ai Skill

Development

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages