tylerbcrawford
diff --git a/‎.dockerignore‎
Lines changed: 35 additions & 0 deletions b/‎.dockerignore‎
Lines changed: 35 additions & 0 deletions
diff --git a/‎.env.example‎
Lines changed: 108 additions & 0 deletions b/‎.env.example‎
Lines changed: 108 additions & 0 deletions
diff --git a/‎.github/ISSUE_TEMPLATE/bug_report.md‎
Lines changed: 27 additions & 0 deletions b/‎.github/ISSUE_TEMPLATE/bug_report.md‎
Lines changed: 27 additions & 0 deletions
diff --git a/‎.github/ISSUE_TEMPLATE/feature_request.md‎
Lines changed: 19 additions & 0 deletions b/‎.github/ISSUE_TEMPLATE/feature_request.md‎
Lines changed: 19 additions & 0 deletions
diff --git a/‎.github/PULL_REQUEST_TEMPLATE.md‎
Lines changed: 10 additions & 0 deletions b/‎.github/PULL_REQUEST_TEMPLATE.md‎
Lines changed: 10 additions & 0 deletions
diff --git a/‎.github/workflows/docker-publish.yml‎
Lines changed: 68 additions & 0 deletions b/‎.github/workflows/docker-publish.yml‎
Lines changed: 68 additions & 0 deletions
diff --git a/‎.gitignore‎
Lines changed: 54 additions & 0 deletions b/‎.gitignore‎
Lines changed: 54 additions & 0 deletions
diff --git a/‎CHANGELOG.md‎
Lines changed: 43 additions & 0 deletions b/‎CHANGELOG.md‎
Lines changed: 43 additions & 0 deletions
@@ -0,0 +1,35 @@
+# Version control
+.git
+.gitignore
+
+# Python
+__pycache__
+*.pyc
+*.pyo
+.venv
+venv
+
+# IDE
+.vscode
+.idea
+
+# Logs and runtime data
+deepgram-logs/
+*.log
+
+# Documentation (not needed in image)
+docs/
+_devdocs/
+examples/
+CONTRIBUTING.md
+LICENSE
+README.md
+Makefile
+
+# Tests
+tests/
+
+# Environment files (secrets — never bake into image)
+.env
+.env.*
+!.env.example
@@ -0,0 +1,108 @@
+# ============================================================================
+# Deepgram API Configuration (Required for both CLI and Web UI)
+# ============================================================================
+# Get your API key from: https://console.deepgram.com/
+DEEPGRAM_API_KEY=your_deepgram_api_key_here
+
+# ============================================================================
+# Media Library Path (Required)
+# ============================================================================
+# Path to your media library on the host machine.
+# This directory is mounted into the container at /media.
+#
+# Linux:   MEDIA_PATH=/home/username/media
+# macOS:   MEDIA_PATH=/Users/username/Movies
+# Windows: MEDIA_PATH=C:/Users/YourName/Videos
+MEDIA_PATH=/path/to/your/media
+
+# ============================================================================
+# CLI Tool Configuration
+# ============================================================================
+# Optional: Force regenerate existing SRT files (0=no, 1=yes)
+# FORCE_REGENERATE=0
+
+# Optional: Profanity filter mode - "off", "tag", or "remove" (default: off)
+# PROFANITY_FILTER=off
+
+# Optional: Save raw Deepgram JSON responses for debugging (0=no, 1=yes)
+# When enabled, saves raw API responses to Transcripts/JSON/ folder
+# SAVE_RAW_JSON=0
+
+# Nova-3 Quality Enhancements (optional)
+# Convert spoken numbers to digits (e.g., "twenty twenty four" → "2024")
+# NUMERALS=0
+
+# Include filler words like "uh", "um" in transcription (usually off for subtitles)
+# FILLER_WORDS=0
+
+# Convert spoken measurements (e.g., "fifty meters" → "50m")
+# MEASUREMENTS=0
+
+# ============================================================================
+# Language Configuration (50+ Languages with Regional Variants)
+# ============================================================================
+# Language code for transcription (default: en)
+# See docs/languages.md for 50+ supported languages and regional variants
+# Common: en, en-GB, es, es-419, fr, de, ja, ko, pt-BR, multi
+# LANGUAGE=en
+
+# Automatic Language Detection (35 languages supported, batch mode only)
+# When enabled, automatically detects the dominant language from audio
+# Returns detected language code and confidence score (0-1)
+# Overrides LANGUAGE setting when enabled
+# Note: Not available for streaming transcription, batch processing only
+# DETECT_LANGUAGE=0
+
+# ============================================================================
+# Web UI Configuration (Optional)
+# ============================================================================
+
+# Flask Security
+# Generate a secure random key for production: python -c "import secrets; print(secrets.token_hex(32))"
+SECRET_KEY=change-me-in-production
+
+# Redis Configuration
+REDIS_URL=redis://redis:6379/0
+
+# Paths (should match your docker-compose.yml volume mounts)
+MEDIA_ROOT=/media
+LOG_ROOT=/logs
+
+# Transcription Defaults
+DEFAULT_MODEL=nova-3
+DEFAULT_LANGUAGE=en
+# DEFAULT_PROFANITY_FILTER=off
+
+# Security: Email Allowlist (optional)
+# Comma-separated list of allowed email addresses for OAuth access
+# Leave empty to allow all authenticated Google OAuth users
+# Example: ALLOWED_EMAILS=user1@example.com,user2@example.com
+ALLOWED_EMAILS=
+
+# Bazarr Integration (optional)
+# Enable automatic subtitle rescan after batch completion
+# Leave BAZARR_BASE_URL empty to disable integration
+BAZARR_BASE_URL=
+BAZARR_API_KEY=
+
+# Worker Concurrency (optional)
+# Number of concurrent transcription jobs per worker
+# Start with 1, increase to 2-3 if your system can handle it
+WORKER_CONCURRENCY=1
+
+# ============================================================================
+# LLM API Keys for AI-Powered Keyterm Generation (Optional Feature)
+# ============================================================================
+# These keys enable AI-powered generation of keyterms in the Web UI.
+# Keyterms boost transcription accuracy for character names and terminology.
+# This feature is COMPLETELY OPTIONAL - manual keyterms work just as well.
+#
+# Get Anthropic API key: https://console.anthropic.com/
+# Get OpenAI API key: https://platform.openai.com/
+#
+# Get Gemini API key: https://aistudio.google.com/apikey (free tier available)
+#
+# Leave blank to disable AI keyterm generation (manual keyterms still work)
+# ANTHROPIC_API_KEY=
+# OPENAI_API_KEY=
+# GEMINI_API_KEY=
@@ -0,0 +1,27 @@
+---
+name: Bug Report
+about: Report a bug to help improve Subgeneratorr
+title: ''
+labels: bug
+assignees: ''
+---
+
+**Describe the bug**
+A clear description of what the bug is.
+
+**To reproduce**
+Steps to reproduce the behavior:
+1. ...
+2. ...
+
+**Expected behavior**
+What you expected to happen.
+
+**Environment**
+- OS: [e.g., Ubuntu 24.04, macOS 15, Windows 11]
+- Docker version: [e.g., 27.0]
+- Subgeneratorr version: [e.g., v2.0.0]
+- Interface: [CLI / Web UI]
+
+**Logs**
+Paste relevant logs from `docker compose logs web worker` or CLI output.
@@ -0,0 +1,19 @@
+---
+name: Feature Request
+about: Suggest an enhancement or new feature
+title: ''
+labels: enhancement
+assignees: ''
+---
+
+**Is your feature request related to a problem?**
+A clear description of the problem. E.g., "I'm always frustrated when..."
+
+**Describe the solution you'd like**
+What you want to happen.
+
+**Alternatives considered**
+Any alternative solutions or workarounds you've considered.
+
+**Additional context**
+Any other context, screenshots, or examples.
@@ -0,0 +1,10 @@
+## What does this PR do?
+
+Brief description of the change.
+
+## Checklist
+
+- [ ] Tested with `python3 scripts/validate_setup.py`
+- [ ] Docker build succeeds (`docker compose build`)
+- [ ] Updated docs if configuration, CLI flags, or API endpoints changed
+- [ ] Follows code style (PEP 8 for Python, vanilla JS, CSS custom properties)
@@ -0,0 +1,68 @@
+name: Build and Publish Docker Images
+
+on:
+  push:
+    tags:
+      - 'v*'
+
+env:
+  REGISTRY: ghcr.io
+
+jobs:
+  build-and-push:
+    runs-on: ubuntu-latest
+    permissions:
+      contents: read
+      packages: write
+
+    strategy:
+      matrix:
+        include:
+          - image: subgeneratorr-web
+            dockerfile: web/Dockerfile
+            context: .
+          - image: subgeneratorr-worker
+            dockerfile: web/Dockerfile
+            context: .
+          - image: subgeneratorr-cli
+            dockerfile: cli/Dockerfile
+            context: .
+
+    steps:
+      - name: Checkout
+        uses: actions/checkout@v4
+
+      - name: Set up QEMU
+        uses: docker/setup-qemu-action@v3
+
+      - name: Set up Docker Buildx
+        uses: docker/setup-buildx-action@v3
+
+      - name: Log in to GHCR
+        uses: docker/login-action@v3
+        with:
+          registry: ${{ env.REGISTRY }}
+          username: ${{ github.actor }}
+          password: ${{ secrets.GITHUB_TOKEN }}
+
+      - name: Extract metadata
+        id: meta
+        uses: docker/metadata-action@v5
+        with:
+          images: ${{ env.REGISTRY }}/${{ github.repository_owner }}/${{ matrix.image }}
+          tags: |
+            type=semver,pattern={{version}}
+            type=semver,pattern={{major}}.{{minor}}
+            type=raw,value=latest
+
+      - name: Build and push
+        uses: docker/build-push-action@v6
+        with:
+          context: ${{ matrix.context }}
+          file: ${{ matrix.dockerfile }}
+          platforms: linux/amd64,linux/arm64
+          push: true
+          tags: ${{ steps.meta.outputs.tags }}
+          labels: ${{ steps.meta.outputs.labels }}
+          cache-from: type=gha
+          cache-to: type=gha,mode=max
@@ -0,0 +1,54 @@
+# Environment variables
+.env
+
+# Personal/internal documentation (keep local only)
+_devdocs/
+
+# Claude Code
+.claude/
+
+
+# Python
+__pycache__/
+*.py[cod]
+*$py.class
+*.so
+.Python
+env/
+venv/
+ENV/
+build/
+dist/
+*.egg-info/
+.pytest_cache/
+
+# Logs
+deepgram-logs/*.json
+*.log
+
+# Personal video lists (users should create their own)
+video-list.txt
+batch-videos.txt
+test-videos.txt
+
+# Temporary files
+/tmp/
+*.tmp
+*.mp3
+
+# IDE
+.vscode/
+.idea/
+*.swp
+*.swo
+*~
+
+# OS
+.DS_Store
+Thumbs.db
+
+# Docker
+docker-compose.yml
+docker-compose.override.yml
+.mcp.json
+.worktrees/
@@ -0,0 +1,43 @@
+# Changelog
+
+All notable changes to Subgeneratorr will be documented in this file.
+
+The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
+and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+
+## [2.0.0] - 2026-02-25
+
+Initial public release.
+
+### Added
+
+- **Core Transcription Engine** — Deepgram Nova-3 speech recognition with SRT subtitle output
+- **Nova-3 Full Feature Coverage** — Model selector (General/Medical), redaction (PCI/PII/numbers), find & replace, dictation mode, multichannel processing, utterance split threshold (0.1–5.0s), and request tagging
+- **Audio Intelligence** — Sentiment analysis, summarization, topic/intent/entity detection, and term search (English only, saved to Intelligence/ folder)
+- **Web UI** — Flask-based interface with dark/light themes, zone-based layout, gear popover for preferences, and collapsible Transcription Settings panel
+- **CLI** — Command-line tool for batch processing directories, individual files, or file lists
+- **LLM-Enhanced Keyterms** — Optional AI-powered generation of character names and terminology using Claude, GPT, or Gemini to improve transcription accuracy
+- **Multi-Language Support** — 50+ languages with regional variants (English, Spanish, French, German, Japanese, Korean, Hindi, and many more)
+- **Multilingual Model** — Special `multi` mode processes 10 languages simultaneously with automatic language detection
+- **Language-Aware Audio Selection** — Automatically selects the correct audio track in multi-language containers with surround sound center channel extraction
+- **Speaker Diarization** — Identify and label speakers in generated transcripts
+- **Subtitle Detection** — Sidecar file glob (`.en.srt`, `.ass`, `.vtt`) with ffprobe fallback to identify existing subtitles before processing
+- **File Browser** — Navigate media directories with client-side filtering and API-backed global search across the entire library
+- **Batch Processing** — Queue multiple files with Celery/Redis, real-time progress tracking, and polling watchdog for reliability
+- **Overwrite Protection** — Confirmation dialog before regenerating existing subtitles
+- **Cost Tracking** — Real-time per-file and session cost estimates with detailed logging (~$0.0043/min)
+- **Smart Skipping** — Automatically skip files that already have subtitles
+- **Docker Deployment** — Docker Compose with `MEDIA_PATH` env var, Dockerfile builds, health checks, and resource limits
+- **GHCR Docker Images** — Multi-arch (amd64 + arm64) pre-built images via GitHub Actions
+- **Media Server Integration** — Output `.eng.srt` files auto-recognized by Plex, Jellyfin, and Emby
+- **Sticky Action Bar** — Language selector and transcribe button remain accessible while scrolling
+- **iOS Safari Compatibility** — Fixed scroll bounce and viewport issues for mobile access
+- **Documentation** — Setup guide, technical reference, language support guide, API docs, contributing guidelines, and community files (CODE_OF_CONDUCT, SECURITY, issue/PR templates)
+
+### Security
+
+- **Path traversal protection** — Input validation on file paths to prevent directory escape
+- **Error path hardening** — Removed bare excepts, added timeout guards, and safe handling of empty API responses
+
+[Unreleased]: https://github.com/tylerbcrawford/subgeneratorr/compare/v2.0.0...HEAD
+[2.0.0]: https://github.com/tylerbcrawford/subgeneratorr/releases/tag/v2.0.0