- **Nova-3 Full Feature Coverage** — Model selector (General/Medical), redaction (PCI/PII/numbers), find & replace, dictation mode, multichannel processing, utterance split threshold (0.1–5.0s), and request tagging
- **Audio Intelligence** — Sentiment analysis, summarization, topic/intent/entity detection, and term search (English only, saved to Intelligence/ folder)
- **Web UI** — Flask-based interface with dark/light themes, zone-based layout, gear popover for preferences, and collapsible Transcription Settings panel
- **CLI** — Command-line tool for batch processing directories, individual files, or file lists
- **LLM-Enhanced Keyterms** — Optional AI-powered generation of character names and terminology using Claude, GPT, or Gemini to improve transcription accuracy
- **Multi-Language Support** — 50+ languages with regional variants (English, Spanish, French, German, Japanese, Korean, Hindi, and many more)
- **Multilingual Model** — Special `multi` mode processes 10 languages simultaneously with automatic language detection
- **Language-Aware Audio Selection** — Automatically selects the correct audio track in multi-language containers with surround sound center channel extraction
- **Speaker Diarization** — Identify and label speakers in generated transcripts
- **Subtitle Detection** — Sidecar file glob (`.en.srt`, `.ass`, `.vtt`) with ffprobe fallback to identify existing subtitles before processing
- **File Browser** — Navigate media directories with client-side filtering and API-backed global search across the entire library
- **Batch Processing** — Queue multiple files with Celery/Redis, real-time progress tracking, and polling watchdog for reliability
- **Overwrite Protection** — Confirmation dialog before regenerating existing subtitles
- **Cost Tracking** — Real-time per-file and session cost estimates with detailed logging (~$0.0043/min)
- **Smart Skipping** — Automatically skip files that already have subtitles
- **Docker Deployment** — Docker Compose with `MEDIA_PATH` env var, Dockerfile builds, health checks, and resource limits
- **Media Server Integration** — Output `.eng.srt` files auto-recognized by Plex, Jellyfin, and Emby
- **Sticky Action Bar** — Language selector and transcribe button remain accessible while scrolling
- **iOS Safari Compatibility** — Fixed scroll bounce and viewport issues for mobile access
- **Documentation** — Setup guide, technical reference, language support guide, API docs, contributing guidelines, and community files (CODE_OF_CONDUCT, SECURITY, issue/PR templates)
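
The sidecar check behind Subtitle Detection and Smart Skipping could look roughly like the sketch below. It is illustrative only, not the project's actual code: the function name `find_sidecar_subtitles` and the `SUBTITLE_SUFFIXES` set are hypothetical, the ffprobe fallback is omitted, and the directory is scanned with `iterdir` instead of a literal glob pattern so that media names containing glob metacharacters such as `[` do not need escaping.

```python
from pathlib import Path

# Assumed extension set, based on the sidecar types named in the changelog.
SUBTITLE_SUFFIXES = {".srt", ".ass", ".vtt"}


def find_sidecar_subtitles(media_path):
    """Return sorted sidecar subtitle files that share the media file's stem."""
    media = Path(media_path)
    prefix = media.stem + "."
    matches = []
    for candidate in media.parent.iterdir():
        if not candidate.is_file() or candidate == media:
            continue
        if candidate.suffix.lower() not in SUBTITLE_SUFFIXES:
            continue
        # Matches "movie.srt" and language-tagged names like "movie.en.srt",
        # but not "movie2.srt", because the stem must be followed by a dot.
        if candidate.name.startswith(prefix):
            matches.append(candidate)
    return sorted(matches)
```

A caller could skip a file when the returned list is non-empty and only fall back to a slower embedded-stream probe when it is empty.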

### Security

- **Path traversal protection** — Input validation on file paths to prevent directory escape
- **Error path hardening** — Removed bare excepts, added timeout guards, and safe handling of empty API responses
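
For readers unfamiliar with path traversal checks, one common way to validate a user-supplied path against a media root is sketched below. This is a minimal illustration under assumptions, not the validation this project ships: the helper name `resolve_inside` is hypothetical, and the root is whatever directory the caller treats as the media library.

```python
from pathlib import Path


def resolve_inside(root, user_path):
    """Resolve user_path under root; raise ValueError if it escapes root."""
    root_resolved = Path(root).resolve()
    # Joining with an absolute user_path discards root, so the check below
    # also rejects absolute paths that point outside the root.
    candidate = (root_resolved / user_path).resolve()
    # relative_to raises ValueError when candidate is not under root_resolved.
    candidate.relative_to(root_resolved)
    return candidate
```

Resolving both paths before comparing them means `..` segments and symlinks are normalized first, so a request such as `../../etc/passwd` is rejected instead of being compared as a raw string.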