Add PGS to SRT OCR subtitle extraction feature#709
Merged
cdgriffith merged 2 commits intocdgriffith:developfrom Feb 7, 2026
Merged
Add PGS to SRT OCR subtitle extraction feature#709cdgriffith merged 2 commits intocdgriffith:developfrom
cdgriffith merged 2 commits intocdgriffith:developfrom
Conversation
Implements OCR-based conversion of PGS (Presentation Graphic Stream) subtitles to SRT format with automatic tool detection. Features: - Auto-detect Tesseract OCR from PATH or Subtitle Edit installations - Auto-detect MKVToolNix from standard install locations - Support multiple language codes (ISO 639-2/3, language names) - GUI checkbox to enable/disable OCR for PGS subtitles - Automatic cleanup of intermediate .sup files after conversion Dependencies: - pgsrip: PGS subtitle OCR engine - pytesseract: Tesseract OCR wrapper - babelfish: Language code handling - opencv-python, cleanit, trakit: Image/metadata processing Known limitation: This feature works when running from source (python -m fastflix) but not in PyInstaller-built executables due to subprocess environment issues with pgsrip. Users needing PGS OCR should run from source.
17 tasks
Contributor
Author
|
Supersedes #701 — original branch history was rewritten so GitHub can’t reopen that PR. |
Contributor
Author
|
@cdgriffith - want me to keep hammering on this? It's been a few months since I last looked at it, as I've written my own pipeline (with built-in SRT generation via OCR. |
Owner
|
Hey @mikeSGman Thanks again for this! I'm going to resolve the conflicts and just let Claude Code do any changes on it we may want |
Merged
Contributor
Author
|
Hey @cdgriffith just saw this, thanks! Hope someone finds it useful. Glad to contribute, thanks for the opportunity. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Implements OCR-based conversion of PGS (Presentation Graphic Stream) subtitles to SRT format with automatic tool detection.
Features:
Dependencies:
Known limitation:
This feature works when running from source (python -m fastflix) but not in PyInstaller-built executables due to subprocess environment issues with pgsrip. Users needing PGS OCR should run from source.