Releases: timminator/VideOCR

VideOCR v1.4.0

10 Feb 20:59

Highlights:

  • Integrates the latest PaddleOCR 3.4 version, which extends support to 110 languages and provides increased accuracy for Latin, Cyrillic, Arabic, etc.
  • Adds support for batch processing
  • The preprocessing step for VFR videos (Mapping) and Step 1 are now multithreaded (e.g. up to 7x performance increase on a 6-core 5600X)
  • Adds a progress bar and a status message for better progress indication
  • Adds ETA calculation for every step
  • Tasks can now be paused and resumed
  • Adds support for post-completion tasks, e.g. shutdown, sleep, lock...
  • The operating system is now prevented from going to sleep while the operation is ongoing
  • The CLI version can now be run in parallel to process multiple videos at the same time (thanks @Phr33d0m)
  • ISO 639 Language codes are now added to the subtitle file names. This allows video players to identify the correct subtitle language
  • Command Line Arguments for the CLI version are now properly validated to provide a user-friendly error message
  • Adds an option for normalizing Traditional Chinese to Simplified Chinese when using "Chinese & English", for better results
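
The language-code naming mentioned above can be illustrated with a minimal sketch. The helper name `subtitle_path` and the exact `<name>.<code>.srt` layout are assumptions for illustration, not taken from the VideOCR source; many video players recognise a language tag placed before the `.srt` extension.

```python
from pathlib import Path


def subtitle_path(video: Path, lang_code: str) -> Path:
    """Build a subtitle file name that carries a language code.

    Hypothetical helper: replaces the video extension with
    ".<lang_code>.srt", e.g. movie.mkv -> movie.en.srt.
    """
    return video.with_suffix(f".{lang_code}.srt")


if __name__ == "__main__":
    print(subtitle_path(Path("movie.mkv"), "en"))
```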

Fixes:

  • Fixed an issue where subtitles could merge incorrectly due to substring matches
  • Fixed an issue where subtitles were not correctly merged for VFR videos depending on the specified merge gap
  • Fixed usage of a wrongly sized image for subtitle position "any" in the SSIM calculation. This resulted in a performance decrease in previous versions
  • Post Processing is now disabled by default due to increased accuracy of PaddleOCR
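
To see why substring matching can merge subtitles incorrectly, consider the sketch below. It is not the project's actual fix; the function names and the 0.8 threshold are assumptions. It only contrasts a naive containment test with a similarity ratio from Python's standard `difflib` module.

```python
from difflib import SequenceMatcher


def naive_same(a: str, b: str) -> bool:
    """Containment test: treats a short line as equal to any line containing it."""
    return a in b or b in a


def similar(a: str, b: str, threshold: float = 0.8) -> bool:
    """Similarity test: the lines must be alike as a whole (assumed threshold)."""
    return SequenceMatcher(None, a, b).ratio() >= threshold


if __name__ == "__main__":
    short, long_ = "No", "No, I won't go"
    print(naive_same(short, long_))  # containment says "same subtitle"
    print(similar(short, long_))     # ratio-based check disagrees
```

With the containment test, two consecutive but different lines such as "No" and "No, I won't go" would be treated as one subtitle, while a whole-line comparison keeps them apart and still tolerates small OCR differences.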

VideOCR v1.4.0-Beta2

07 Feb 23:13

Pre-release

Highlights:

  • Adds support for batch processing
  • The preprocessing step for VFR videos (Mapping) and Step 1 are now multithreaded (e.g. up to 7x performance increase on a 6-core 5600X)
  • Adds a progress bar and a status message for better progress indication
  • Adds ETA calculation for every step
  • Tasks can now be paused and resumed
  • The operating system is now prevented from going to sleep while the operation is ongoing
  • The CLI version can now be run in parallel to process multiple videos at the same time (thanks @Phr33d0m)
  • ISO 639 Language codes are now added to the subtitle file names. This allows video players to identify the correct subtitle language
  • Command Line Arguments for the CLI version are now properly validated to provide a user-friendly error message
  • Adds an option for normalizing Traditional Chinese to Simplified Chinese when using "Chinese & English", for better results

Fixes:

  • Fixed an issue where subtitles could merge incorrectly due to substring matches
  • Fixed an issue where subtitles were not correctly merged for VFR videos depending on the specified merge gap
  • Fixed usage of a wrongly sized image for subtitle position "any" in the SSIM calculation. This resulted in a performance decrease in previous versions.

Highlight of the v1.3.x release

  • The main highlight is the newly added support for PaddleOCR 3.x.x with greatly improved OCR capabilities in more than 40 languages, including English, Chinese, Japanese, French, Spanish, Portuguese, Russian, Korean, etc.

VideOCR v1.4.0-Beta

29 Jan 01:13

Pre-release

Highlights:

  • Adds support for batch processing
  • The preprocessing step for VFR videos (Mapping) and Step 1 are now multithreaded (e.g. up to 7x performance increase on a 6-core 5600X)
  • Adds a progress bar and a status message for better progress indication
  • Adds ETA calculation for every step
  • Tasks can now be paused and resumed
  • The CLI version can now be run in parallel to process multiple videos at the same time (thanks @Phr33d0m)
  • ISO 639 Language codes are now added to the subtitle file names. This allows video players to identify the correct subtitle language
  • Command Line Arguments for the CLI version are now properly validated to provide a user-friendly error message

Fixes:

  • Fixed an issue where subtitles could merge incorrectly due to substring matches
  • Fixed usage of a wrongly sized image for subtitle position "any" in the SSIM calculation. This resulted in a performance decrease in previous versions.

Highlight of the v1.3.x release

  • The main highlight is the newly added support for PaddleOCR 3.x.x with greatly improved OCR capabilities in more than 40 languages, including English, Chinese, Japanese, French, Spanish, Portuguese, Russian, Korean, etc.

VideOCR v1.3.3-Beta3

25 Jan 14:00

Pre-release

Highlights:

  • The preprocessing step for VFR videos (Mapping) and Step 1 are now multithreaded (e.g. up to 7x performance increase on a 6-core 5600X)
  • Adds a progress bar and a status message for better progress indication
  • Adds ETA calculation for every step
  • The CLI version can now be run in parallel to process multiple videos at the same time (thanks @Phr33d0m)
  • ISO 639 Language codes are now added to the subtitle file names. This allows video players to identify the correct subtitle language
  • Command Line Arguments for the CLI version are now properly validated to provide a user-friendly error message
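
An ETA for a step can be estimated from the processing rate observed so far. The sketch below is a generic linear extrapolation, not the calculation VideOCR itself uses; the function name `eta_seconds` and its parameters are assumptions for illustration.

```python
from typing import Optional


def eta_seconds(done: int, total: int, elapsed: float) -> Optional[float]:
    """Estimate the remaining seconds for a step by linear extrapolation.

    done    -- number of items (e.g. frames) processed so far
    total   -- number of items in the step
    elapsed -- seconds spent on the step so far
    Returns None while no item has finished, since no rate is known yet.
    """
    if done <= 0:
        return None
    remaining = max(total - done, 0)
    return elapsed * remaining / done


if __name__ == "__main__":
    # 25 of 100 frames took 10 s, so the remaining 75 should take about 30 s.
    print(eta_seconds(done=25, total=100, elapsed=10.0))
```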

Fixes:

  • Fixed an issue where subtitles could merge incorrectly due to substring matches
  • Fixed usage of a wrongly sized image for subtitle position "any" in the SSIM calculation. This resulted in a performance decrease in previous versions.

Highlight of the v1.3.x release

  • The main highlight is the newly added support for PaddleOCR 3.x.x with greatly improved OCR capabilities in more than 40 languages, including English, Chinese, Japanese, French, Spanish, Portuguese, Russian, Korean, etc.

VideOCR v1.3.3-Beta2

23 Jan 15:28

Pre-release

Highlights:

  • The preprocessing step for VFR videos (Mapping) and Step 1 are now multithreaded (e.g. up to 7x performance increase on a 6-core 5600X)
  • Adds a progress bar and a status message for better progress indication
  • Adds ETA calculation for every step
  • The CLI version can now be run in parallel to process multiple videos at the same time (thanks @Phr33d0m)
  • ISO 639 Language codes are now added to the subtitle file names. This allows video players to identify the correct subtitle language
  • Command Line Arguments for the CLI version are now properly validated to provide a user-friendly error message

Fixes:

  • Fixed usage of a wrongly sized image for subtitle position "any" in the SSIM calculation. This resulted in a performance decrease in previous versions.

Highlight of the v1.3.x release

  • The main highlight is the newly added support for PaddleOCR 3.x.x with greatly improved OCR capabilities in more than 40 languages, including English, Chinese, Japanese, French, Spanish, Portuguese, Russian, Korean, etc.

VideOCR v1.3.3-Beta

18 Jan 13:28

Pre-release

Highlights:

  • This new beta shows a performance increase of up to 5x in Step 1.
  • The CLI version can now be run in parallel to process multiple videos at the same time (thanks @Phr33d0m)
  • ISO 639 Language codes are now added to the subtitle file names. This allows video players to identify the correct subtitle language.

Fixes:

  • Fixed usage of a wrongly sized image for subtitle position "any" in the SSIM calculation. This resulted in a performance decrease in previous versions.

Highlight of the v1.3.x release

  • The main highlight is the newly added support for PaddleOCR 3.x.x with greatly improved OCR capabilities in more than 40 languages,
    including English, Chinese, Japanese, French, Spanish, Portuguese, Russian, Korean, etc.

VideOCR v1.3.2

30 Aug 21:45

Highlights

  • Integrates the latest PaddleOCR 3.2 version, which adds two new languages (Greek and Thai) and improves accuracy for the English model

Fixes:

  • Fixes incompatibility with Nvidia 10 Series graphics cards
  • Fixes crash on videos whose frame count is unknown
  • Fixes key error on the "Save SRT in Video Directory" option

Highlight of the v1.3.x release

  • The main highlight is the newly added support for PaddleOCR 3.x.x with greatly improved OCR capabilities in more than 40 languages,
    including English, Chinese, Japanese, French, Spanish, Portuguese, Russian, Korean, etc.

VideOCR v1.3.1

20 Aug 20:44

Highlights

  • Add Blackwell support
  • Add support for .ts and .m2ts files
  • Add downscaling step before performing OCR (increases performance drastically on high resolution videos)
  • Add feature to save crop box selection
  • Add multilingual support for the UI and the Installer
  • Add hardware support check
  • Enhance Subprocess Error Handling and Diagnostic Logging

Fixes:

  • Fixes text output for right to left languages
  • Fixes taskbar progress bug

Highlight of the v1.3.x release

  • The main highlight is the newly added support for PaddleOCR 3.x.x with greatly improved OCR capabilities in more than 40 languages,
    including English, Chinese, Japanese, French, Spanish, Portuguese, Russian, Korean, etc.

Notes:

  • Utilizes standalone version of PaddleOCR from https://github.com/timminator/PaddleOCR-Standalone
  • Due to a bug affecting 10 Series cards with CUDA 12.8, the GPU version requires an Nvidia graphics card with compute capability between 7.5 and 12.0 (16 Series or newer) and driver version 527.41 (Linux: 525.60.13) or higher!
  • If you are using a 10 Series card please check out the GPU version of the v1.3.0 release for now.
  • Linux version requires glibc 2.35 or newer (can be checked by running ldd --version in a terminal)
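
As an alternative to `ldd --version`, the glibc requirement can also be checked from Python. This is a minimal sketch using the standard library's `platform.libc_ver()`; the helper name `glibc_at_least` is an assumption for illustration and is not part of VideOCR.

```python
import platform


def glibc_at_least(version: str, minimum: tuple = (2, 35)) -> bool:
    """Compare a glibc version string such as "2.35" against a minimum.

    Versions are compared as (major, minor) tuples, so "2.4" is older
    than "2.35" even though it sorts later as a plain string.
    """
    parts = tuple(int(p) for p in version.split(".")[:2])
    return parts >= minimum


if __name__ == "__main__":
    lib, version = platform.libc_ver()
    if lib == "glibc" and version:
        status = "OK" if glibc_at_least(version) else "too old"
        print(f"glibc {version}: {status}")
    else:
        print("glibc not detected (non-Linux system or a different libc)")
```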

VideOCR v1.3.0

23 Jul 20:51

This release adds a lot of new features and improvements.

  • The main highlight is the newly added support for PaddleOCR 3.x.x with greatly improved OCR capabilities in more than 40 languages,
    including English, Chinese, Japanese, French, Spanish, Portuguese, Russian, Korean, etc.

Further highlights

  • Add support for videos with variable frame rate
  • Add support for videos with a start time offset
  • Add Dual Zone OCR support
  • Adds a post processing step to split wrongfully merged words
  • Deprecates the "similar image threshold" and "similar pixel threshold" parameters in favor of a single "SSIM threshold" parameter. This new parameter enables speed improvements of over 500% for certain videos. More info about it can be found in the parameters section.
  • Adds Taskbar progress indication for Windows
  • Adds the ability to send notifications if the OCR process finished
  • Add subtitle position parameter
  • Add minimum subtitle duration parameter
  • Adds an About Tab
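
The idea behind an SSIM threshold is to compare consecutive frames (or cropped subtitle regions) and skip OCR when they are nearly identical. The sketch below computes a single-window, global SSIM over flat lists of grayscale pixel values in pure Python. It is an illustration under stated assumptions, not the project's implementation: the function name, the flat-list input, and the 0.92 threshold are all hypothetical.

```python
def global_ssim(x, y, dynamic_range=255.0):
    """Single-window SSIM of two equally sized grayscale images.

    x, y -- flat sequences of pixel intensities (0..dynamic_range).
    Both images must have the same number of pixels; comparing
    differently sized images is an error.
    """
    if len(x) != len(y) or not x:
        raise ValueError("images must be non-empty and equally sized")
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    var_x = sum((a - mean_x) * (a - mean_x) for a in x) / n
    var_y = sum((b - mean_y) * (b - mean_y) for b in y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    # Standard stabilising constants from the SSIM definition.
    c1 = (0.01 * dynamic_range) ** 2
    c2 = (0.03 * dynamic_range) ** 2
    numerator = (2 * mean_x * mean_y + c1) * (2 * cov + c2)
    denominator = (mean_x ** 2 + mean_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


def frame_unchanged(prev, cur, ssim_threshold=0.92):
    """Return True when OCR could be skipped for cur (assumed threshold)."""
    return global_ssim(prev, cur) >= ssim_threshold
```

Production implementations usually compute SSIM over sliding windows and average the result; the global variant here keeps the example short.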

Fixes:

  • Fix subtitle sorting behaviour if there are multiple lines
  • Fix video box not correctly scaling on Linux
  • Disables angle classification by default
  • Sets a unique app identifier

Notes:

  • Utilizes standalone version of PaddleOCR from https://github.com/timminator/PaddleOCR-Standalone
  • The GPU version requires an Nvidia graphics card (10 Series or newer) and driver version 522.25 (Linux: 520.56.06) or higher!
  • Nvidia 50 Series is not yet supported!
  • Linux version requires glibc 2.35 or newer (can be checked by running ldd --version in a terminal)
  • This release deprecates the old CLI in favor of CLI SA
  • Windows releases were updated two days after the initial v1.3.0 release to fix a missing dependency on systems that did not have the Visual C++ Redistributable Packages for Visual Studio 2013 installed.

VideOCR v1.2.1

21 May 19:13

  • This release adds a new parameter called "Max Merge Gap". More info about it can be found in the parameters section.
  • Several bugfixes are also included.
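
The exact semantics of "Max Merge Gap" are documented in the parameters section, which is not reproduced here. As a rough sketch under the assumption that it is the largest pause, in seconds, across which two detections of the same text are still joined into one subtitle, merging could look like this. The `Cue` type and `merge_cues` function are hypothetical names.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Cue:
    start: float  # seconds
    end: float    # seconds
    text: str


def merge_cues(cues: List[Cue], max_merge_gap: float) -> List[Cue]:
    """Join consecutive cues with identical text separated by a small gap.

    Assumes cues are sorted by start time. The input list is not modified.
    """
    merged: List[Cue] = []
    for cue in cues:
        last = merged[-1] if merged else None
        if (
            last is not None
            and last.text == cue.text
            and cue.start - last.end <= max_merge_gap
        ):
            # Same text after a short pause: extend the previous cue.
            last.end = max(last.end, cue.end)
        else:
            merged.append(Cue(cue.start, cue.end, cue.text))
    return merged
```

A larger gap value joins more flickering detections into one subtitle, at the risk of merging a line that is genuinely repeated after a pause.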

Fixes:

  • Subtitles disappeared one frame too early
  • Subtitles which were visible until the end of the video or the specified time had a wrong SRT end timestamp
  • Crop box was not applied if it extended to the top left corner
  • Crop box was not completely seen when it extended all the way to the right and bottom
  • Icon was displayed blurry in the taskbar on Windows
  • Update tooltip for brightness threshold

Notes:

  • Utilizes standalone version of PaddleOCR from https://github.com/timminator/PaddleOCR-Standalone
  • The GPU version requires an Nvidia graphics card (10 Series or newer) and driver version 522.25 (Linux: 520.56.06) or higher!
  • Linux version requires glibc 2.35 or newer (can be checked by running ldd --version in a terminal)
