baxtree
diff --git a/.github/workflows/dockerhub-extra.yml b/.github/workflows/dockerhub-extra.yml
Lines changed: 3 additions & 1 deletion
diff --git a/.github/workflows/lint-charts.yml b/.github/workflows/lint-charts.yml
Lines changed: 1 addition & 1 deletion
diff --git a/README.md b/README.md
Lines changed: 173 additions & 119 deletions
@@ -44,8 +44,10 @@ jobs:
         id: docker_build_latest
         uses: docker/build-push-action@v4
         with:
-          context: .
+          context: ./docker
           file: "./docker/Dockerfile-Ubuntu20"
+          build-args: |
+            "RELEASE_VERSION=${{ env.SUBALIGNER_TAG }}"
           platforms: linux/amd64,linux/arm64
           allow: network.host
           github-token: ${{ github.token }}
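
For local testing, the workflow inputs changed above map roughly onto a `docker buildx build` invocation. The sketch below only composes and prints that command. It assumes the Dockerfile declares `ARG RELEASE_VERSION`, uses a placeholder tag value (in the workflow it comes from `env.SUBALIGNER_TAG`), and leaves out the `allow: network.host` setting.

```shell
# Placeholder tag for illustration; the workflow supplies env.SUBALIGNER_TAG.
SUBALIGNER_TAG="${SUBALIGNER_TAG:-0.0.0}"

# Compose (but do not run) a build command mirroring the workflow inputs:
# build context ./docker, Dockerfile path relative to the repository root.
build_cmd="docker buildx build \
--platform linux/amd64,linux/arm64 \
--build-arg RELEASE_VERSION=${SUBALIGNER_TAG} \
--file ./docker/Dockerfile-Ubuntu20 \
./docker"

echo "${build_cmd}"
```

Because the context is now `./docker`, only files under that directory are sent to the builder, so any `COPY` instruction in the Dockerfile would need to reference paths relative to it.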
 
@@ -12,7 +12,7 @@ env:
 
 jobs:
   lint-helm-charts:
-    runs-on: ubuntu-20.04
+    runs-on: ubuntu-latest
 
     steps:
       - name: Checkout sources
 
@@ -17,186 +17,240 @@ Video/Audio: MP4, WebM, Ogg, 3GP, FLV, MOV, Matroska, MPEG TS, WAV, MP3, AAC, FL
 
 :information_source: <small style="line-height: 1.2;">Subaligner relies on file extensions as default hints to process a wide range of audiovisual or subtitle formats. It is recommended to use extensions widely accepted by the community to ensure compatibility.</small>
 
-## Dependencies
-Required by basic: [FFmpeg](https://www.ffmpeg.org/)
-```
-$ apt-get install ffmpeg
-```
-or
-```
+## Dependent Package
+Required by the basic installation: [FFmpeg](https://www.ffmpeg.org/)
+<details>
+<summary>Install FFmpeg</summary>
+<pre><code>$ apt-get install ffmpeg
 $ brew install ffmpeg
-```
+</code></pre>
+</details>
 
 ## Basic Installation
-```
-$ pip install -U pip && pip install -U setuptools wheel
+<details>
+<summary>Install from PyPI</summary>
+<pre><code>$ pip install -U pip && pip install -U setuptools wheel
 $ pip install subaligner
-```
-or install from source:
-```
-$ git clone git@github.com:baxtree/subaligner.git && cd subaligner
+</code></pre>
+</details>
+<details>
+<summary>Install from source</summary>
+<pre><code>$ git clone git@github.com:baxtree/subaligner.git && cd subaligner
 $ pip install -U pip && pip install -U setuptools
-$ python setup.py install
-```
+$ pip install .
+</code></pre>
+</details>
 :information_source: <small style="line-height: 1.2;">It is highly recommended to create a virtual environment prior to installation.</small>
 
 ## Installation with Optional Packages Supporting Additional Features
-```
-# Install dependencies for enabling translation and transcription
-
-$ pip install 'subaligner[llm]'
-```
-```
-# Install dependencies for enabling forced alignment
-
-$ pip install 'setuptools<65.0.0'
+<details>
+<summary>Install dependencies for enabling translation and transcription</summary>
+<pre><code>$ pip install 'subaligner[llm]'
+</code></pre>
+</details>
+
+<details>
+<summary>Install dependencies for enabling forced alignment</summary>
+<pre><code>$ pip install 'setuptools<65.0.0'
 $ pip install 'subaligner[stretch]'
-```
-```
-# Install dependencies for setting up the development environment
+</code></pre>
+</details>
 
-$ pip install 'setuptools<65.0.0'
+<details>
+<summary>Install dependencies for setting up the development environment</summary>
+<pre><code>$ pip install 'setuptools<65.0.0'
 $ pip install 'subaligner[dev]'
-```
-Note that both `subaligner[stretch]` and `subaligner[dev]` require additional dependencies to be pre-installed:
-```
-$ apt-get install espeak libespeak1 libespeak-dev espeak-data
-```
-or
-```
-$ brew install espeak
-```
-To install all supported features:
-```
-$ pip install 'setuptools<65.0.0'
+</code></pre>
+</details>
+
+
+<details>
+<summary>Install all extra dependencies</summary>
+<pre><code>$ pip install 'setuptools<65.0.0'
 $ pip install 'subaligner[harmony]'
-```
+</code></pre>
+</details>
+
+Note that `subaligner[stretch]`, `subaligner[dev]` and `subaligner[harmony]` require [eSpeak](https://espeak.sourceforge.net/) to be pre-installed:
+<details>
+<summary>Install eSpeak</summary>
+<pre><code>$ apt-get install espeak libespeak1 libespeak-dev espeak-data
+$ brew install espeak
+</code></pre>
+</details>
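
Before installing, it can help to confirm that the system packages named above are actually on `PATH`. The following is a minimal sketch: `check_tool` is a hypothetical helper, and the binary names `ffmpeg` and `espeak` are assumed to match what the package managers install.

```shell
# Report whether a command-line tool is available on PATH.
check_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "$1: found"
  else
    echo "$1: missing"
    return 1
  fi
}

# FFmpeg is needed for the basic installation; eSpeak only for the
# stretch, dev and harmony extras. Binary names are assumptions.
check_tool ffmpeg || true
check_tool espeak || true
```

The `|| true` keeps the script going after a missing tool so that both results are always reported.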
 
 ## Container Support
-If you prefer using a containerised environment over installing everything locally, run:
+If you prefer using a containerised environment over installing everything locally:
+<details>
+<summary>Run subaligner with a container</summary>
+<pre><code>$ docker run -v `pwd`:`pwd` -w `pwd` -it baxtree/subaligner bash
+</code></pre>
+</details>
 
-```
-$ docker run -v `pwd`:`pwd` -w `pwd` -it baxtree/subaligner bash
-```
 For Windows users, you can use Windows Subsystem for Linux ([WSL](https://learn.microsoft.com/en-us/windows/wsl/install)) to install Subaligner.
 Alternatively, you can use [Docker Desktop](https://docs.docker.com/docker-for-windows/install/) to pull and run the image.
-Assuming your media assets are stored under `d:\media`, open built-in command prompt, PowerShell, or Windows Terminal and run:
-```
-docker pull baxtree/subaligner
+Assuming your media assets are stored under `d:\media`, open the built-in Command Prompt, PowerShell, or Windows Terminal:
+<details>
+<summary>Run the subaligner container on Windows</summary>
+<pre><code>docker pull baxtree/subaligner
 docker run -v "/d/media":/media -w "/media" -it baxtree/subaligner bash
-```
+</code></pre>
+</details>
 
 ## Usage
-```
-# Single-stage alignment (high-level shift with lower latency)
-
-$ subaligner -m single -v video.mp4 -s subtitle.srt
+<details>
+<summary>Single-stage alignment (high-level shift with lower latency)</summary>
+<pre><code>$ subaligner -m single -v video.mp4 -s subtitle.srt
 $ subaligner -m single -v https://example.com/video.mp4 -s https://example.com/subtitle.srt -o subtitle_aligned.srt
-```
-```
-# Dual-stage alignment (low-level shift with higher latency)
+</code></pre>
+</details>
 
-$ subaligner -m dual -v video.mp4 -s subtitle.srt
+<details>
+<summary>Dual-stage alignment (low-level shift with higher latency)</summary>
+<pre><code>$ subaligner -m dual -v video.mp4 -s subtitle.srt
 $ subaligner -m dual -v https://example.com/video.mp4 -s https://example.com/subtitle.srt -o subtitle_aligned.srt
-```
-```
-# Generate subtitles by transcribing audiovisual files
-$ subaligner -m transcribe -v video.mp4 -ml eng -mr whisper -mf small -o subtitle_aligned.srt
-$ subaligner -m transcribe -v video.mp4 -ml zho -mr whisper -mf medium -o subtitle_aligned.srt
-$ subaligner -m transcribe -v video.mp4 -ml eng -mr whisper -mf turbo -ip "your initial prompt" -o subtitle_aligned.srt
-$ subaligner -m transcribe -v video.mp4 -ml eng -mr whisper -mf turbo -ip "your initial prompt" --word_time_codes -o raw_subtitle.json
-$ subaligner -m transcribe -v video.mp4 -s subtitle.srt -ml eng -mr whisper -mf turbo -o subtitle_aligned.srt
-$ subaligner -m transcribe -v video.mp4 -s subtitle.srt --use_prior_prompting -ml eng -mr whisper -mf turbo -o subtitle_aligned.srt
+</code></pre>
+</details>
 
-```
-```
-# Alignment on segmented plain texts (double newlines as the delimiter)
+<details>
+<summary>Generate subtitles by transcribing audiovisual files</summary>
+<pre><code>$ subaligner -m transcribe -v video.mp4 -ml eng -mr whisper -mf small -o subtitle_aligned.srt
+$ subaligner -m transcribe -v video.mp4 -ml zho -mr whisper -mf medium -o subtitle_aligned.srt
+</code></pre>
+</details>
+
+<details>
+<summary>Pass in a global prompt for the entire audio transcription</summary>
+<pre><code>$ subaligner -m transcribe -v video.mp4 -ml eng -mr whisper -mf turbo -ip "your initial prompt" -o subtitle_aligned.srt 
+</code></pre>
+</details>
+
+<details>
+<summary>Use the full subtitle content as a prompt</summary>
+<pre><code>$ subaligner -m transcribe -v video.mp4 -s subtitle.srt -ml eng -mr whisper -mf turbo -o subtitle_aligned.srt
+</code></pre>
+</details>
+
+<details>
+<summary>Use the previous subtitle segment as the prompt when transcribing the following segment</summary>
+<pre><code>$ subaligner -m transcribe -v video.mp4 -s subtitle.srt --use_prior_prompting -ml eng -mr whisper -mf turbo -o subtitle_aligned.srt
+</code></pre>
+</details>
+
+(For details on crafting prompts for transcription, please refer to the [Whisper prompting guide](https://cookbook.openai.com/examples/whisper_prompting_guide).)
+
+<details>
+<summary>Alignment on segmented plain texts (double newlines as the delimiter)</summary>
+<pre><code>$ subaligner -m script -v video.mp4 -s subtitle.txt -o subtitle_aligned.srt
+$ subaligner -m script -v https://example.com/video.mp4 -s https://example.com/subtitle.txt -o subtitle_aligned.srt
+</code></pre>
+</details>
 
-$ subaligner -m script -v video.mp4 -s subtitle.txt -o subtitle_aligned.srt
+<details>
+<summary>Generate raw JSON subtitles with per-word timings</summary>
+<pre><code>$ subaligner -m transcribe -v video.mp4 -ml eng -mr whisper -mf turbo -ip "your initial prompt" --word_time_codes -o raw_subtitle.json
 $ subaligner -m script -v video.mp4 -s subtitle.txt --word_time_codes -o raw_subtitle.json
-$ subaligner -m script -v https://example.com/video.mp4 -s https://example.com/subtitle.txt -o subtitle_aligned.srt
-```
-```
-# Alignment on multiple subtitles against the single media file
+</code></pre>
+</details>
+
 
-$ subaligner -m script -v video.mp4 -s subtitle_lang_1.txt -s subtitle_lang_2.txt
+<details>
+<summary>Alignment on multiple subtitles against a single media file</summary>
+<pre><code>$ subaligner -m script -v video.mp4 -s subtitle_lang_1.txt -s subtitle_lang_2.txt
 $ subaligner -m script -v video.mp4 -s subtitle_lang_1.txt subtitle_lang_2.txt
-```
-```
-# Alignment on embedded subtitles
+</code></pre>
+</details>
 
-$ subaligner -m single -v video.mkv -s embedded:stream_index=0 -o subtitle_aligned.srt
+<details>
+<summary>Alignment on embedded subtitles</summary>
+<pre><code>$ subaligner -m single -v video.mkv -s embedded:stream_index=0 -o subtitle_aligned.srt
 $ subaligner -m dual -v video.mkv -s embedded:stream_index=0 -o subtitle_aligned.srt
-```
-```
-# Translative alignment with the ISO 639-3 language code pair (src,tgt)
+</code></pre>
+</details>
 
-$ subaligner --languages
+<details>
+<summary>Translative alignment with the ISO 639-3 language code pair (src,tgt)</summary>
+<pre><code>$ subaligner --languages
 $ subaligner -m single -v video.mp4 -s subtitle.srt -t src,tgt
 $ subaligner -m dual -v video.mp4 -s subtitle.srt -t src,tgt
 $ subaligner -m script -v video.mp4 -s subtitle.txt -o subtitle_aligned.srt -t src,tgt
 $ subaligner -m dual -v video.mp4 -s subtitle.srt -tr helsinki-nlp -o subtitle_aligned.srt -t src,tgt
 $ subaligner -m dual -v video.mp4 -s subtitle.srt -tr facebook-mbart -tf large -o subtitle_aligned.srt -t src,tgt
 $ subaligner -m dual -v video.mp4 -s subtitle.srt -tr facebook-m2m100 -tf small -o subtitle_aligned.srt -t src,tgt
 $ subaligner -m dual -v video.mp4 -s subtitle.srt -tr whisper -tf small -o subtitle_aligned.srt -t src,eng
-```
-```
-# Transcribe audiovisual files and generate translated subtitles
+</code></pre>
+</details>
+
+<details>
+<summary>Transcribe audiovisual files and generate translated subtitles</summary>
+<pre><code>$ subaligner -m transcribe -v video.mp4 -ml src -mr whisper -mf small -tr helsinki-nlp -o subtitle_aligned.srt -t src,tgt
+</code></pre>
+</details>
 
-$ subaligner -m transcribe -v video.mp4 -ml src -mr whisper -mf small -tr helsinki-nlp -o subtitle_aligned.srt -t src,tgt
-```
-```
-# Shift subtitle manually by offset in seconds
 
-$ subaligner -m shift --subtitle_path subtitle.srt -os 5.5
+<details>
+<summary>Shift subtitle manually by offset in seconds</summary>
+<pre><code>$ subaligner -m shift --subtitle_path subtitle.srt -os 5.5
 $ subaligner -m shift --subtitle_path subtitle.srt -os -5.5 -o subtitle_shifted.srt
-```
-```
-# Run batch alignment against directories
+</code></pre>
+</details>
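
To illustrate what a manual shift does to each subtitle time code, here is a minimal sketch in shell and awk. It is not Subaligner's implementation: `shift_timecode` is a hypothetical helper, it handles a single SRT-style time code (`HH:MM:SS,mmm`), and clamping negative results to zero is an assumption.

```shell
# Shift one SRT time code (HH:MM:SS,mmm) by an offset given in seconds.
shift_timecode() {
  awk -v tc="$1" -v offset="$2" 'BEGIN {
    # Split "HH:MM:SS,mmm" into hours, minutes, seconds and milliseconds.
    split(tc, p, "[:,]")
    ms = ((p[1] * 60 + p[2]) * 60 + p[3]) * 1000 + p[4] + offset * 1000
    if (ms < 0) ms = 0        # assumption: never go before the start
    ms = int(ms + 0.5)        # round to the nearest millisecond
    h = int(ms / 3600000); ms -= h * 3600000
    m = int(ms / 60000);   ms -= m * 60000
    s = int(ms / 1000);    ms -= s * 1000
    printf "%02d:%02d:%02d,%03d\n", h, m, s, ms
  }'
}

shift_timecode "00:00:10,000" 5.5    # prints 00:00:15,500
shift_timecode "00:01:02,250" -5.5   # prints 00:00:56,750
```

A positive offset delays the cue and a negative one brings it forward, matching the sign convention of the `-os 5.5` and `-os -5.5` examples above.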
 
-$ subaligner_batch -m single -vd videos/ -sd subtitles/ -od aligned_subtitles/
+<details>
+<summary>Run batch alignment against directories</summary>
+<pre><code>$ subaligner_batch -m single -vd videos/ -sd subtitles/ -od aligned_subtitles/
 $ subaligner_batch -m dual -vd videos/ -sd subtitles/ -od aligned_subtitles/
 $ subaligner_batch -m dual -vd videos/ -sd subtitles/ -od aligned_subtitles/ -of ttml
-```
-```
-# Run alignments with pipx
+</code></pre>
+</details>
 
-$ pipx run subaligner -m single -v video.mp4 -s subtitle.srt
+<details>
+<summary>Run alignments with pipx</summary>
+<pre><code>$ pipx run subaligner -m single -v video.mp4 -s subtitle.srt
 $ pipx run subaligner -m dual -v video.mp4 -s subtitle.srt
-```
-```
-# Run the module as a script
-$ python -m subaligner -m single -v video.mp4 -s subtitle.srt
+</code></pre>
+</details>
+
+<details>
+<summary>Run the module as a script</summary>
+<pre><code>$ python -m subaligner -m single -v video.mp4 -s subtitle.srt
 $ python -m subaligner -m dual -v video.mp4 -s subtitle.srt
-```
-```
-# Run alignments with the docker image
+</code></pre>
+</details>
 
-$ docker pull baxtree/subaligner
+<details>
+<summary>Run alignments with the docker image</summary>
+<pre><code>$ docker pull baxtree/subaligner
 $ docker run -v `pwd`:`pwd` -w `pwd` -it baxtree/subaligner subaligner -m single -v video.mp4 -s subtitle.srt
 $ docker run -v `pwd`:`pwd` -w `pwd` -it baxtree/subaligner subaligner -m dual -v video.mp4 -s subtitle.srt
 $ docker run -it baxtree/subaligner subaligner -m single -v https://example.com/video.mp4 -s https://example.com/subtitle.srt -o subtitle_aligned.srt
 $ docker run -it baxtree/subaligner subaligner -m dual -v https://example.com/video.mp4 -s https://example.com/subtitle.srt -o subtitle_aligned.srt
-```
+</code></pre>
+</details>
+
+![](figures/screencast.gif)
+
 The aligned subtitle will be saved at `subtitle_aligned.srt`. To obtain the subtitle in raw JSON format for downstream
 processing, replace the output file extension with `.json`. For details on the CLIs, run `subaligner -h` or `subaligner_batch -h`;
 for the additional utilities, run `subaligner_convert -h`, `subaligner_train -h` and `subaligner_tune -h`. `subaligner_1pass` and `subaligner_2pass` are shortcuts for running `subaligner` with the `-m single` and `-m dual` options, respectively.
 
-![](figures/screencast.gif)
-
 ## Advanced Usage
-You can train a new model with your own audiovisual files and subtitle files:
-```
-$ subaligner_train -vd VIDEO_DIRECTORY -sd SUBTITLE_DIRECTORY -tod TRAINING_OUTPUT_DIRECTORY
-```
+You can train a new model with your own audiovisual files and subtitle files:
+<details>
+<summary>Train a custom model</summary>
+<pre><code>$ subaligner_train -vd VIDEO_DIRECTORY -sd SUBTITLE_DIRECTORY -tod TRAINING_OUTPUT_DIRECTORY
+</code></pre>
+</details>
+
 Then you can apply it to your subtitle synchronisation with the aforementioned commands. For more details on how to train and tune your own model, please refer to [Subaligner Docs](https://subaligner.readthedocs.io/en/latest/advanced_usage.html).
 
-For larger media files taking longer to process, you can reconfigure various timeouts using the following options:
-```
--mpt [Maximum waiting time in seconds when processing media files]
+For larger media files taking longer to process, you can reconfigure various timeouts using the following:
+<details>
+<summary>Options for tuning timeouts</summary>
+<pre><code>-mpt [Maximum waiting time in seconds when processing media files]
 -sat [Maximum waiting time in seconds when aligning each segment]
 -fet [Maximum waiting time in seconds when embedding features for training]
-```
+</code></pre>
+</details>
 
 ## Anatomy
 Subtitles can be out of sync with their companion audiovisual media files for a variety of causes including latency introduced by Speech-To-Text on live streams or calibration and rectification involving human intervention during post-production.