Skip to content

sgnoob/subtitles

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 

Repository files navigation

subtitles

Generate JP and EN subtitles

Open All Colab


How does this work?

1. Extract vocals from input for better transcription

Using Mel-Band-Roformer.

2. Generate Transcription

Using a variant of OpenAI Whisper.

3. Translate to English

Using Google Gemini.


How to use the Colab file?

1. Choose an Input Method

Option A - Download via yt-dlp

image

Option B - Google Drive

  • Upload the video to Google Drive.
  • Then specify the path to the file in Google Drive.
  • Keep it simple. No spaces or special characters. No effort was made to accomodate those. image

2. Choose an Output folder in your Google Drive

image

3. Input Google Gemini API key(s)

You need at least 1 key. If desired, you can specify 2 keys from different accounts to try to get around Google free API limits.

Option A - Just fill in the keys.

image

Option B - Use Colab's Secret (Better)

Put your keys in Colab's Secret by clicking the key icon. Make sure that Notebook access is enabled. Key names are GEMINI_API_KEY_1 and GEMINI_API_KEY_2.

image

4. Run

A. Make sure that you enable GPU.

image image

B. Run.

image

About

Generate JP and EN subtitles

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published