Whisper - change output to srt with timings #1880

Bazza195 · 2023-12-07T10:01:53Z

Bazza195
Dec 7, 2023

I have python script which transcribes .mp4 and outputs the transcript in text format. However i want the output to be in .srt with the timings. How can i do this?

`import os
import whisper
from tqdm import tqdm

Define the folder where the mp4 files are located

root_folder = "C:\Video"

Set up Whisper client

print("Loading whisper model...")
model = whisper.load_model("base")
print("Whisper model complete.")

Get the number of mp4 files in the root folder and its sub-folders

print("Getting number of files to transcribe...")
num_files = sum(1 for dirpath, dirnames, filenames in os.walk(root_folder) for filename in filenames if filename.endswith(".mp4"))
print("Number of files: ", num_files)

Transcribe the mp4 files and display a progress bar

with tqdm(total=num_files, desc="Transcribing Files") as pbar:
for dirpath, dirnames, filenames in os.walk(root_folder):
for filename in filenames:
if filename.endswith(".mp4"):
filepath = os.path.join(dirpath, filename)
result = model.transcribe(filepath, fp16=False, verbose=True)
transcription = result['text']
# Write transcription to text file
filename_no_ext = os.path.splitext(filename)[0]
with open(os.path.join(dirpath, filename_no_ext + '.txt'), 'w') as f:
f.write(transcription)
pbar.update(1)`

ryanheise · 2023-12-07T10:06:50Z

ryanheise
Dec 7, 2023

See for example #1586

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Whisper - change output to srt with timings #1880

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Whisper - change output to srt with timings #1880

Uh oh!

Bazza195 Dec 7, 2023

Define the folder where the mp4 files are located

Set up Whisper client

Get the number of mp4 files in the root folder and its sub-folders

Transcribe the mp4 files and display a progress bar

Replies: 1 comment

Uh oh!

ryanheise Dec 7, 2023

Bazza195
Dec 7, 2023

ryanheise
Dec 7, 2023