Transcribing video file directly instead of its path #1641

ilyasbendev · 2023-09-01T06:28:47Z

ilyasbendev
Sep 1, 2023

Hello everyone,
I'm currently working with Whisper, and my goal is to transcribe a video file.
It works when I pass the file's path as input, but it doesn't work when I pass the file object directly.

It works when I'm dealing with audio files. This is the code I wrote for the audio case:

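import io

import librosa
import whisper

model = whisper.load_model("base")

# `file` is an already-open binary file-like object (for example, an upload).
buf = io.BytesIO(file.read())
# Whisper expects 16 kHz audio; librosa.load defaults to 22050 Hz,
# so request 16000 Hz directly instead of resampling afterwards.
data, _ = librosa.load(buf, sr=16000)

model.transcribe(data)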

Does anyone have an idea how to do the same with videos? (I tried extracting the audio from the video and then running the code above, but it's not working.)

Answered by threeal

Sep 2, 2023

threeal · 2023-09-02T14:20:39Z

threeal
Sep 2, 2023

I tried using the decode_audio function from faster_whisper to transcribe audio from a WebM file, and it worked. I'm not 100% sure it will work with other video formats, but one thing is certain: decode_audio uses FFmpeg through the av library (PyAV) to load audio files. If it doesn't work with another video format, you can look at the implementation of that function to see why.

from faster_whisper.audio import decode_audio

data = decode_audio(file)
model.transcribe(data)

1 reply

ilyasbendev Sep 8, 2023
Author

Thank you @threeal for your response.
Yes, the idea is to use FFmpeg to convert the video to audio.
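
For anyone who wants to call FFmpeg directly rather than go through faster_whisper, here is a minimal sketch of that idea. It assumes the ffmpeg executable is on PATH, and the names ffmpeg_decode_command and decode_to_pcm are hypothetical helpers made up for this example, not part of Whisper or faster_whisper. It pipes the in-memory file to ffmpeg on stdin and reads back raw 16 kHz mono PCM on stdout.

```python
import subprocess
from typing import BinaryIO, List

SAMPLE_RATE = 16000  # Whisper models expect 16 kHz mono audio


def ffmpeg_decode_command(sample_rate: int = SAMPLE_RATE) -> List[str]:
    """Build an ffmpeg command that reads media on stdin and writes raw PCM on stdout."""
    return [
        "ffmpeg",
        "-hide_banner",
        "-loglevel", "error",
        "-i", "pipe:0",            # read the container (video or audio) from stdin
        "-vn",                     # drop any video stream
        "-ac", "1",                # downmix to mono
        "-ar", str(sample_rate),   # resample to the requested rate
        "-f", "s16le",             # raw signed 16-bit little-endian samples
        "pipe:1",                  # write the samples to stdout
    ]


def decode_to_pcm(file: BinaryIO, sample_rate: int = SAMPLE_RATE) -> bytes:
    """Hypothetical helper: decode a file-like object's audio track to raw PCM bytes."""
    completed = subprocess.run(
        ffmpeg_decode_command(sample_rate),
        input=file.read(),
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        check=True,  # raise CalledProcessError if ffmpeg exits with an error
    )
    return completed.stdout
```

The returned bytes can then be turned into the float32 array that model.transcribe accepts with np.frombuffer(raw, np.int16).astype(np.float32) / 32768.0, which, as far as I know, is the same scaling Whisper's own load_audio applies. One caveat: some containers (for example MP4 files whose index sits at the end of the file) may not decode from a non-seekable pipe, in which case writing the bytes to a temporary file first is a possible fallback.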
