Replies: 2 comments
-
You shoud be able to just let the library do the trimming and padding by writing: import whisper
file = "audio.mp3"
model = whisper.load_model("base")
result = model.transcribe(file, fp16 = False})
print(result["text"])
|
Beta Was this translation helpful? Give feedback.
0 replies
-
heyy!! @hansolGib I am facing the same problem that whisper is not fully transcribing, did you get the solution?? please do tell me as well. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
hello, i'm trying to access lower-level to transcribe more than 30 sec audio file.
Since pad_or_trim only cut it down to 30 second, i decided to split the audio array into sublist, and then give it to pad_or_trim using for loop so that i can transcribe the whole audio.
It's kinda successful, but i found, between the results of sublists, there are some speech which wasn't transcribed.
How can i fully transcribe the audio longer than 30sec in lower level?
below is my code
Beta Was this translation helpful? Give feedback.
All reactions