I want to ask about whisper ai #2529

cns00 · 2025-02-15T18:32:12Z

cns00
Feb 15, 2025

I have some questions about whisper ai.

I am following https://www.youtube.com/watch?v=ABFqbY_rmEk to install whisper ai. I have Windows 11 on my laptop.

I did the 5 steps in the video. Assume that the audio file is test.wav. I said:
whisper test.wav

It is downloading large-v3-turbo.pt since this is the first time that I ran the command. It is 1.5gb. The problem is that my internet is horrible. The download is failing. It reached 18% and then the download fails and I get an error message that says:
RuntimeError: Model has been downloaded but the SHA256 checksum does not not match. Please retry loading the model.

I tried it again. It reached 34% and failed. I tried it again. It reached 55% and failed. I even tested it by using a VPN and still no luck. In the end the download got stuck on say 34% and it's not proceeding. The download got stuck 3 times.

I saw #1027. They said that the issue was fixed if you deleted:
"C:\Users.cache\torch\whisperx-vad-segmentation.bin"
"C:\Users\USER-NAME.cache\torch\whisperx-vad-segmentation.bak"

I don't have those files. What do I do? How do I download large-v3-turbo.pt?

I have subtitle files for the non English parts of my movies in which lines show before\after a person starts speaking and they go away before\after a person stops speaking and the delays are noticeable.

I want to make the lines show exactly when a person starts speaking and they go away exactly when a person stops speaking. Can whisper do that?

Assume that I have an English movie that has French dialogue and I have a subtitle file for the French part. ChatGPT said that I can use whisper to generate a subtitle file for the movie and then the timing in that subtitle file will be used to fix the timing of my subtitle file and the command is:
python sync_subtitles.py my_movie.mp4 my_subtitles.srt

https://drive.google.com/file/d/1ShP5MmmOrr1CCuHPkPHhZgRKOVBzWyMt/view?usp=sharing is sync_subtitles.py that ChatGPT made.

After that it said this:
https://imgur.com/a/nCcZBm6

I said yes and it made for me a modified file which is modified_sync_subtitles.py and it is https://drive.google.com/file/d/1bvLnCeG9WUfXOydFCQWiUamj_-aS7sA_/view?usp=sharing. The modified script does this:
https://imgur.com/a/k0ql3OM

I want to ask is the coding in both scripts correct? Which script should I use?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

I want to ask about whisper ai #2529

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

I want to ask about whisper ai #2529

Uh oh!

Uh oh!

cns00 Feb 15, 2025

Replies: 0 comments

cns00
Feb 15, 2025