You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I did the 5 steps in the video. Assume that the audio file is test.wav. I said:
whisper test.wav
It is downloading large-v3-turbo.pt since this is the first time that I ran the command. It is 1.5gb. The problem is that my internet is horrible. The download is failing. It reached 18% and then the download fails and I get an error message that says:
RuntimeError: Model has been downloaded but the SHA256 checksum does not not match. Please retry loading the model.
I tried it again. It reached 34% and failed. I tried it again. It reached 55% and failed. I even tested it by using a VPN and still no luck. In the end the download got stuck on say 34% and it's not proceeding. The download got stuck 3 times.
I saw #1027. They said that the issue was fixed if you deleted:
"C:\Users.cache\torch\whisperx-vad-segmentation.bin"
"C:\Users\USER-NAME.cache\torch\whisperx-vad-segmentation.bak"
I don't have those files. What do I do? How do I download large-v3-turbo.pt?
I have subtitle files for the non English parts of my movies in which lines show before\after a person starts speaking and they go away before\after a person stops speaking and the delays are noticeable.
I want to make the lines show exactly when a person starts speaking and they go away exactly when a person stops speaking. Can whisper do that?
Assume that I have an English movie that has French dialogue and I have a subtitle file for the French part. ChatGPT said that I can use whisper to generate a subtitle file for the movie and then the timing in that subtitle file will be used to fix the timing of my subtitle file and the command is:
python sync_subtitles.py my_movie.mp4 my_subtitles.srt
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I have some questions about whisper ai.
I am following https://www.youtube.com/watch?v=ABFqbY_rmEk to install whisper ai. I have Windows 11 on my laptop.
I did the 5 steps in the video. Assume that the audio file is test.wav. I said:
whisper test.wav
It is downloading large-v3-turbo.pt since this is the first time that I ran the command. It is 1.5gb. The problem is that my internet is horrible. The download is failing. It reached 18% and then the download fails and I get an error message that says:
RuntimeError: Model has been downloaded but the SHA256 checksum does not not match. Please retry loading the model.
I tried it again. It reached 34% and failed. I tried it again. It reached 55% and failed. I even tested it by using a VPN and still no luck. In the end the download got stuck on say 34% and it's not proceeding. The download got stuck 3 times.
I saw #1027. They said that the issue was fixed if you deleted:
"C:\Users.cache\torch\whisperx-vad-segmentation.bin"
"C:\Users\USER-NAME.cache\torch\whisperx-vad-segmentation.bak"
I don't have those files. What do I do? How do I download large-v3-turbo.pt?
I have subtitle files for the non English parts of my movies in which lines show before\after a person starts speaking and they go away before\after a person stops speaking and the delays are noticeable.
I want to make the lines show exactly when a person starts speaking and they go away exactly when a person stops speaking. Can whisper do that?
Assume that I have an English movie that has French dialogue and I have a subtitle file for the French part. ChatGPT said that I can use whisper to generate a subtitle file for the movie and then the timing in that subtitle file will be used to fix the timing of my subtitle file and the command is:
python sync_subtitles.py my_movie.mp4 my_subtitles.srt
https://drive.google.com/file/d/1ShP5MmmOrr1CCuHPkPHhZgRKOVBzWyMt/view?usp=sharing is sync_subtitles.py that ChatGPT made.
After that it said this:
https://imgur.com/a/nCcZBm6
I said yes and it made for me a modified file which is modified_sync_subtitles.py and it is https://drive.google.com/file/d/1bvLnCeG9WUfXOydFCQWiUamj_-aS7sA_/view?usp=sharing. The modified script does this:
https://imgur.com/a/k0ql3OM
I want to ask is the coding in both scripts correct? Which script should I use?
Beta Was this translation helpful? Give feedback.
All reactions