Differences in Whisper Results When Executing via Code vs. Console Command with the Same Parameters #1892
Igarugueri
started this conversation in
Show and tell
Hello whisper community,
I am encountering unexpected behaviour when using Whisper, OpenAI's voice-to-text transcription model. I've noticed that I get different results when running the model through a Python script compared to direct execution in the console, even though I am using the same parameters in both cases.
-Parameters Used-
Execution Environment: Running on Windows 11, Python 3.9.9
Whisper Version: 20231117
Parameters Used:
In the script: (file_path: str, model_size: str = "small", word_timestamps: bool = True, language: str = "Spanish", translate: bool = True)
In the console: > whisper file.wav --word_timestamps True --language es --task translate --model small
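For comparison, here is a minimal sketch of how the console command above might be reproduced through the Python API. It is not the script from this post. The helper names `cli_equivalent_options` and `transcribe_like_cli` are hypothetical, and the decoding defaults in it (`beam_size=5`, `best_of=5`, the temperature fallback schedule) are what I understand the `whisper` command to use when those flags are omitted, which is worth checking against `whisper --help` for the installed version. The point it illustrates: `model.transcribe()` only applies decoding options that are passed explicitly, so a script that passes just the language, task and word timestamps may decode differently from the console.

```python
def cli_equivalent_options(language="es", task="translate", word_timestamps=True):
    """Build keyword arguments for model.transcribe() that mirror the console command.

    Hypothetical helper. The decoding values below are assumptions about the
    CLI defaults; verify them with `whisper --help`.
    """
    return {
        "language": language,              # --language es
        "task": task,                      # --task translate
        "word_timestamps": word_timestamps,  # --word_timestamps True
        # Assumed CLI defaults that a plain transcribe() call would not set:
        "beam_size": 5,                    # beam search when temperature is 0
        "best_of": 5,                      # candidates when sampling (temperature > 0)
        "temperature": (0.0, 0.2, 0.4, 0.6, 0.8, 1.0),  # fallback schedule
    }


def transcribe_like_cli(file_path, model_size="small"):
    """Run Whisper from Python with options intended to match the console run."""
    import whisper  # imported here so the options helper works without the package

    model = whisper.load_model(model_size)
    return model.transcribe(file_path, **cli_equivalent_options())


if __name__ == "__main__":
    # Show the options that would be passed; no model is loaded here.
    for name, value in cli_equivalent_options().items():
        print(f"{name} = {value!r}")
```

Calling `transcribe_like_cli("file.wav")` and comparing its `"text"` field with the console output would show whether the decoding options account for the difference.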
-Issue-
The transcription output differs between running Whisper through a Python script and executing the same command directly in the console:
This discrepancy is puzzling, especially since the parameters and environment are consistent across both methods of execution.
I would like to understand why there is this discrepancy. Could it be due to differences in the execution environment, or is there something else I might be overlooking?
I appreciate any guidance or suggestions you can provide to help me solve this mystery.
Thank you in advance for your time and help!
Best regards,
Igarugueri.
Note: Additional Audio File Details
Just as an additional note, here are some key details of the audio file (file.wav) I am using for the transcription: