Replies: 2 comments 3 replies
-
Is there a difference in the quality of the two results?
…On Mon, Jul 10, 2023 at 5:23 PM Xi23000 ***@***.***> wrote:
Understanding Descript and Whisper are targeted to different demographics
I would like to understand why the abysmal difference in the speed it takes
to transcribe the same audio. (30 mins wav file).
I there particular factors that speed up / slow down transcription speed?
whisper audio.wav --model tiny
takes about 20 minutes
Descript takes about 110 secs
—
Reply to this email directly, view it on GitHub
<#1515>, or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAGW5AZHS3Q3HFF4XENIGBLXPSFGVANCNFSM6AAAAAA2FGEFVQ>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
You may have accidentally run Whisper in CPU mode, which is much slower.
If you run
Where it says (I think, in an install without a recognized GPU, Whisper would say
For example, a 30 minute audio... On my RTX 3060 on the latest Whisper master:
If you look at the table speeds given in the Whisper readme:
it would take:
CPU is much, much slower than GPU—and anything beyond
Yes, there are settings that can have a huge effect. For example:
See some benchmarks given back in October 2022:
By default, the commandline version of Whisper defaults to:
If you lower those, you can get huge speedups at the cost of accuracy (and more hallucinations), because Whisper will be checking less possible results against each other. For some more details on those 2 settings, see: Also, there are forks of Whisper, like: which lower the precision from fp32 -> fp16 or int8. This allows major speedups + massively lowered RAM/VRAM usage. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Understanding Descript and Whisper are targeted to different demographics I would like to understand why the abysmal difference in the speed it takes to transcribe the same audio. (30 mins wav file).
I there particular factors that speed up / slow down transcription speed?
whisper audio.wav --model tiny
takes about 20 minutes
Descript takes about 110 secs
Beta Was this translation helpful? Give feedback.
All reactions