-
That is to be expected: CPU is very slow, particularly on the larger models. Something like an NVIDIA GTX 1660 would be ~10x the speed of a 6/8-core CPU. I've found both the base and small models to be very accurate (for English), with most differences between models being punctuation (the tiny model does tend to have more errors, though still not that many). On your CPU the base model may be close to 1x realtime. On my Ryzen 4500U, the small.en model transcribes at about 0.10x realtime, so if the medium model is 0.05x realtime on your Ryzen 3600, that sounds about right.

You can also try the OpenVINO version: https://github.com/zhuzilin/whisper-openvino. On the base.en model it's about twice as fast for me. Another option, depending on how much you have to transcribe and any data-security concerns, is to run Whisper in a free Google Colab GPU instance, which ran at about 8x realtime for me on the small.en model. There is also another project, https://github.com/ggerganov/whisper.cpp, which is much, much faster on CPU. I assume it is trading speed for accuracy, but I don't understand how/where accuracy is reduced.
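If you want to measure the realtime factor on your own machine, here is a minimal sketch (assuming the openai-whisper Python package; "audio.mp3" is a placeholder for your own file):

```python
import time

import whisper

model = whisper.load_model("small.en")  # or "tiny.en", "base.en", "medium", ...

audio = whisper.load_audio("audio.mp3")            # placeholder file name
duration = len(audio) / whisper.audio.SAMPLE_RATE  # samples at 16 kHz

start = time.perf_counter()
model.transcribe(audio, fp16=False)  # FP32 is the normal mode on CPU
elapsed = time.perf_counter() - start

print(f"{duration:.1f}s of audio in {elapsed:.1f}s "
      f"= {duration / elapsed:.2f}x realtime")
```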
-
An RTX 3060 is 22 times faster than my 16-thread i7-10700F @ 4.5 GHz. Draw your own conclusions :)
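To reproduce that comparison on your own hardware, a rough sketch (assuming openai-whisper, a CUDA-capable PyTorch build, and a local "audio.mp3"):

```python
import time

import torch
import whisper

devices = (["cuda"] if torch.cuda.is_available() else []) + ["cpu"]
for device in devices:
    model = whisper.load_model("base.en", device=device)
    start = time.perf_counter()
    model.transcribe("audio.mp3", fp16=(device == "cuda"))  # FP16 only on GPU
    print(f"{device}: {time.perf_counter() - start:.1f}s")
```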
-
No changes made to Whisper, and we still get great acceleration on CPU: using 11 of 16 CPUs with the "tiny.en" model, a transcription speed of 32.713x realtime. Machine: MacBook, macOS Big Sur, 2.3 GHz Intel Core i9, 16 cores, 16 GB of RAM.
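One way to cap Whisper at a subset of your CPUs is PyTorch's thread controls; a minimal sketch (this is my assumption about how core usage was limited above, not a confirmed detail):

```python
import torch
import whisper

torch.set_num_threads(11)  # cap PyTorch's intra-op thread pool at 11 of 16 CPUs

model = whisper.load_model("tiny.en")
result = model.transcribe("audio.mp3", fp16=False)  # placeholder file name
print(result["text"])
```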
-
I tried Whisper on a Jetson TX2 developer kit (Ubuntu 18.04, NVIDIA JetPack 4.6.3, Python 3.8, PyTorch 2.0, CUDA 10.2) with a 2-second mp3, and it took 12 seconds to transcribe with the --language en --model tiny parameters. I was not able to use CUDA due to compatibility issues with PyTorch 2.0, and installing the PyTorch 1.10 NVIDIA wheel for Python 3.8 was also unsuccessful. Does anyone know how to utilise CUDA on this device?
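In case it helps anyone debugging the same setup, a minimal check (not Jetson-specific) of whether the installed PyTorch build can actually see CUDA before Whisper falls back to CPU:

```python
import torch
import whisper

cuda_ok = torch.cuda.is_available()
print(f"PyTorch {torch.__version__}, CUDA available: {cuda_ok}")

device = "cuda" if cuda_ok else "cpu"
model = whisper.load_model("tiny", device=device)
result = model.transcribe("audio.mp3", language="en", fp16=cuda_ok)
print(result["text"])
```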
-
Hello,
I know that transcribing on GPU will be much faster than on CPU, but even so, I have a server with a Ryzen 5 3600 and the transcription times are abysmally long.
I am transcribing Polish with the medium model, and a file of 31 seconds takes 11 minutes and 16 seconds, with the CPU maxed out on 6 threads.
Also, every time I run it I get a warning that FP16 is not supported on CPU and FP32 is used instead; could that be the cause of the slowness?
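For reference, this is roughly what I am running (a minimal sketch of the Python API; the file name is a placeholder, and passing fp16=False just makes the FP32 fallback explicit and silences the warning):

```python
import whisper

model = whisper.load_model("medium")
# fp16=False is equivalent to the CLI's --fp16 False and suppresses the warning
result = model.transcribe("audio.mp3", language="pl", fp16=False)
print(result["text"])
```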