large-v3 behaves weird #1806

abg149 · 2023-11-14T11:44:53Z

abg149
Nov 14, 2023

Hi! I've been using Whisper for a while now, it's wonderful :D
I've been testing this new model for a couple of days and it's not working as intended.
Using the same source of audio with v2 and v3 results in drastically different outputs.
The most notable difference is that when it doesn't recognize a phrase, proceeds to repeat it many times.
This happened sometimes with v2, but in this case the timestamps are the same through multiple lines and the repetitions last longer.
Another thing is that it works better with automatic language detection than with the language of the video itself. That is, even though the video is in Spanish, it works better if I put auto on it.
If anyone has an explanation for why is this happening and perhaps needs more specific information, I'll be glad to give it. I haven't done it now since the only change I make to the code is changing from large-v2 to large-v3, so it shouldn't be a code problem.

ThioJoe · 2023-11-29T02:15:34Z

ThioJoe
Nov 29, 2023

Honestly Large-V3 seems to suck in my experience. I tried it on my latest video and it hallucinated in multiple places, which V2 almost never does. Sometimes V2 mishears a word or something, but it is very very rare that it completely hallucinates.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

large-v3 behaves weird #1806

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

large-v3 behaves weird #1806

Uh oh!

Uh oh!

abg149 Nov 14, 2023

Replies: 1 comment

Uh oh!

ThioJoe Nov 29, 2023

abg149
Nov 14, 2023

ThioJoe
Nov 29, 2023