Replies: 1 comment 2 replies
-
They're both being worked on, I think hallucinations is the biggest issue at the moment but I think pretty soon it will be solved to a large extent. I'm pretty optimistic about this PR for example: #1155 |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
The main issues I see with it are hallucinations during silence and word timings being off. Other than those two things, for ASR, it's pretty impressive when compared to AWS Transcribe.
Is there any effort to improve either of the above issues internally with whisper? Or are these baked into how it works, and any solutions for either of them will require using custom post-processing or third-party tools (ie, whisperX, stable-ts)?
Beta Was this translation helpful? Give feedback.
All reactions