-
At present I have a collection of audio clips ranging from a couple seconds to half a minute to a few minutes. To process them I get the file duration and based on that information determine what I'd think is an appropriate and respective model. As an example, "tiny.en" on clips that are a couple seconds and "large" on the ones that are a few minutes. If it's better to just use the "upper bound" then I'm happy to stick to it. I'm also using a prompt to fix grammar and proper nouns throughout the transcripts. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
If you have the horsepower, I wouldn't consider using anything below a medium model regardless of audio length. You can compare tiny vs large in a direct comparison on the same file(s). In my experience, the quality difference has been substantial. |
Beta Was this translation helpful? Give feedback.
If you have the horsepower, I wouldn't consider using anything below a medium model regardless of audio length.
You can compare tiny vs large in a direct comparison on the same file(s). In my experience, the quality difference has been substantial.