What file formats are supported for input? #799
Just wondering what file types the model supports. Are Ogg Vorbis files acceptable input?
Replies: 3 comments 11 replies
Whisper loads audio with ffmpeg (see whisper/whisper/audio.py, line 22 in 28769fc), so any input type that ffmpeg can decode should work. Note that you can also pass a container that holds both audio and video, such as .mp4.
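To illustrate why the input only needs to be something ffmpeg can decode, here is a minimal sketch of decoding an arbitrary file to mono PCM through ffmpeg. It is not Whisper's actual code: the names `build_ffmpeg_cmd`, `pcm_to_floats`, and `load_audio` are hypothetical, and the 16 kHz sample rate is an assumption of this sketch. It also assumes an `ffmpeg` binary is on your PATH.

```python
import subprocess
import sys
from array import array

SAMPLE_RATE = 16000  # assumed target rate for this sketch


def build_ffmpeg_cmd(path, sr=SAMPLE_RATE):
    """Build an ffmpeg command that decodes `path` to raw mono PCM on stdout."""
    return [
        "ffmpeg",
        "-nostdin",              # never wait for interactive input
        "-i", path,              # any container/codec that ffmpeg can read
        "-f", "s16le",           # raw signed 16-bit little-endian samples
        "-acodec", "pcm_s16le",
        "-ac", "1",              # downmix to a single channel
        "-ar", str(sr),          # resample to the target rate
        "-",                     # write the result to stdout
    ]


def pcm_to_floats(raw):
    """Convert s16le bytes to a list of floats in [-1.0, 1.0)."""
    samples = array("h")
    samples.frombytes(raw)
    if sys.byteorder == "big":
        samples.byteswap()       # the data is little-endian by construction
    return [s / 32768.0 for s in samples]


def load_audio(path, sr=SAMPLE_RATE):
    """Decode `path` with ffmpeg; raises CalledProcessError if ffmpeg fails."""
    result = subprocess.run(build_ffmpeg_cmd(path, sr), capture_output=True, check=True)
    return pcm_to_floats(result.stdout)
```

With this shape, `load_audio("talk.ogg")` and `load_audio("clip.mp4")` go through the same path; whether a given file works depends on the codecs your ffmpeg build was compiled with.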
Could the Whisper API add support for OGG/OGA and FLAC? According to https://platform.openai.com/docs/api-reference/audio/create#audio/create-file , the official API does not currently support these two very nice open-source formats. I have tons of recordings in them, and they are the default in a lot of software and on many devices. As the example snippets show, OGG/OGA is supported:
( from https://stackoverflow.com/a/62267248/ )
( from https://stackoverflow.com/a/23380032/544721 )
Can we please add support for OGG/OGA and FLAC to the Whisper API?
Line 46 in c09a7ae
Otherwise, I would still be thankful if you made sure these formats work well with github.com/openai/whisper for local deployments.
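Until a hosted endpoint accepts a given format, one possible workaround is to transcode locally with ffmpeg before uploading. The sketch below is only an illustration under stated assumptions: the helper names `build_transcode_cmd` and `transcode_to_wav` are hypothetical, WAV at 16 kHz mono is an arbitrary choice of target, and `ffmpeg` must be installed.

```python
import shutil
import subprocess
from pathlib import Path


def build_transcode_cmd(src, sr=16000):
    """Build an ffmpeg command that converts `src` to a mono WAV next to it."""
    dst = Path(src).with_suffix(".wav")
    return [
        "ffmpeg",
        "-y",            # overwrite the output file if it already exists
        "-i", str(src),  # e.g. an .ogg, .oga, or .flac recording
        "-vn",           # drop any video stream
        "-ac", "1",      # mono
        "-ar", str(sr),  # target sample rate (an assumption of this sketch)
        str(dst),
    ]


def transcode_to_wav(src, sr=16000):
    """Run the conversion and return the path of the new WAV file."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg was not found on PATH")
    cmd = build_transcode_cmd(src, sr)
    subprocess.run(cmd, capture_output=True, check=True)
    return cmd[-1]
```

For local deployments this step should not be needed, since the file goes straight through ffmpeg anyway.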
.opus and .amr are supported, too (both are used by messaging apps).