I had this idea during this use case:
- I needed to transcribe an 1-hour long interview in Brazilian Portuguese (the file was 80MB+ and was in M4A format)
- I split the original file in 5 OGG parts using ffmpeg (~8MB each part)
- I sent the files to the bot, received lots of 4k-chars messages in reply and copied to a text editor (this was boring)
- I found some errors when reading the text in the editor, but it was hard to find the error chunk in the audio file (so I could listen and fix it manually)
Being able to download the transcription in a text file will solve problem in item 3. Using a subtitle file format (like srt) would help a lot in item 4. The behavior of attaching the file could be triggered automatically for files longer than 1 minute.
I'm willing to implement this feature if the maintainers accept the proposal.