adjusting srt length in chinese #1332
-
Hello, I'm currently utilizing Whisper for transcribing a video file. I'm working with the large-v2 model and transcribing in Chinese (Traditional), using the command: whisper 1111.mp4 --model large-v2 --language "Chinese" --initial_prompt "one talk one line" The transcription I'm receiving is accurate, but each caption is quite long, which is not ideal for my application. I've tried various methods to adjust the caption length, but I haven't found an effective solution. Here's an example of the output I'm currently getting: Any help or guidance would be greatly appreciated! Thanks in advance. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 10 replies
-
There are some options in the latest git version to wrap at a fixed line width and line count:
If you want a more custom solution, you could do things such as spitting after each comma if you write your own custom python code (although note this quickly gets complicated since there may still be long sentences without commas and you'd need to use something like spaCy to figure out more logical places to split that indicate some kind of semantic boundary.) |
Beta Was this translation helpful? Give feedback.
I tried your wav file and the options were working fine:
Although the quality of the "tiny" model is lower, I am just testing the line widths here, and every line is indeed wrapped at a maximum of 20 characters and a maximum of 2 lines.