Skip to content
Discussion options

You must be logged in to vote

I tried your wav file and the options were working fine:

$ ffmpeg -t 30 -i 123.wav 123-30.wav        # trim to 30 seconds
$ whisper --model tiny --word_timestamps True --max_line_width 20 --max_line_count 2 123-30.wav
$ cat 123-30.srt

1
00:00:01,380 --> 00:00:10,000
如果干证的哭吧,习惯功能充分替了他就会突
破我们的最后一道发现开始启动不得没应党

2
00:00:10,640 --> 00:00:17,140
清楚了吧?那就死不住了一代买子,即便偶
尔,没有完全被消化他都学首先是当我们的干

3
00:00:17,140 --> 00:00:24,520
仗,过了干仗才知道我们大循环干仗会提供给
积极这个新鲜他的个细胞叫哭吧,是吧这是他

4
00:00:24,520 --> 00:00:28,760
平日撤到东西,今天你看他撤到八百分不应党

Although the quality of the "tiny" model is lower, I am just testing the line widths here, and every line is indeed wrapped at a maximum of 20 characters and a maximum of 2 lines.

Replies: 1 comment 10 replies

Comment options

You must be logged in to vote
10 replies
@ryanheise
Comment options

@raccoonchiu
Comment options

@raccoonchiu
Comment options

@ryanheise
Comment options

Answer selected by raccoonchiu
@raccoonchiu
Comment options

@ryanheise
Comment options

@raccoonchiu
Comment options

@raccoonchiu
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants