Skip to content
Discussion options

You must be logged in to vote

If you're referring to the <|notimestamps|> token, the command line interface (and the transcribe() function) does not have an option to do so because it uses the timestamps to determine the next 30-second window to run transcription.

If you're only interested in short audio segments (shorter than 30s), you can use decode() with without_timestamps=True:

whisper/whisper/decoding.py

Lines 107 to 108 in 248b6cb

# timestamp sampling options
without_timestamps: bool = False # use <|notimestamps|> to sample text tokens only

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by jongwook
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants