I used Whisper to transcribe, but the result is a long blob of text, not in dialog format. [...] Is Whisper capable of transcribing a conversation into dialog format?

"Person A: ..."
"Person B: ..."

No. Whisper can't currently do this.

The technical term for this is:

  • Speaker Diarization.

You can find discussion on this in other topics:

Currently, the closest answer was given by jongwook: you can "hack" initial_prompt with hyphens to nudge Whisper into potentially outputting dashes between speakers.

[...] You can also use this for "prompt engineering", to in…
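For anyone who wants to try the hyphen trick, here is a minimal sketch of how it might look with the openai-whisper Python package. The helper names (`build_dash_prompt`, `split_on_dashes`, `transcribe_with_dashes`), the example turns, and the `"base"` model choice are my own assumptions for illustration, not something from jongwook's answer. Note that the dashes only mark a possible change of speaker; they do not tell you who is speaking, and Whisper may ignore the prompt entirely.

```python
import re


def build_dash_prompt(example_turns):
    """Build an initial_prompt in which every turn starts with a hyphen.

    The idea is to show Whisper the style we would like it to continue.
    """
    return " ".join(f"- {turn.strip()}" for turn in example_turns)


def split_on_dashes(text):
    """Split transcribed text into turns wherever a standalone hyphen appears.

    A hyphen only counts as a separator when it is at the start of the text
    or preceded by whitespace, and followed by whitespace, so hyphenated
    words such as "well-known" are left alone.
    """
    parts = re.split(r"(?:^|\s)-\s+", text)
    return [part.strip() for part in parts if part.strip()]


def transcribe_with_dashes(audio_path, model_name="base"):
    """Transcribe a file while nudging Whisper toward dash-separated turns.

    Assumes the openai-whisper package is installed; the import is kept
    inside the function so the helpers above work without it.
    """
    import whisper

    model = whisper.load_model(model_name)
    # Hypothetical example turns; any short dash-prefixed dialog should do.
    prompt = build_dash_prompt(["Hello, how are you?", "I'm fine, thanks."])
    result = model.transcribe(audio_path, initial_prompt=prompt)
    return split_on_dashes(result["text"])
```

Calling `transcribe_with_dashes("conversation.mp3")` (a placeholder file name) would return a list of turns if Whisper follows the prompt, or a single long string in a one-element list if it does not. The split is deliberately simple, so a spoken " - " used as a pause would also be treated as a turn boundary.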

Answer selected by fanyangxyz