speaker diarization and whisper segment #1255

Atefeh197 · 2023-04-18T19:21:52Z

Hi

I want to run speaker diarization on Whisper's output.
I know Whisper returns a list of segments for each audio file, in this form:
'segments': [{'seek': 0.0, 'start': 0.46, 'end': 1.98, 'text': ' Hi, how are you', ......

My plan is to extract an embedding for each segment and then use a diarization model to label the segments by speaker.
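A minimal sketch of that plan, under stated assumptions: `embed` is a hypothetical callable that returns a speaker embedding (a list of floats) for the audio between two timestamps, and it stands in for whatever embedding model is actually used. The greedy cosine-similarity clustering and the `threshold` value are illustrative choices, not something Whisper provides.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def label_segments(segments, embed, threshold=0.75):
    """Assign a speaker label to each Whisper-style segment.

    `embed(start, end)` is a hypothetical function returning an embedding
    for that time span. Each segment joins the most similar existing
    speaker centroid, or starts a new speaker if none reaches `threshold`.
    """
    centroids = []  # one (sum_vector, count) pair per speaker found so far
    labeled = []
    for seg in segments:
        vec = embed(seg["start"], seg["end"])
        best, best_sim = None, threshold
        for i, (total, n) in enumerate(centroids):
            sim = cosine(vec, [t / n for t in total])
            if sim >= best_sim:
                best, best_sim = i, sim
        if best is None:
            centroids.append((list(vec), 1))
            best = len(centroids) - 1
        else:
            total, n = centroids[best]
            centroids[best] = ([t + v for t, v in zip(total, vec)], n + 1)
        labeled.append({**seg, "speaker": f"SPEAKER_{best}"})
    return labeled
```

This treats every segment as belonging to exactly one speaker, so it only works well if that assumption holds for the segments in question.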

I would like to know whether the segments Whisper produces are based on speaker change detection or something similar.
In other words, does each segment contain only one speaker?
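One way to check this empirically is to compare each segment against speaker turns from a separate diarization tool. As far as I know, Whisper's segment boundaries come from timestamp tokens predicted by the model rather than from a speaker change detector, so a single-speaker segment is not guaranteed. The sketch below assumes a hypothetical `turns` list of dicts with `start`, `end`, and `speaker` keys; the `min_overlap` cutoff is an arbitrary illustrative value.

```python
def speakers_in_segment(segment, turns, min_overlap=0.2):
    """Return {speaker: overlap_seconds} for one Whisper-style segment.

    `turns` is a hypothetical list of diarization turns, each a dict with
    "start", "end", and "speaker". Speakers whose total overlap with the
    segment is below `min_overlap` seconds are ignored.
    """
    found = {}
    for turn in turns:
        overlap = min(segment["end"], turn["end"]) - max(segment["start"], turn["start"])
        if overlap > 0:
            found[turn["speaker"]] = found.get(turn["speaker"], 0.0) + overlap
    return {spk: dur for spk, dur in found.items() if dur >= min_overlap}
```

A segment for which this returns more than one speaker spans a speaker change and would need to be split (for example at word level) before labeling.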

HeadStudios · 2023-05-11T06:28:54Z

Following.

1 reply

Majdoddin Jul 21, 2023

@Atefeh197 @HeadStudios
www.lexicaps.com seamlessly adds diarization to Whisper's transcription. No third-party packages.
Announcement: #1537
Repo: https://github.com/Majdoddin/lexicaps
