-
Hi all, Thanks for the great library. I am trying to segment audio with the import torch
from pyannote.audio.features import RawAudio
pipeline = torch.hub.load("pyannote/pyannote-audio", "dia")
OWN_FILE = {"audio": input_file}
diarization = pipeline(OWN_FILE)
segments = diarization.for_json()["content"] However, I need to specify minimum and maximum durations for segments. I checked the parameters of the parts of the pipeline and could not find a solution. Do you have any idea? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
There is no easy way to do that (though I am not sure why you would need to specify a maximum duration). FYI, pyannote.audio 1.x will soon be superseded by pyannote.audio 2.0 with a much better speaker diarization pipeline, which has a |
Beta Was this translation helpful? Give feedback.
There is no easy way to do that (though I am not sure why you would need to specify a maximum duration).
FYI, pyannote.audio 1.x will soon be superseded by pyannote.audio 2.0 with a much better speaker diarization pipeline, which has a
min_duration_on
hyper-parameter that could you use to set the minimum duration of speech turns.