Howto / Allow Inject initial embeddings inside pipeline cluster #1764
heralight
started this conversation in
Development
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Help Needed: Reinjection of Previous Embeddings in
pyannote/speaker-diarization-3.1
PipelineIssue Type:
🆘 Support Request
Description
Yet another question on how identify speaker on multiple source...
Hello PyAnnote Team,
I am using the
pyannote/speaker-diarization-3.1
pipeline for speaker diarization in my project. I aim to improve speaker consistency across multiple audio chunks by reinjecting embeddings from a previous chunk into the pipeline when processing the next chunk.Attempts Made:
Modifying Clustering:
SpeakerDiarization
pipeline to store and utilize previous embeddings and cluster centroids.compute_embeddings
andcluster
methods to concatenate previous embeddings and initialize KMeans with prior centroids.Using Hooks:
initial_embeddings
during the second call to maintain speaker identity.Example Use Case:
Specifically, I ideally want to perform the following:
Guidance on Reinjection Mechanism:
How can I effectively pass initial_embeddings from one pipeline call to the next to maintain speaker consistency?
Pipeline Customization:
Example Implementation:
Environment
pyannote.audio version: 3.1
Python version: 3.11
Torch version: 2.4.1+cu121
GPU: NVIDIA CUDA-enabled device
Thank you for your assistance!
Best regards,
Alexandre
Beta Was this translation helpful? Give feedback.
All reactions