Specifying number of speakers in the Huggingface Inference API #1088
Unanswered
Jeannotisintheplace
asked this question in
Q&A
Replies: 1 comment
-
I do not think this is supported. I guess it should be possible, as there are tasks for which options can be passed to the API.

This part is handled outside of pyannote, in the Huggingface repo: https://github.com/huggingface/api-inference-community/tree/main/docker_images/pyannote_audio

cc @julien-c, who might be able to indicate what change should be made to add this feature to the API.

This is more or less what is run on Huggingface servers:

    pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization")
    pipeline("audio.wav")

This is what @Jeannotisintheplace would like to run:

    pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization")
    pipeline("audio.wav", num_speakers=2)
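
Until the hosted API accepts such an option, one workaround is to run the pipeline locally. The following is a minimal sketch, not the code that runs on Huggingface servers: the helper names `diarize` and `load_pipeline` are hypothetical, and the sketch assumes pyannote.audio is installed and that the pretrained model can be downloaded (it may require an access token depending on the model version).

```python
from typing import Any, Callable, Optional


def diarize(pipeline: Callable[..., Any], audio_path: str,
            num_speakers: Optional[int] = None) -> Any:
    """Run a diarization pipeline, forwarding num_speakers only when it is given.

    `diarize` is a hypothetical helper for illustration, not part of pyannote.
    """
    kwargs = {}
    if num_speakers is not None:
        if num_speakers < 1:
            raise ValueError("num_speakers must be a positive integer")
        kwargs["num_speakers"] = num_speakers
    # Without num_speakers the pipeline estimates the speaker count itself.
    return pipeline(audio_path, **kwargs)


def load_pipeline() -> Any:
    """Load the pretrained pipeline (hypothetical helper).

    The import is done lazily so the module can be imported
    even where pyannote.audio is not installed.
    """
    from pyannote.audio import Pipeline
    return Pipeline.from_pretrained("pyannote/speaker-diarization")


# Example local usage (requires pyannote.audio and the model weights):
#   pipeline = load_pipeline()
#   diarization = diarize(pipeline, "audio.wav", num_speakers=2)
#   for segment, _, speaker in diarization.itertracks(yield_label=True):
#       print(f"{segment.start:.1f}s - {segment.end:.1f}s: {speaker}")
```

Keeping `num_speakers` optional means the same helper covers both calls quoted above: the default behaviour, and the call with a known number of speakers.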
-
Hi team,
Could you please explain how to pass the number of speakers to the API when it is known in advance?
Should "num_speaker" be specified in the headers (or in the parameters?), and in which format?
Sorry if this is a trivial question, and thanks in advance,
Julien