Use a thirty-party voice activity detection package with audio chunk feed programmatically in next.js v15? #82730
Replies: 2 comments 1 reply
-
Looks like they've got it here? Look at the next.config.js files, and usage of the vad-web library. |
Beta Was this translation helpful? Give feedback.
1 reply
-
After painstakingly tries, I got it working finally. Here are the code segments for creating the VAD:
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Summary
I am working in next.js v15 and typescript env. Here is my user case:
a MediaStreamAudioSourceNode object,
a AudioWorkletNode object,
a Web Worker for ricky123 VAD process.
The local mic audio (or remote peer audio) is streamed to the MediaStreamAudioSourceNode object, which is connected to the AudioWorkletNode object for preprocessing, which feeds the audio chunks to the WebWorker for ricky123 VAD in a browser. The code segments are as follows.
In main thread UI component,
In ricky123 VAD Web Worker,
Running the code, it hangs on
await MicVAD.new
and thusconsole.log("Initialize VAD object.")
is never called. I have copiedsilero_vad_v5.onnx
intopublic/vad-models
.My questions:
Thank you
Additional information
No response
Example
No response
Beta Was this translation helpful? Give feedback.
All reactions