ruby : VAD separately from ASR #3518
A question: did you manage to run the VAD on a GPU, or is the VAD CPU-only?

Thank you for the approval!
I hadn't paid attention to that. It seems the CPU is used: Lines 4658 to 4660 in a1867e0

Thanks for the answer and the link to the code. Indeed, it is CPU-only.
# By Georgi Gerganov (80) and others
# Via GitHub
* ggerganov/master: (441 commits)
  ruby : VAD separately from ASR (ggml-org#3518)
  sync : llama.cpp
  sync : ggml
  vulkan: iGPU memory reporting fix (llama/17110)
  vulkan: fix mmq out of bounds reads (llama/17108)
  vulkan: fuse mul_mat_id + mul (llama/17095)
  metal : retain src and dst buffers during async ops (llama/17101)
  vulkan: Use spec constants for conv2d s/d/p and kernel W/H (llama/16978)
  Revert "CUDA: add expert reduce kernel (ggml/16857)" (llama/17100)
  CUDA: skip fusion for repeating adds in bias (llama/17080)
  vulkan: Increase BK to 32; use BK/4 for non-CM mul_mm.comp (llama/16636)
  ggml: disable vxe for cross-compilation by default (llama/16966)
  vulkan: fuse rms_norm + mul + rope (+ view + set_rows) (llama/16977)
  vulkan: Fix test-thread-safety crashes (llama/17024)
  CUDA: fix MMQ stream-k fixup ne1 indices (llama/17089)
  ggml webgpu: faster matrix multiplication/matrix-vector multiplication (llama/17031)
  CUDA: properly handle nb00=nb02 case for cpy (llama/17081)
  vulkan : refactor buffer handling in vk_op_f32 (llama/16840)
  CUDA: fix should_use_mmvf for ne11 == 1 (llama/17085)
  Revert "ggml-cpu: detect correct cpu flags for arm64 (llama/16229) (#16239)" (llama/17084)
  ...

# Conflicts:
#	examples/CMakeLists.txt
Hi,
I found that the VAD feature can be used separately from ASR, and I added an API for that to the Ruby bindings.
Thank you for such a useful feature and API.
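
To illustrate what running VAD on its own yields (speech time ranges, with no transcription step), here is a toy sketch in plain Ruby. Everything in it is an assumption made for illustration: `Segment`, `detect_speech`, the frame size, and the threshold are hypothetical names and values, and the simple energy threshold is a stand-in for the model-based detector in whisper.cpp. It is not the API added in this PR.

```ruby
# Toy energy-threshold VAD, for illustration only.
# NOT the whisper.cpp detector and NOT the Ruby bindings' API;
# Segment and detect_speech are hypothetical names.
Segment = Struct.new(:start_time, :end_time)

# samples: Array of Floats in [-1.0, 1.0]; returns an Array of Segment (seconds).
def detect_speech(samples, sample_rate:, frame_ms: 30, threshold: 0.02)
  frame_len = sample_rate * frame_ms / 1000
  segments = []
  start = nil

  samples.each_slice(frame_len).with_index do |frame, i|
    # Root-mean-square energy of the current frame.
    rms = Math.sqrt(frame.sum { |s| s * s } / frame.size)
    t = i * frame_ms / 1000.0

    if rms >= threshold
      start ||= t                       # a speech run begins (or continues)
    elsif start
      segments << Segment.new(start, t) # the run ended at this quiet frame
      start = nil
    end
  end

  # Close a run that lasts until the end of the audio.
  segments << Segment.new(start, samples.size.to_f / sample_rate) if start
  segments
end

# 1 s of silence, 1 s of a 440 Hz tone standing in for speech, 1 s of silence.
rate = 16_000
silence = Array.new(rate, 0.0)
tone = Array.new(rate) { |n| 0.5 * Math.sin(2 * Math::PI * 440 * n / rate) }

segments = detect_speech(silence + tone + silence, sample_rate: rate)
segments.each do |seg|
  puts format("%.2f --> %.2f", seg.start_time, seg.end_time)
end
```

The point of the sketch is only the shape of the result: a list of start/end times that a caller could inspect, or use to cut audio, before deciding whether to run ASR at all.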