
Conversation

@KitaitiMakoto (Collaborator) commented Nov 11, 2025

Hi,

I found that the VAD feature can be used separately from ASR, and I've added an API for that to the Ruby bindings.

Thank you for such a useful feature and API.
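
For context, the Ruby API in this PR wraps whisper.cpp's standalone VAD functions. Below is a minimal sketch of that flow against the C API in whisper.h, not the Ruby binding itself; the model path and the timestamp unit are assumptions:

```cpp
#include "whisper.h"

#include <cstdio>
#include <vector>

int main() {
    // Load a VAD model (path is an example; Silero VAD models ship as GGML files).
    struct whisper_vad_context_params cparams = whisper_vad_default_context_params();
    struct whisper_vad_context * vctx =
        whisper_vad_init_from_file_with_params("models/ggml-silero-v5.1.2.bin", cparams);
    if (vctx == nullptr) {
        return 1;
    }

    // 16 kHz mono float PCM; one second of silence stands in for real audio here.
    std::vector<float> pcmf32(16000, 0.0f);

    // Detect speech and convert frame probabilities into segments in one call.
    struct whisper_vad_params vparams = whisper_vad_default_params();
    struct whisper_vad_segments * segs =
        whisper_vad_segments_from_samples(vctx, vparams, pcmf32.data(), (int) pcmf32.size());

    for (int i = 0; i < whisper_vad_segments_n_segments(segs); ++i) {
        // timestamp unit as reported by the API (assumed seconds here)
        printf("speech segment %d: %.2f -> %.2f\n", i,
               whisper_vad_segments_get_segment_t0(segs, i),
               whisper_vad_segments_get_segment_t1(segs, i));
    }

    whisper_vad_free_segments(segs);
    whisper_vad_free(vctx);
    return 0;
}
```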

@jwijffels (Contributor)

A question. Did you manage to run the VAD on a GPU or is the VAD CPU-only?

@KitaitiMakoto (Collaborator, Author)

Thank you for the approval!

@KitaitiMakoto merged commit d9b7613 into ggml-org:master on Nov 13, 2025 (64 of 66 checks passed).
@KitaitiMakoto deleted the ruby-vad branch on November 13, 2025 at 01:15.
@KitaitiMakoto (Collaborator, Author)

> Did you manage to run the VAD on a GPU or is the VAD CPU-only?

I hadn't paid attention to that. It seems the CPU is used:

whisper.cpp/src/whisper.cpp

Lines 4658 to 4660 in a1867e0

// TODO: GPU VAD is forced disabled until the performance is improved
//whisper_context_params.use_gpu = vctx->params.use_gpu;
whisper_context_params.use_gpu = false;
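
In other words, even a caller that opts in via the VAD context params currently gets the CPU path, since the flag is overridden at init time. A small sketch, assuming the `whisper_vad_context_params` fields from whisper.h:

```cpp
struct whisper_vad_context_params cparams = whisper_vad_default_context_params();
cparams.use_gpu = true; // requested, but per the snippet above whisper.cpp forces use_gpu = false
struct whisper_vad_context * vctx =
    whisper_vad_init_from_file_with_params("models/ggml-silero-v5.1.2.bin", cparams);
// the VAD model's internal whisper context is created with use_gpu = false regardless
```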

@jwijffels (Contributor)

> I hadn't paid attention to that. It seems the CPU is used: […]

Thanks for the answer and the link to the code. Indeed CPU-only.

bygreencn added a commit to bygreencn/whisper.cpp that referenced this pull request Nov 15, 2025
# By Georgi Gerganov (80) and others
# Via GitHub
* ggerganov/master: (441 commits)
  ruby : VAD separately from ASR (ggml-org#3518)
  sync : llama.cpp
  sync : ggml
  vulkan: iGPU memory reporting fix (llama/17110)
  vulkan: fix mmq out of bounds reads (llama/17108)
  vulkan: fuse mul_mat_id + mul (llama/17095)
  metal : retain src and dst buffers during async ops (llama/17101)
  vulkan: Use spec constants for conv2d s/d/p and kernel W/H (llama/16978)
  Revert "CUDA: add expert reduce kernel (ggml/16857)" (llama/17100)
  CUDA: skip fusion for repeating adds in bias (llama/17080)
  vulkan: Increase BK to 32; use BK/4 for non-CM mul_mm.comp (llama/16636)
  ggml: disable vxe for cross-compilation by default (llama/16966)
  vulkan: fuse rms_norm + mul + rope (+ view + set_rows) (llama/16977)
  vulkan: Fix test-thread-safety crashes (llama/17024)
  CUDA: fix MMQ stream-k fixup ne1 indices (llama/17089)
  ggml webgpu: faster matrix multiplication/matrix-vector multiplication (llama/17031)
  CUDA: properly handle nb00=nb02 case for cpy (llama/17081)
  vulkan : refactor buffer handling in vk_op_f32 (llama/16840)
  CUDA: fix should_use_mmvf for ne11 == 1 (llama/17085)
  Revert "ggml-cpu: detect correct cpu flags for arm64 (llama/16229) (#16239)" (llama/17084)
  ...

# Conflicts:
#	examples/CMakeLists.txt