if GGML_BACKEND_DL defined use one of the ggml_backend_load functions #3057
Conversation
…ay be more cases, two identified and trapped
I decided to remove all references to whisper_load_backends and to update the bench, cli and stream examples to illustrate basic GGML_BACKEND_DL usage. In ggml-backend.cpp, at line 1475 at the start of ggml_backend_sched_new, you assert that the last available device is a CPU. The examples could now allow a selectable device when using GGML_BACKEND_DL (e.g. benchmark CPU vs Vulkan vs CUDA).
The changes to whisper.cpp look good to me, but the changes to ggml-backend.cpp/h need to be removed. ggml is a separate library and cannot be modified in this way. There are also good reasons why ggml_backend_sched requires a CPU backend: even when using a GPU, the CPU backend must be available.
OK - I'll modify (and un-modify) as required. I presume the CPU is required as a fallback if another backend fails? I just had that happen to me with BLAS.
examples/bench/bench.cpp (outdated)

```cpp
if (ggml_backend_load_best(params.device.c_str(), true, nullptr) == nullptr) {
    fprintf(stderr, "error: could not load device %s\n", params.device.c_str());
    return 5;
}
```
This function is not currently available to applications, but I agree that it should be. I will make the change necessary to make this function public, but at the moment it cannot be used here.

There is also an important distinction between devices and backends. A backend may have multiple devices, e.g. in a system with multiple GPUs, and it would be good to add the ability to whisper.cpp to choose which device to use, but that would need to be done in a different way (e.g. by making whisper.cpp accept a `ggml_backend_dev_t` object in `whisper_context_params`). The implementation in llama.cpp may be useful as a guide, although it is a bit more complicated since llama.cpp can use multiple devices at the same time.

In conclusion:
- Adding a `--backend` parameter to choose which backend to load would be good, but it either needs to use `ggml_backend_load` to load specifically the file given by the user, or it would need to wait until `ggml_backend_load_best` is made public
- Adding a `--device` parameter to choose which device to use would also be good, but it must be a separate setting

Probably better left for a separate PR.
So in the meantime, can I safely expose ggml_backend_load_best before it's public? That's a good start from my point of view (I'm fairly lost without it loading the CPU backend on older machines).

I'll wrap it in a whisper_load_device function, making sure that there's at least one CPU backend at the end of the list; should that be OK?
The two modifications I made to ggml-backend.cpp dealt with a nullptr being passed to functions that return a member of the passed parameter.
Addressed all objections as far as I can see:
- As suggested when I exposed whisper_load_backends, updated the bench, cli and stream examples with conditional backend-load calls
- Added a helpful error message if a null device access is attempted, in the two places I've seen it happen
- Next I need to add load_best etc. to allow a specific device to be demanded (which can cause other problems)