
Conversation

@rgerganov (Collaborator) commented Oct 16, 2025

Start tracking the free memory on every device and report it appropriately.

Start reporting the free memory on every device instead of using fixed values. Now llama-cli users can get a nice memory breakdown when using RPC devices.

@rgerganov rgerganov requested a review from slaren October 16, 2025 13:45
@github-actions github-actions bot added the ggml label (changes relating to the ggml tensor library for machine learning) Oct 16, 2025
@rgerganov rgerganov requested a review from ggerganov as a code owner October 17, 2025 07:35
@rgerganov rgerganov changed the title from "rpc : track free memory" to "rpc : report actual free memory" Oct 17, 2025
@rgerganov (Collaborator, Author) commented:

@slaren I am thinking of dropping the -m option of rpc-server, which overrides the reported total_mem. It is confusing because it is not enforced, and users can achieve the same effect with --tensor-split on the client side. What do you think?

@slaren (Member) commented Oct 17, 2025

> @slaren I am thinking of dropping the -m option of rpc-server, which overrides the reported total_mem. It is confusing because it is not enforced, and users can achieve the same effect with --tensor-split on the client side. What do you think?

Yes, I think that would be a good change.
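For context, here is a hedged sketch of what the suggestion above amounts to in practice. The host names, port, and split ratios are made up for illustration; `-m` on rpc-server and `--rpc`/`--tensor-split` on llama-cli are the flags discussed in this thread, but exact spellings should be checked against the llama.cpp docs for your version.

```shell
# Previously, one could override the memory reported by a server:
#   rpc-server -p 50052 -m 4096
# but the limit was only advisory (not enforced on allocation).

# The equivalent control lives on the client side instead: list the RPC
# devices and split tensors across them explicitly, e.g. placing twice as
# much on the second device as on the first:
#   llama-cli -m model.gguf --rpc host1:50052,host2:50052 --tensor-split 1,2
```

Since the server now reports actual free memory, the default (automatic) split also works against real numbers, so most users should not need either knob.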

@rgerganov rgerganov merged commit 41386cf into ggml-org:master Oct 17, 2025
70 checks passed

Labels: examples, ggml (changes relating to the ggml tensor library for machine learning)


2 participants