Update the docs on -t --threads #16236

takasurazeem · 2025-09-24T23:40:13Z

It's a documentation update for more clear wording on what -t parameter actually means.

ericcurtin · 2025-09-24T23:58:00Z

Are we sure this is correct? I was chatting with @doringeman about this recently. I tested this is the past and the default was certainly low, 4 threads (unless something changed in the meantime):

llama.cpp/ggml/include/ggml.h

Line 228 in 3a59971

#define GGML_DEFAULT_N_THREADS 4

It was super apparent in the past when testing on an Ampere system with 100+ cores

takasurazeem · 2025-09-25T04:01:34Z

Are we sure this is correct? I was chatting with @doringeman about this recently. I tested this is the past and the default was certainly low, 4 threads (unless something changed in the meantime):

llama.cpp/ggml/include/ggml.h

Line 228 in 3a59971

#define GGML_DEFAULT_N_THREADS 4

It was super apparent in the past when testing on an Ampere system with 100+ cores

Oh, my bad, must have been an oversight, I will go through the code and confirm. In any case the docs could benefit from more descriptive wording, thanks for the review.

adhusch · 2025-09-26T11:49:47Z

Are we sure this is correct? I was chatting with @doringeman about this recently. I tested this is the past and the default was certainly low, 4 threads (unless something changed in the meantime):

llama.cpp/ggml/include/ggml.h

Line 228 in 3a59971

#define GGML_DEFAULT_N_THREADS 4

It was super apparent in the past when testing on an Ampere system with 100+ cores

I can confirm that it now by default uses 100+ threads when you have 100+ cores. As this is likely not desired (but on the other hand the -1 default likely makes sense for the majority of users) the improved documentation is very valuable.

ngxson

This table is auto-generated, changes here will be discarded. Make your changes to arg.cpp instead.

llama.cpp/tools/server/README.md

Line 25 in eba9734

This reverts commit eba9734.

… all available cores

takasurazeem · 2025-10-14T04:56:28Z

This table is auto-generated, changes here will be discarded. Make your changes to arg.cpp instead.

llama.cpp/tools/server/README.md

Line 25 in eba9734

Addressed and added to correct file.

slaren · 2025-10-14T08:13:49Z

common/arg.cpp

    add_opt(common_arg(
        {"-t", "--threads"}, "N",
-        string_format("number of threads to use during generation (default: %d)", params.cpuparams.n_threads),
+        string_format("number of CPU threads to use during generation (default: %d, use all available.)", params.cpuparams.n_threads),


The default is not to use all available, the logic is more complex than that.

Ok, I have taken out the "use all available" part, but kept the CPU because it makes it clearer. I will take a look at the logic and put up another PR with more descriptive message.

@slaren I was thinking the same, saw it being set as 4 on an Ampere ARM machine with a bazillion cores in the past

There is some logic to avoid using logical cores (e.g. from SMT), but it may not work well in the Ampere CPU.

Update the docs on -t --threads

eba9734

takasurazeem requested review from ggerganov and ngxson as code owners September 24, 2025 23:40

github-actions bot added examples server labels Sep 24, 2025

ngxson requested changes Sep 29, 2025

View reviewed changes

takasurazeem added 2 commits October 8, 2025 21:58

Revert "Update the docs on -t --threads"

511cff3

This reverts commit eba9734.

docs: clarify -t/--threads parameter uses CPU threads and defaults to…

d8077e9

… all available cores

takasurazeem requested a review from ngxson October 9, 2025 01:58

slaren reviewed Oct 14, 2025

View reviewed changes

Update arg.cpp

df30781

takasurazeem requested a review from slaren October 15, 2025 14:00

slaren approved these changes Oct 15, 2025

View reviewed changes

ggerganov merged commit 6f5d924 into ggml-org:master Oct 16, 2025
70 checks passed

takasurazeem deleted the patch-2 branch October 18, 2025 02:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update the docs on -t --threads #16236

Update the docs on -t --threads #16236

takasurazeem commented Sep 24, 2025

Uh oh!

ericcurtin commented Sep 24, 2025 •

edited

Loading

Uh oh!

takasurazeem commented Sep 25, 2025 •

edited

Loading

Uh oh!

adhusch commented Sep 26, 2025

Uh oh!

ngxson left a comment •

edited

Loading

Uh oh!

takasurazeem commented Oct 14, 2025

Uh oh!

slaren Oct 14, 2025

Uh oh!

takasurazeem Oct 15, 2025

Uh oh!

ericcurtin Oct 15, 2025

Uh oh!

slaren Oct 15, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Update the docs on -t --threads #16236

Update the docs on -t --threads #16236

Conversation

takasurazeem commented Sep 24, 2025

Uh oh!

ericcurtin commented Sep 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

takasurazeem commented Sep 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

adhusch commented Sep 26, 2025

Uh oh!

ngxson left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

takasurazeem commented Oct 14, 2025

Uh oh!

slaren Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

takasurazeem Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

ericcurtin Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

slaren Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

ericcurtin commented Sep 24, 2025 •

edited

Loading

takasurazeem commented Sep 25, 2025 •

edited

Loading

ngxson left a comment •

edited

Loading