Count model parameters and size using only the GGUF tensors

### Discussed in https://github.com/ggerganov/llama.cpp/discussions/10274

The number of parameters and size of the model currently is calculated from the tensors created after the model is loaded, which in some cases may contain duplicated tensors, resulting in an inaccurate and inconsistent reporting of the model size. To address this, `llama_model_n_params` and `llama_model_size` should be modified to return the value as calculated in `llama_model_loader::n_elements` and `n_bytes`, which could be stored in `llama_model` while loading the model.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Count model parameters and size using only the GGUF tensors #10285

Discussed in #10274

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Count model parameters and size using only the GGUF tensors #10285

Description

Discussed in #10274

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions