UPSTREAM PR #14891: imatrix: calculate activation-based statistics for new format (GGUF) imatrices #240

DajanaV · 2025-11-17T18:42:06Z

Following up from #9400 and #12718, I've started tinkering with activation-based statistics, in addition to what's currently available via --show-statistics.

At the moment, I'm exploring three options, ranging from easy to implement but only an OK approximation, to some assembly required but fairly accurate:

L2 norm of activation difference: larger values would suggest that the tensor has significantly transformed its input relative to the previous layer.
KL divergence reduction using a pre-computed logit file: an approach similar to the one described by nostalgebraist in logit lens, based on a pre-computed logit file (e.g. from a previous llama-perplexity --save-all-logits run).
Logit prism: given that llama-imatrix already generates the actual logits to compute PPL, use Thông T. Nguyễn's logit prism approach to calculate the exact contribution of each layer to the final logit scores.
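To make the first two options concrete, here is a minimal Python sketch of the underlying math. It is not the code in this PR: the function names (`l2_distance`, `log_softmax`, `kl_divergence`) are hypothetical, and it assumes each layer's activations and each logit vector are available as plain lists of floats.

```python
import math

def l2_distance(curr, prev):
    """Euclidean norm of the element-wise difference between two activation vectors."""
    if len(curr) != len(prev):
        raise ValueError("activation vectors must have the same length")
    return math.sqrt(sum((c - p) ** 2 for c, p in zip(curr, prev)))

def log_softmax(logits):
    """Numerically stable log-softmax: subtract the max before exponentiating."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]

def kl_divergence(ref_logits, test_logits):
    """KL(P_ref || P_test) in nats, computed from raw (unnormalised) logits."""
    if len(ref_logits) != len(test_logits):
        raise ValueError("logit vectors must have the same length")
    log_p = log_softmax(ref_logits)
    log_q = log_softmax(test_logits)
    return sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))
```

Under these assumptions, a layer's "KL divergence reduction" could be estimated as `kl_divergence(ref, before) - kl_divergence(ref, after)`, where `ref` holds the pre-computed final logits and `before`/`after` are logit-lens projections of the hidden state entering and leaving the layer. How those projections are obtained is outside the scope of this sketch.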

Sharing with readers, and in particular @compilade and @jukofyork, in case anyone is willing to double-check my assumptions and/or suggest alternative approaches I haven't considered.

EAddario added 30 commits July 26, 2025 17:06

Use activations to calculate the stats

09bc7c2

Refactor variable names

2097f03

Fix problem up when GGUF does not have in_sum

78ddb47

Determine calculation mode

9744a4a

Compute entropy for activations

cce514a

Compute cosine similarity based on activations

b7fb362

Compute l2 norm

9b841eb

Adjust threshold

ee2509f

Update table display

fc8f925

Remove inactive

4c01f51

Reformat report layout

a32a2ec

Refactor variables

4d1325e

Update table layout

5324558

Refactor lambda into compute_tensor_averages() function

fce05aa

Refactor function names

be60469

Add compute_layer_statistics() function

a6155a8

Update aggregated statistic report layout

2117c4e

Minor cosmetic changes

90cb1be

Fix printing l2 norm when calc_mode = 1

f1c2a4c

Refactor variable name

c39c4e2

Merge branch 'master' into imatrix

adbff66

Do not resize if in_sum is null

5e40cf4

Compute aggregated (per layer) l2 norm

b373934

Update aggregated sum of squared activations per layer

906548a

Make ZD Score two-tailed

aea9b31

Refactor variable names

49996a1

Update report layout

4c3fea8

Refactor legacy mode

88854c9

Merge branch 'master' into imatrix

030ed3c

Merge branch 'master' into imatrix

c7959ed

loci-dev force-pushed the main branch 30 times, most recently from 074b005 to ff6ae69 Compare December 9, 2025 12:15
