android : Calculate required KV cache size by summing up tokens size and response token length (#12211) #12212

hanyin-arm · 2025-03-05T22:29:39Z

Calculate required KV cache size by summing up tokens size and response token length.

previously:
auto n_kv_req = tokens_list.size() + (n_len - tokens_list.size());

now:
auto n_kv_req = tokens_list.size() + n_len;

…se token length

ggerganov · 2025-03-06T06:23:20Z

Looks like the n_kv_req is use just to print an error message, so this change should not affect the output of the example.

Calculate required KV cache size by summing up tokens size and respon…

9efa3ea

…se token length

github-actions bot added android Issues specific to Android examples labels Mar 5, 2025

hanyin-arm mentioned this pull request Mar 5, 2025

Eval bug: Incorrect KV cache calculation in llama.android example #12211

Closed

ggerganov approved these changes Mar 6, 2025

View reviewed changes

ggerganov merged commit 57b6abf into ggml-org:master Mar 6, 2025
47 checks passed

mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025

android : fix KV cache log message condition (ggml-org#12212)

bf972c1

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Mar 19, 2025

android : fix KV cache log message condition (ggml-org#12212)

a77ce0c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

android : Calculate required KV cache size by summing up tokens size and response token length (#12211) #12212

android : Calculate required KV cache size by summing up tokens size and response token length (#12211) #12212

Uh oh!

hanyin-arm commented Mar 5, 2025

Uh oh!

Uh oh!

ggerganov commented Mar 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

android : Calculate required KV cache size by summing up tokens size and response token length (#12211) #12212

android : Calculate required KV cache size by summing up tokens size and response token length (#12211) #12212

Uh oh!

Conversation

hanyin-arm commented Mar 5, 2025

Uh oh!

Uh oh!

ggerganov commented Mar 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants