Description
Name and Version
version: 4818 (dfd6b2c)
built with Android (10552028, +pgo, +bolt, +lto, -mlgo, based on r487747d) clang version 17.0.2 (https://android.googlesource.com/toolchain/llvm-project d9f89f4d16663d5012e5c09495f3b30ece3d2362) for x86_64-apple-darwin23.6.0
Operating systems
Other? (Please let us know in description)
GGML backends
CPU
Hardware
CPU: Google Tensor G4 (Pixel 9)
Models
No response
Problem description & steps to reproduce
The KV cache size calculation on line 364 of llama-android.cpp doesn't make sense: the expression algebraically reduces to simply assigning `n_len` to `n_kv_req`:

```cpp
auto n_kv_req = tokens_list.size() + (n_len - tokens_list.size());
```

Since `tokens_list` is tokenized from the input text (whether formatted or not), while `n_len` is the maximum number of tokens to generate, the required KV cache size should naturally be the sum of the two.
First Bad Commit
No response
Relevant log output
(Empty; no tokens are generated. When I send a long message formatted with a system prompt and a user prompt, `n_len`, with a default value of `64`, becomes the actual `n_kv_req`, which the prompt alone already exceeds.)