
Misc. bug: [llama.android] Model keeps replying and cannot be stopped normally until it exceeds the context #11264

@codezjx

Description


Name and Version

The latest version (b4491) with the llama.android example

Operating systems

Other? (Please let us know in description)

Which llama.cpp modules do you know to be affected?

Other (Please specify in the next section)

Command line

No response

Problem description & steps to reproduce

  1. Run the llama.android example
  2. Load the SmolLM2 model
  3. Send a message with the chat template applied (user message: "Tell a joke"); the template is generated by the common_chat_apply_template() method

Code location: llama.cpp/examples/llama.android/app/src/main/java/com/example/llama/MainViewModel.kt

// ChatML-style prompt for SmolLM2, built to match the output of
// common_chat_apply_template()
val smollm2msg = "<|im_start|>system\n" +
    "You are a helpful AI assistant<|im_end|>\n" +
    "<|im_start|>user\n" +
    "$text<|im_end|>\n" +
    "<|im_start|>assistant\n"
viewModelScope.launch {
    llamaAndroid.send(smollm2msg)
        .catch {
            Log.e(tag, "send() failed", it)
            messages += it.message!!
        }
        // Append each streamed chunk to the last message in the list
        .collect { messages = messages.dropLast(1) + (messages.last() + it) }
}
  4. The issue reproduces: the model keeps replying and cannot be stopped normally until it exceeds the context, and the output contains special tokens such as <|im_start|> and <|im_end|>
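Until the example stops generation itself, a possible client-side workaround is to watch the streamed text for the chat template's end-of-turn marker and truncate there. The sketch below is a minimal illustration under the assumption that the marker arrives verbatim in the accumulated text; `truncateAtStopToken` is a hypothetical helper, not part of the llama.android API.

```kotlin
// Hypothetical helper: cut the accumulated reply at the first
// end-of-turn marker so leaked special tokens never reach the UI.
// Returns the (possibly truncated) text and whether a marker was found.
fun truncateAtStopToken(
    accumulated: String,
    stopTokens: List<String> = listOf("<|im_end|>", "<|im_start|>")
): Pair<String, Boolean> {
    for (stop in stopTokens) {
        val idx = accumulated.indexOf(stop)
        if (idx >= 0) {
            // Found a marker: keep only the text before it and signal stop.
            return accumulated.substring(0, idx) to true
        }
    }
    return accumulated to false
}
```

In the `collect` block above, one could run the accumulated message through this helper and cancel the collecting coroutine once the second element of the pair is `true`.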

[Screenshot: generated output containing <|im_start|> and <|im_end|> tokens]

Comparison test:

This problem cannot be reproduced with the command-line program llama-cli using the same model:

./llama-cli -m models/smollm2-360m-instruct-q8_0.gguf -p "You are a helpful assistant" -cnv
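A plausible explanation for the difference is that llama-cli stops sampling when it sees an end-of-generation token, while the example's completion loop runs until the context is exhausted. The sketch below shows the loop shape that would fix this on the Kotlin side; `sampleNextToken`, `isEndOfGeneration`, and `detokenize` are hypothetical stand-ins for the real JNI bindings, not actual llama.android functions.

```kotlin
// Sketch of a generation loop that stops on an end-of-generation
// token instead of running until the context is full. The three
// function parameters stand in for the native bindings.
fun generate(
    maxTokens: Int,
    sampleNextToken: () -> Int,
    isEndOfGeneration: (Int) -> Boolean,
    detokenize: (Int) -> String
): String {
    val sb = StringBuilder()
    for (i in 0 until maxTokens) {
        val token = sampleNextToken()
        // The missing check: break before emitting special/EOG tokens.
        if (isEndOfGeneration(token)) break
        sb.append(detokenize(token))
    }
    return sb.toString()
}
```

With a stubbed token stream where token 0 marks end-of-generation, the loop stops early instead of draining the whole stream.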

First Bad Commit

Present since the introduction of the llama.android example

Relevant log output

No response
