Kilo Code appears to be stuck in a loop, attempting the same action (apply_diff) repeatedly. #1423
Replies: 3 comments
-
I've noticed this happens at the same spot, context-wise, for every single model. For the GPT models it seems to happen at 16.3k, while for Gemini it seems to happen around 88k. The key here is how consistent it is. I was running my own tests with llama.cpp and also Ollama so I could watch what's being sent to the server. Just saying hello takes over 10k context tokens (imagine using 30k words to say hello). Local models have a default context length of 2048 or 4096. Seeing the pattern here? The fact is we're getting close to an edge where one block of memory gives way to the next. You either have to blow through that edge by generating up to 4k new tokens, or reduce the context to stay under it. You can't just trip the light fantastic on the edge of a rope. Here's what I'm doing, and it seems to work because it generates enough tokens to get us off the edge.
Doing this gets the model a page or two past that barrier point, and the problem won't come back until we get close to the next edge, which seems to be around 32k, 64k, etc. I'll bet the 88k on Gemini is a similar issue.
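To make the "edge" idea concrete, here is a minimal sketch (not Kilo Code's actual logic; the boundary list, margin, and function names are all assumptions for illustration) of how you might detect that a prompt is sitting right on a context boundary and decide whether to trim history or deliberately generate enough new tokens to land past it:

```python
# Illustrative sketch only: estimate prompt size, check proximity to a
# power-of-two context boundary, and plan how many new tokens to generate.

BOUNDARIES = [16_384, 32_768, 65_536, 131_072]  # typical context "edges"
SAFETY_MARGIN = 512                              # hypothetical buffer in tokens

def estimate_tokens(text: str) -> int:
    # Rough heuristic: about 4 characters per token for English text.
    return max(1, len(text) // 4)

def plan_request(prompt: str, max_new_tokens: int = 4096) -> dict:
    used = estimate_tokens(prompt)
    for edge in BOUNDARIES:
        if edge - SAFETY_MARGIN <= used < edge:
            # Sitting right on an edge: either trim the prompt below it,
            # or generate enough new tokens to move well past it.
            overshoot = (edge - used) + SAFETY_MARGIN
            return {
                "action": "generate_past_edge",
                "min_new_tokens": overshoot,
                "max_new_tokens": max(max_new_tokens, overshoot),
            }
    return {"action": "normal", "max_new_tokens": max_new_tokens}
```

Under these assumptions, a prompt that lands within a few hundred tokens of 16,384 would trigger the "generate past the edge" branch, which mirrors the behavior described above where getting a page or two of output past the barrier makes the loop stop until the next boundary.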
-
This PR should fix this with Morph's fast apply model! #1428
-
This is what the Usage Overview shows. I think I should configure the request limit to 20-30? What is the impact if we limit it?
-
Kilo Code is having trouble...
Kilo Code appears to be stuck in a loop, attempting the same action (apply_diff) repeatedly. This might indicate a problem with its current strategy. Consider rephrasing the task, providing more specific instructions, or guiding it towards a different approach.