Replies: 1 comment
-
Another Observation: LoopDetector added another call to flash. The fun thing is that sometimes Flash actually causes a loop where there wasn't one. A better solution would be to inform the model at hand (pro) in response to its request that it seems to be in a loop, and to evaluate its approach. Also sometimes pro reacts to whatever flash is doing by really abbherent behavior. those times where it deletes and reimpleents or does generally "weird stuff" seem to be after a loop detection. But...often it wasn't looping. It also seems that repeated tool calls with different parameters can trigger the loop (including 10 shell commands in a row) but often these aren't loops. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
So I forked gemini-cli https://github.com/acoliver/llxprt-code/ and keep up to main by an llm-back auto-cherrypicker script
The nextSpeakerChecker is just a bad idea - sending the whole conversation potentially multiple times per request to ask whose turn it is feels wasteful (of tokens, money, time). It is also why Gemini 2.5-pro feels so slow. After the request is done it says something like "I fixed the problem and it should work now.." then you sit for several seconds while nextSpeakerChecker sends the whole thing to flash-lite.
So far I haven't hit a case where pro stops like GPT-4o and just forgets whose turn it is. It seems smart enough to know what to do. In the case it isn't a heuristic could do most of the work rather than requiring a full trip to flash.
Aside from that, I realize this is the code-assist team but just using the full endpoint when the user has it enabled instead of some code-assist and some full endpoint would probably clear up the issues for the paid/power users.
I'd like these things worked through a bit because if I fix them downstream my merges get more expensive. However, I think it would make the software more robust and stable for everyone.
Beta Was this translation helpful? Give feedback.
All reactions