Replies: 2 comments 1 reply
-
This is a fascinating direction — what you're describing touches multiple fragile zones in multi-agent RAG setups:
From experience, I’ve seen these fall into what I call:
I’ve been exploring ways to counter this with a layered attention protocol — kind of like a “Drunk Transformer” approach: each head gets its own identity, entropy push, and illegal route suppression. Happy to share more if you’re diving deeper into this zone. |
Beta Was this translation helpful? Give feedback.
-
Love the direction — especially the image routing part. If you’re already seeing multiple agent projects growing, you might find this full failure-mode breakdown map useful: 🔗 https://github.com/onestardao/WFGY/blob/main/ProblemMap/README.md Covers stuff like Logic Collapse, Multi-Agent Chaos, Creative Freeze, and a few weird edge cases where control tokens silently kill reasoning flow. Would be super curious to hear what modes you’re hitting most often in your builds. I’m actively tuning the fallback recovery patterns (esp. for vision agent chains). |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi has anyone put thought into handling different stop tokens for multimode models and RAG agent processes? It looks like there is already a framework for
is_control_token()
but not much is being done with it. This only does a continue instead of handling different types of control tokens.https://github.com/ggerganov/llama.cpp/blob/6b2a849d1f43d46b82d2f9c08c3275137b528784/src/llama.cpp#L15732
For example:
bc
.I see a lot more models recently supporting agents or external tooling. Arcee for example.
https://www.marktechpost.com/2024/07/06/arcee-ai-introduces-arcee-agent-a-cutting-edge-7b-parameter-language-model-specifically-designed-for-function-calling-and-tool-use/
Obviously with security implications this should be disabled by default and proper warnings to admins but the possibilities are interesting and could support more GPT4o-like behaviour.
Beta Was this translation helpful? Give feedback.
All reactions