-
Notifications
You must be signed in to change notification settings - Fork 42
Description
I wanted to ask whether a failure mode taxonomy and debugging guide for RAG systems would be in scope as an additional resource for this learning repo.
What WFGY Problem Map is
WFGY is an open source project that defines a 16 mode failure map (No.1 to No.16) for LLM plus RAG pipelines. The modes include:
- retrieval hallucination even when relevant evidence is present
- vector store ingestion and index fragmentation
- bootstrap ordering and infra race conditions between API gateway and vector DB
- secrets and config drift only visible on first production deploy
- and other quietly dangerous failure modes
Problem Map README:
https://github.com/onestardao/WFGY/tree/main/ProblemMap#readme
For each mode, there is a description, typical symptoms and suggested minimal countermeasures.
Why it might help readers of this repo
This repository collects learning materials and recordings around AI infra. In many teams, the missing piece is not “how to build a RAG stack”, but “how to reason about the ways it fails in production”.
A concrete failure mode map gives learners and practitioners:
- a vocabulary to discuss what went wrong in real incidents
- a checklist to run through before declaring a RAG system ready for use
- a structured way to document lessons learned from outages and silent failures
Possible entry
If you think this fits the scope, a short entry under a Debugging, reliability, or hardening section could be:
WFGY 16 Problem Map, RAG failure mode taxonomy
Open source map of 16 real world failure modes (No.1 to No.16) for LLM plus RAG systems, with debugging checklists and mitigation ideas.
https://github.com/onestardao/WFGY/tree/main/ProblemMap#readme
If you prefer to keep the repo focused on a smaller set of course-specific materials, feel free to ignore this suggestion. Thanks again for curating and sharing these resources.