You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: ARCHITECTURE.md
+9-7Lines changed: 9 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -36,7 +36,9 @@ ChaosPilot is a full-stack AI platform for log analysis, incident detection, and
36
36
-**Agent Manager:**
37
37
- Receives user requests and delegates to specialized sub-agents.
38
38
-**Agent Handoffs:**
39
-
- Workflows are designed for agent handoff (e.g., detector → planner → fixer/notifier).
39
+
- Workflows are designed for dynamic agent handoff (e.g., detector → planner → fixer/notifier). If a sub-agent determines another agent is better suited for the next step, it delegates the task.
40
+
-**Sub-Agent LLM Access:**
41
+
- Sub-agents can directly invoke Gemini (Google AI Platform) for LLM-powered analysis, classification, and planning, not just the main agent manager.
40
42
-**Dynamic Toolsets:**
41
43
- Each agent can invoke tools from the ADK toolbox, with toolsets defined per agent type.
42
44
-**Frontend Visualization:**
@@ -47,14 +49,14 @@ ChaosPilot is a full-stack AI platform for log analysis, incident detection, and
47
49
## 4. Google Cloud & AI Services (including Gemini)
48
50
49
51
-**BigQuery:**
50
-
- Stores and queries logs, incident data. Agents use BigQuery for analytics and context retrieval.
52
+
- Stores and queries logs, incident data. Agents use the mcp-toolbox to connect to BigQuery, retrieve logs, and perform analytics and context retrieval.
51
53
-**Cloud Logging:**
52
-
- Ingests and manages raw logs. Scripts in `/scripts/` support log injectionand management.
54
+
- Ingests and manages raw logs. Logs are exported (sunk) from Cloud Logging to BigQuery for structured querying and analysis. Scripts in `/scripts/` support log injection, management, and sink setup.
53
55
-**Gemini LLM (Google AI Platform):**
54
-
-Backend calls Gemini for advanced log analysis, incident classification, and remediation planning.
56
+
-Sub-agents and the main agent manager can each call Gemini for advanced log analysis, incident classification, and remediation planning.
55
57
- All LLM calls are backend-only. There is currently no retrieval-augmented generation (RAG) pipeline, embedding generation, or vector similarity search implemented in the codebase. If RAG is implemented in the future, it will follow strict security and privacy guidelines.
56
58
-**ADK Toolbox:**
57
-
- All tools and toolsets are defined for use by agents, ensuring schema compliance and dynamic extensibility.
59
+
- All tools and toolsets are defined for use by agents, ensuring schema compliance and dynamic extensibility. The mcp-toolbox provides the interface and schema for agents to interact with BigQuery and other data sources.
58
60
59
61
---
60
62
@@ -83,8 +85,8 @@ ChaosPilot is a full-stack AI platform for log analysis, incident detection, and
83
85
3. Frontend sends authenticated request to FastAPI backend.
84
86
4. Backend authenticates and invokes the main ADK agent.
85
87
5. Agent manager delegates to the appropriate sub-agent.
86
-
6. Sub-agent queries BigQuery, retrieves relevant logs, and may send those logs or summaries to the LLM (Gemini or Azure) for analysis.
87
-
7. Agent manager may hand off to other agents as needed.
88
+
6. Sub-agent queries BigQuery (via the mcp-toolbox), retrieves relevant logs, and may send those logs or summaries to Gemini for LLM-powered analysis.
89
+
7. Agent handoff is dynamic: if a sub-agent determines another is better suited for the next step, it delegates the task.
88
90
8. Backend streams response to frontend, which visualizes the multi-agent workflow.
0 commit comments