agent-debugging

Here are 34 public repositories matching this topic...

liaohch3 / claude-tap

Intercept and inspect Coding Agent API traffic from Claude Code, Codex CLI, Gemini CLI, Cursor CLI, OpenCode, Kimi/Kimi Code, Pi, and Hermes in a local trace viewer.

Updated Jun 24, 2026
Python

najeed / ai-agent-eval-harness

The open-source MultiAgentOps evaluation and verification harness for any industry business workflow.

Updated Jun 24, 2026
Python

OthmanAdi / langsmith-fetch-skill

🔍 AI observability skill for Claude Code. Debug LangChain/LangGraph agents by fetching execution traces from LangSmith Studio directly in your terminal.

developer-tools observability ai-agents langchain langsmith llm-ops langsmith-tracing developer-tools-ai-agent claude-skills claude-skills-creator claude-skills-hub claude-skills-libary agent-debugging

Updated Apr 6, 2026

cylestio / agent-inspector

Local open-source dev tool to debug, secure, and evaluate LLM agents. Provides static analysis, dynamic security checks, and runtime monitoring, and integrates with Cursor and Claude Code.

behavior-analysis agent-trace ai-security-tool agent-security cursor-integration claude-code-plugin agent-debugging

Updated Jan 15, 2026
Python

sentinelrca / sentinel

Root cause analysis for AI agents. Detects agent loops, retry storms, and optimization opportunities in LangSmith, Langfuse, Arize Phoenix, and OpenTelemetry traces.

phoenix multi-agent ai-agents arize root-cause-analysis opentelemetry ai-observability langsmith langfuse arize-phoenix llm-observability llm-debugging agent-debugging

Updated Jun 16, 2026
Python

converra / agent-triage

Diagnose your AI agents in production. Extract policies from prompts, evaluate traces, generate diagnostic reports.

Updated Mar 10, 2026
TypeScript

Ylsssq926 / clawclip

Cut your OpenClaw / ZeroClaw token bill. Find which model earns its cost. Prove whether optimizations actually work. Local, no upload.

hermes ai-agent ai-observability cost-reduction local-ai agent-tools llm-cost token-optimization agent-debugging openclaw zeroclaw hermes-agent agent-analytics prompt-efficiency

Updated Jun 20, 2026
TypeScript

aaronlab / browsertrace

Local replay debugger for Browser Use failures with screenshots, model I/O, failed-step timelines, and public-safe HTML exports.

Updated May 14, 2026
Python

aryanVijaywargia / Continua

Self-hosted durable execution engine with built-in observability for AI agent runs

react go debugging postgres typescript openapi self-hosted tracing observability ai-agents agent-debugging

Updated Jun 21, 2026
Go

amitmishrg / agenticlens

Visual debugging, tracing, and replay for agent workflows.

nodejs ai reactjs devtools tracing developer-tools visualizations observability debugging-tools ai-agents log-visualization jsonl ai-observability llm agentic-ai agent-workflows workflow-visualization agent-debugging execution-tracing

Updated Mar 27, 2026
JavaScript

Chopin998 / agent-replay

Local-first replay and regression checks for AI coding-agent sessions.

gemini regression-testing codex ai-agents opentelemetry llm-observability claude-code agent-debugging

Updated Jun 1, 2026
Python

kangjinghang / agent-chatlens

🔍 A beautiful web viewer for AI agent session files. Browse Claude Code & OpenClaw conversations with chat-style UI, timeline visualization, and zero setup.

react visualization typescript developer-tools dark-mode chat-ui claude conversation-analysis jsonl vite ai-agent session-viewer claude-code agent-debugging openclaw jsonl-viewer tool-call-visualization

Updated May 19, 2026
TypeScript

Tarunjit45 / ChainWatch

ChainWatch is a flight data recorder for multi-step AI systems. It's a CLI-based tool that records every step in an AI decision chain, links them together in order, prevents tampering, and allows you to verify the chain's integrity and replay the full decision flow.

ai artificial-intelligence audit-log autonomous-agents ai-agents ai-engineering ai-observability llm llmops ai-tracing agent-observability ai-audit agent-debugging tool-using-agents decision-tracing

Updated Jan 22, 2026
Python

xiaoshuo1988130 / deepseek-compat-kit

Compatibility and diagnostics for DeepSeek V4 tool-calling agents

json-schema llm-proxy deepseek tool-calling openai-compatible agent-debugging deepseek-v4 reasoning-content

Updated May 27, 2026
JavaScript

davccavalcante / agenticstash

Deterministic record and replay for agent runs. A zero-runtime-dependency TypeScript library and CLI that captures every source of non-determinism an agent touches (model output, tools, MCP, clock, randomness) and replays it exactly, with fork, diff, a tamper-evident seal, and redaction.

nodejs redaction diff typescript fork mcp edge openai compliance reproducibility tamper-evident record-replay llm anthropic ai-infrastructure time-travel-debugging deterministic-replay agent-debugging

Updated Jun 19, 2026
HTML

Exploreunive / agentlens

Explain why your agent failed — root-cause debugging, memory attribution, and run divergence for LLM agents.

python memory tracing developer-tools observability ai-agents llm agent-debugging

Updated Mar 31, 2026
Python

jigjoy-ai / kaleidoskop

Kaleidoskop — replay your baro/Mozaik agent runs visually. Audit log → hexagonal neural firing in your browser.

visualization typescript multi-agent replay mozaik observability ai-agents baro llm agent-orchestration agent-debugging jigjoy

Updated May 31, 2026
TypeScript

joshualamerton / AgentLens

A real-time observability and debugging layer for AI agents.

python machine-learning ai machine-learning-algorithms devtools agents ai-agents machine-learning-projects llms ai-devtools agent-debugging

Updated Mar 11, 2026
Python

valani9 / vstack

AI agents fail like junior teammates, looping on bad ideas, ignoring feedback, and escalating commitment. vstack ports 34 of the most-cited organizational-behavior frameworks so you can diagnose your agents the same way you'd diagnose your team.

python docker multi-agent-systems ai-agents fastapi psychological-safety organizational-behavior llmops llm-evaluation model-context-protocol mcp-server agent-evaluation agent-observability agent-debugging after-action-review

Updated Jun 23, 2026
Python

zengin0201 / AI_Debugger

ai-agents react-flow langchain visual-debugger agent-debugging langchain-debugger

Updated May 27, 2026
Python
