This document provides essential guidelines for AI agents working on this LangGraph FastAPI Agent project.
This is a production-ready AI agent application built with:
- LangGraph for stateful, multi-step AI agent workflows
- FastAPI for high-performance async REST API endpoints
- Langfuse for LLM observability and tracing
- PostgreSQL + pgvector for long-term memory storage (mem0ai)
- JWT authentication with session management
- Prometheus + Grafana for monitoring
- All imports MUST be at the top of the file - never add imports inside functions or classes
- Use structlog for all logging
- Log messages must be lowercase_with_underscores (e.g., `"user_login_successful"`)
- NO f-strings in structlog events - pass variables as kwargs
- Use `logger.exception()` instead of `logger.error()` to preserve tracebacks
- Example: `logger.info("chat_request_received", session_id=session.id, message_count=len(messages))`
- Always use tenacity library for retry logic
- Configure with exponential backoff
- Example: `@retry(stop=stop_after_attempt(3), wait=wait_exponential(multiplier=1, min=4, max=10))`
- Always use the rich library for formatted console output
- Use rich for progress bars, tables, panels, and formatted text
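A minimal sketch of rich-formatted output (`build_session_table` and its columns are hypothetical, not project code):

```python
from rich.console import Console
from rich.table import Table

def build_session_table(sessions: list[dict]) -> Table:
    # Build a formatted table of sessions instead of raw print() output.
    table = Table(title="Active Sessions")
    table.add_column("Session ID")
    table.add_column("User")
    for session in sessions:
        table.add_row(session["id"], session["user"])
    return table

console = Console()
# console.print(build_session_table(...)) renders the table with borders and styling.
```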
- Only cache successful responses, never cache errors
- Use appropriate cache TTL based on data volatility
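A minimal stdlib sketch of these two caching rules (`ttl_cache` is a hypothetical helper; production code would more likely use Redis or another shared cache):

```python
import time
from typing import Any, Callable

def ttl_cache(ttl_seconds: float) -> Callable:
    """Cache successful results for ttl_seconds; errors are never cached."""
    def decorator(func: Callable) -> Callable:
        store: dict[tuple, tuple[float, Any]] = {}

        def wrapper(*args: Any) -> Any:
            now = time.monotonic()
            if args in store:
                expires_at, value = store[args]
                if now < expires_at:
                    return value
            # If func raises, the exception propagates without storing
            # anything, so a failing call is retried on the next invocation.
            value = func(*args)
            store[args] = (now + ttl_seconds, value)
            return value
        return wrapper
    return decorator
```

The TTL is a per-decorator argument, so volatile data can use a short TTL and stable data a long one.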
- All routes must have rate limiting decorators
- Use dependency injection for services, database connections, and auth
- All database operations must be async
- Use `async def` for asynchronous operations
- Use type hints for all function signatures
- Prefer Pydantic models over raw dictionaries
- Use functional, declarative programming; avoid classes except for services and agents
- File naming: lowercase with underscores (e.g., `user_routes.py`)
- Use the RORO pattern (Receive an Object, Return an Object)
- Handle errors at the beginning of functions
- Use early returns for error conditions
- Place the happy path last in the function
- Use guard clauses for preconditions
- Use `HTTPException` for expected errors with appropriate status codes
- Use `StateGraph` for building AI agent workflows
- Define clear state schemas using Pydantic models (see `app/schemas/graph.py`)
- Use `CompiledStateGraph` for production workflows
- Implement `AsyncPostgresSaver` for checkpointing and persistence
- Use `Command` for controlling graph flow between nodes
- Use LangChain's `CallbackHandler` from Langfuse for tracing all LLM calls
- All LLM operations must have Langfuse tracing enabled
- Use `AsyncMemory` for semantic memory storage
- Store memories per user_id for personalized experiences
- Use async methods: `add()`, `get()`, `search()`, `delete()`
- Use JWT tokens for authentication
- Implement session-based user management (see `app/api/v1/auth.py`)
- Use the `get_current_session` dependency for protected endpoints
- Store sensitive data in environment variables
- Validate all user inputs with Pydantic models
- Use SQLModel for ORM models (combines SQLAlchemy + Pydantic)
- Define models in the `app/models/` directory
- Use async database operations with asyncpg
- Use LangGraph's AsyncPostgresSaver for agent checkpointing
- Minimize blocking I/O operations
- Use async for all database and external API calls
- Implement caching for frequently accessed data
- Use connection pooling for database connections
- Optimize LLM calls with streaming responses
- Integrate Langfuse for LLM tracing on all agent operations
- Export Prometheus metrics for API performance
- Use structured logging with context binding (request_id, session_id, user_id)
- Track LLM inference duration, token usage, and costs
- Implement metric-based evaluations for LLM outputs (see the `evals/` directory)
- Create custom evaluation metrics as markdown files in `evals/metrics/prompts/`
- Use Langfuse traces for evaluation data sources
- Generate JSON reports with success rates
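A minimal stdlib sketch of report generation (`build_eval_report` and the result shape are hypothetical):

```python
import json

def build_eval_report(results: list[dict]) -> dict:
    # results are hypothetical per-case outcomes, e.g. {"passed": bool, ...}.
    total = len(results)
    passed = sum(1 for r in results if r["passed"])
    return {
        "total_cases": total,
        "passed": passed,
        "success_rate": round(passed / total, 4) if total else 0.0,
    }

def write_report(results: list[dict], path: str) -> None:
    # Persist the summary as a JSON report.
    with open(path, "w") as f:
        json.dump(build_eval_report(results), f, indent=2)
```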
- Use environment-specific configuration files (`.env.development`, `.env.staging`, `.env.production`)
- Use Pydantic Settings for type-safe configuration (see `app/core/config.py`)
- Never hardcode secrets or API keys
- FastAPI - Web framework
- LangGraph - Agent workflow orchestration
- LangChain - LLM abstraction and tools
- Langfuse - LLM observability and tracing
- Pydantic v2 - Data validation and settings
- structlog - Structured logging
- mem0ai - Long-term memory management
- PostgreSQL + pgvector - Database and vector storage
- SQLModel - ORM for database models
- tenacity - Retry logic
- rich - Terminal formatting
- slowapi - Rate limiting
- prometheus-client - Metrics collection
- All routes must have rate limiting decorators
- All LLM operations must have Langfuse tracing
- All async operations must have proper error handling
- All logs must follow structured logging format with lowercase_underscore event names
- All retries must use tenacity library
- All console outputs should use rich formatting
- All caching should only store successful responses
- All imports must be at the top of files
- All database operations must be async
- All endpoints must have proper type hints and Pydantic models
- ❌ Using f-strings in structlog events
- ❌ Adding imports inside functions
- ❌ Forgetting rate limiting decorators on routes
- ❌ Missing Langfuse tracing on LLM calls
- ❌ Caching error responses
- ❌ Using `logger.error()` instead of `logger.exception()` for exceptions
- ❌ Blocking I/O operations without async
- ❌ Hardcoding secrets or API keys
- ❌ Missing type hints on function signatures
Before modifying code:
- Read the existing implementation first
- Check for related patterns in the codebase
- Ensure consistency with existing code style
- Add appropriate logging with structured format
- Include error handling with early returns
- Add type hints and Pydantic models
- Verify Langfuse tracing is enabled for LLM calls
- LangGraph Documentation: https://langchain-ai.github.io/langgraph/
- LangChain Documentation: https://python.langchain.com/docs/
- FastAPI Documentation: https://fastapi.tiangolo.com/
- Langfuse Documentation: https://langfuse.com/docs