A cloud-native Python microservice implementing the UPEE (Understand → Plan → Execute → Evaluate) loop for intelligent chat interactions with multi-provider LLM support.
- Python 3.11+
- pip (or Poetry)

1. **Clone the repository**

   ```bash
   git clone https://github.com/your-org/paf-core-agent.git
   cd paf-core-agent
   ```

2. **Create and activate a virtual environment**

   ```bash
   python -m venv venv
   source venv/bin/activate  # On Windows: venv\Scripts\activate
   ```

3. **Install dependencies**

   ```bash
   pip install -r requirements.txt
   ```

4. **Set up environment variables**

   Create a `.env` file in the root directory:

   ```bash
   # Required: At least one LLM provider API key
   OPENAI_API_KEY=sk-your-openai-key-here
   # ANTHROPIC_API_KEY=sk-ant-your-anthropic-key-here
   # AWS_REGION=us-east-1  # For AWS Bedrock

   # Optional: Configuration
   DEBUG=true
   DEFAULT_MODEL=gpt-4o
   MAX_CONTEXT_TOKENS=4000
   ```

5. **Install file processing dependencies (optional)**

   ```bash
   # For Excel/CSV file processing
   pip install pandas openpyxl

   # For additional file types
   pip install python-docx PyPDF2 pillow
   ```

6. **Start the development server**

   ```bash
   chmod +x scripts/start.sh
   ./scripts/start.sh
   ```

   Or manually:

   ```bash
   uvicorn app.main:app --host 0.0.0.0 --port 8000 --reload
   ```
Once the server is running, test it:

```bash
# Basic health check
curl http://localhost:8000/api/health

# Chat test
curl -X POST http://localhost:8000/api/chat/stream \
  -H "Content-Type: application/json" \
  -d '{
    "message": "Hello! Can you help me analyze data?",
    "show_thinking": true,
    "model": "gpt-4o"
  }'
```

The service will be available at:
- API: http://localhost:8000
- Interactive Documentation: http://localhost:8000/docs
- Health Check: http://localhost:8000/api/health
- Debug Tools: http://localhost:8000/api/debug/inspect-request
Required:
- Python 3.11+
- At least one LLM provider API key (OpenAI, Anthropic, or AWS Bedrock)
Optional:
- File processing libraries (pandas, openpyxl) for Excel/CSV support
- Docker for containerized deployment
The core cognitive loop consists of four phases:
- **Understand** - Parse and analyze user input with context
- **Plan** - Develop a response strategy and identify required resources
- **Execute** - Generate the response using the appropriate LLM provider
- **Evaluate** - Assess response quality and refine if needed
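The four phases above can be sketched as a single refinement loop. This is an illustrative outline only; the actual implementation lives in `app/core/` and streams intermediate events, and the names below (`upee_loop`, `UPEEResult`) are hypothetical:

```python
from dataclasses import dataclass

@dataclass
class UPEEResult:
    understanding: str
    plan: str
    response: str
    quality_ok: bool

def upee_loop(message: str, max_refinements: int = 1) -> UPEEResult:
    # Understand: parse and analyze the user input
    understanding = f"intent of: {message}"
    # Plan: decide on a response strategy
    plan = "answer directly with the default model"
    response, quality_ok = "", False
    for _ in range(max_refinements + 1):
        # Execute: generate a response (an LLM call in the real service)
        response = f"response to '{message}' using plan '{plan}'"
        # Evaluate: assess quality; loop again to refine if it falls short
        quality_ok = len(response) > 0
        if quality_ok:
            break
    return UPEEResult(understanding, plan, response, quality_ok)
```

The key design point is that Evaluate can send the loop back through Execute, which is what distinguishes UPEE from a one-shot prompt/response pipeline.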
- **Server-Sent Events (SSE)** - Real-time streaming chat responses
- **Multi-Provider LLM** - OpenAI, Anthropic Claude, and AWS Bedrock support
- **File Context** - Intelligent file processing and summarization
- **gRPC Integration** - Communication with downstream worker agents
- **Observability** - Structured logging, Prometheus metrics, AWS X-Ray tracing
- **Security** - JWT/HMAC authentication, mTLS for gRPC
- **Container Ready** - Docker support with an optimized image size
- **Cloud Native** - AWS Fargate deployment with auto-scaling
```http
POST /api/chat/stream
Content-Type: application/json

{
  "message": "Hello, how can you help me?",
  "show_thinking": true,
  "files": [...],
  "model": "gpt-4",
  "temperature": 0.7
}
```

Response: a Server-Sent Events stream with:
- `thinking` events (UPEE phase insights)
- `content` events (response chunks)
- `complete` event (metadata and stats)
- `done` event (stream termination)
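A client can consume this stream by splitting on blank lines and dispatching on the event name. The parser below is a minimal sketch: the event names come from the list above, but the payload fields (`phase`, `delta`) are assumptions about the data format:

```python
import json

def parse_sse(raw: str):
    """Yield (event, data) pairs from a raw SSE response body."""
    event, data_lines = None, []
    for line in raw.splitlines():
        if line.startswith("event:"):
            event = line[len("event:"):].strip()
        elif line.startswith("data:"):
            data_lines.append(line[len("data:"):].strip())
        elif line == "" and event:
            # A blank line terminates one SSE event
            yield event, json.loads("\n".join(data_lines) or "null")
            event, data_lines = None, []

# Example body with hypothetical payload fields
raw = (
    'event: thinking\ndata: {"phase": "understand"}\n\n'
    'event: content\ndata: {"delta": "Hello"}\n\n'
    "event: done\ndata: {}\n\n"
)
events = list(parse_sse(raw))
```

In practice a real client would read the HTTP response incrementally (e.g. with `httpx` streaming) rather than buffering the whole body, but the framing logic is the same.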
`GET /api/health`

Returns service health status, including LLM provider availability.

`GET /api/chat/models`

Lists all available LLM models and their status.
```
paf-core-agent/
├── app/
│   ├── api/              # FastAPI routers
│   ├── core/             # UPEE logic
│   ├── llm_providers/    # Multi-provider abstraction
│   ├── grpc_clients/     # gRPC client implementations
│   ├── utils/            # Utilities (logging, auth, metrics)
│   ├── schemas.py        # Pydantic models
│   └── settings.py       # Configuration
├── tests/                # Test suites
├── scripts/              # Development scripts
├── proto/                # Protocol buffer definitions
└── requirements.txt      # Python dependencies
```
| Variable | Description | Required | Default |
|---|---|---|---|
| `OPENAI_API_KEY` | OpenAI API key | At least one provider | - |
| `ANTHROPIC_API_KEY` | Anthropic API key | At least one provider | - |
| `AWS_REGION` | AWS region for Bedrock | No | `us-east-1` |
| `DEBUG` | Enable debug mode | No | `false` |
| `MAX_CONTEXT_TOKENS` | Maximum context window (tokens) | No | `4000` |
| `DEFAULT_MODEL` | Default LLM model | No | `gpt-4o` |
```bash
# Run tests with coverage
pytest tests/ -v --cov=app
```

```bash
# Format code
black app/ tests/

# Sort imports
isort app/ tests/

# Lint
flake8 app/ tests/

# Type checking
mypy app/
```

Build and run with Docker:

```bash
# Build image
docker build -t paf-core-agent .

# Run container
docker run -p 8000:8000 \
  -e OPENAI_API_KEY=your_key_here \
  paf-core-agent
```

The service is designed for deployment on AWS Fargate with:
- Application Load Balancer for HTTP/HTTPS traffic
- Auto Scaling based on CPU/memory metrics
- ECS service with health checks
- CloudWatch logging and monitoring
See the `terraform/` directory for Infrastructure as Code examples.
For production deployment:
- Use AWS Secrets Manager for API keys
- Configure VPC with private subnets for gRPC traffic
- Set up CloudWatch dashboards for monitoring
- Enable AWS X-Ray for distributed tracing
The service exposes Prometheus metrics at `/metrics`:
- Request latency and throughput
- Token usage per provider
- UPEE phase timing
- Error rates and types
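As a rough illustration of the "token usage per provider" metric, a per-provider counter might accumulate prompt and completion tokens like this (names and labels here are illustrative, not the service's actual Prometheus metric names):

```python
from collections import defaultdict

# Per-provider running totals; a real exporter would use a
# prometheus_client Counter with a "provider" label instead.
token_usage: dict[str, int] = defaultdict(int)

def record_tokens(provider: str, prompt_tokens: int, completion_tokens: int) -> None:
    """Accumulate total tokens consumed per provider."""
    token_usage[provider] += prompt_tokens + completion_tokens

record_tokens("openai", 120, 80)
record_tokens("anthropic", 50, 25)
```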
Structured JSON logs include:
- Request tracing with correlation IDs
- UPEE phase events
- LLM provider calls
- Performance metrics
- `/api/health` - Comprehensive health status
- `/api/health/live` - Liveness probe
- `/api/health/ready` - Readiness probe
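A readiness decision typically combines overall status with provider availability. The helper below is a hypothetical sketch; the payload fields (`status`, `providers`) are assumptions about the health response shape, not the service's documented schema:

```python
def is_ready(health: dict) -> bool:
    """Ready only if the service is up and at least one LLM provider is available."""
    providers = health.get("providers", {})
    return health.get("status") == "ok" and any(providers.values())

# Example payload with assumed field names
sample = {"status": "ok", "providers": {"openai": True, "anthropic": False}}
```

Separating liveness (process is up) from readiness (dependencies are usable) lets an orchestrator restart a hung container without pulling a merely degraded one out of the load balancer.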
Configure multiple providers for redundancy and cost optimization:
```bash
# Environment variables
OPENAI_API_KEY=sk-...
ANTHROPIC_API_KEY=sk-ant-...
AWS_REGION=us-east-1

# Default routing
DEFAULT_MODEL=gpt-3.5-turbo
```

- `MAX_CONCURRENT_REQUESTS=150` - Concurrent request limit
- `REQUEST_TIMEOUT=30` - Request timeout in seconds
- `MAX_CONTEXT_TOKENS=4000` - Context window size
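Redundancy across providers implies a routing rule: prefer the provider that owns the requested model, then fall back to whatever is available. The sketch below illustrates that idea; the model-to-provider mapping and fallback order are assumptions, not the service's actual routing logic:

```python
# Illustrative model ownership table (not the service's real registry)
PROVIDER_FOR_MODEL = {
    "gpt-4o": "openai",
    "gpt-3.5-turbo": "openai",
    "claude-3-5-sonnet": "anthropic",
}

def route(model: str, available: set[str],
          fallback_order=("openai", "anthropic", "bedrock")) -> str:
    """Pick the preferred provider for a model, falling back to any available one."""
    preferred = PROVIDER_FOR_MODEL.get(model)
    if preferred in available:
        return preferred
    for provider in fallback_order:
        if provider in available:
            return provider
    raise RuntimeError("No LLM provider available")
```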
- Authentication: HMAC signatures or JWT tokens
- Transport: HTTPS for client traffic, mTLS for gRPC
- Secrets: AWS Secrets Manager integration
- Network: VPC isolation for inter-service communication
- Fork the repository
- Create a feature branch
- Make your changes
- Add tests
- Run quality checks
- Submit a pull request
This project is licensed under the GNU Affero General Public License v3.0 (AGPL-3.0) - see the LICENSE file for details.
For questions and support:
- Create an issue in the repository
- Check the documentation at `/docs`
- Review the health status at `/api/health`
- ✅ **Core UPEE Loop** - Fully implemented with streaming support
- ✅ **Multi-Provider LLM** - OpenAI, Anthropic
- ✅ **File Processing** - Excel, CSV, and text file support with agentic processing
- ✅ **Memory Support** - Short-term conversation history
- ✅ **Streaming Chat** - Real-time Server-Sent Events
- ✅ **Debug Tools** - Request inspection and troubleshooting endpoints
- ✅ **Health Monitoring** - Comprehensive health checks and metrics

Status: ✅ **Production Ready** - Core functionality complete