AI Travel Concierge Agent


An intelligent, production-ready AI agent that provides comprehensive travel-planning assistance using Microsoft Semantic Kernel, Azure OpenAI, and Azure Cosmos DB. The agent orchestrates 7 specialized tools, covering weather forecasts, currency conversion, restaurant discovery, credit card optimization, knowledge retrieval, calendar scheduling, and multi-language translation, to deliver personalized travel recommendations.


Project Highlights

| Aspect | Details |
| --- | --- |
| AI Framework | Microsoft Semantic Kernel with Azure OpenAI (GPT-4) |
| Architecture | Multi-agent tool orchestration with a state machine |
| Memory Systems | Dual-layer: short-term (session) + long-term (Cosmos DB) |
| RAG Pipeline | Vector search with Azure Cosmos DB for knowledge retrieval |
| Tools Integrated | 7 specialized tools with real API integrations |
| Evaluation | LLM-as-Judge evaluation system with correction generation |
| Test Coverage | Comprehensive unit and integration tests |

Demo Output

$ python -m app.main
{
  "destination": "Paris",
  "travel_dates": "2026-06-01 to 2026-06-08",
  "weather": {
    "temperature_c": 22.5,
    "conditions": "Partly cloudy",
    "recommendation": "Great weather for sightseeing!"
  },
  "results": [
    {
      "title": "Le Clarence",
      "snippet": "Two-Michelin-starred haute cuisine in a luxurious mansion...",
      "url": "https://www.timeout.com/paris/en/restaurants/best-restaurants-in-paris",
      "category": "restaurant"
    }
  ],
  "card_recommendation": {
    "card": "Chase Sapphire Reserve",
    "benefit": "3x points on travel and dining",
    "fx_fee": "No FX fees",
    "source": "CardTools - Rules-based recommendation"
  },
  "currency_info": {
    "usd_to_eur": 0.8516,
    "sample_meal_usd": 100.0,
    "sample_meal_eur": 85.16,
    "points_earned": 300
  },
  "citations": [
    "https://www.timeout.com/paris/en/restaurants/best-restaurants-in-paris",
    "https://open-meteo.com - Weather data",
    "https://www.frankfurter.app - Currency rates"
  ],
  "next_steps": [
    "Book your flights to Paris",
    "Reserve accommodations in Paris",
    "Research local attractions and create an itinerary",
    "Notify your credit card company of travel plans"
  ]
}

Architecture

High-Level System Architecture

flowchart TB
    subgraph UserLayer["👤 User Interface"]
        UI[/"Natural Language Input"/]
        Output[/"JSON Travel Plan"/]
    end

    subgraph AgentCore["🤖 AI Travel Concierge Agent"]
        SK["Semantic Kernel<br/>Orchestrator"]
        SM["State Machine<br/>Controller"]

        subgraph Tools["🔧 Tool Plugins"]
            direction LR
            WT["☀️ Weather<br/>Open-Meteo"]
            FX["💱 FX<br/>Frankfurter"]
            SR["🔍 Search<br/>Azure AI + Bing"]
            CD["💳 Card<br/>Rules Engine"]
            KB["📚 Knowledge<br/>RAG Vector"]
            CL["📅 Calendar<br/>Scheduler"]
            TR["🌐 Translation<br/>Azure"]
        end

        subgraph Memory["🧠 Memory Systems"]
            STM["Short-Term Memory<br/>Session Context"]
            LTM["Long-Term Memory<br/>Cosmos DB"]
        end

        subgraph Eval["📊 Evaluation"]
            Judge["LLM-as-Judge<br/>Quality Scoring"]
        end
    end

    subgraph Azure["☁️ Azure Services"]
        AOI["Azure OpenAI<br/>GPT-4 + Embeddings"]
        CDB["Azure Cosmos DB<br/>Vector Database"]
        AIF["Azure AI Foundry<br/>Bing Grounding"]
    end

    subgraph External["🌐 External APIs"]
        OM["Open-Meteo API"]
        FF["Frankfurter API"]
    end

    UI --> SK
    SK <--> SM
    SK --> Tools
    SK <--> Memory
    SK --> Eval

    WT <--> OM
    FX <--> FF
    SR <--> AIF
    KB <--> CDB
    TR <--> AOI

    STM <--> AOI
    LTM <--> CDB
    Judge <--> AOI

    SK --> Output

    style AgentCore fill:#e8f4f8,stroke:#0078d4,stroke-width:2px
    style Azure fill:#fff4e6,stroke:#ff8c00,stroke-width:2px
    style Tools fill:#f0f9ff,stroke:#0ea5e9,stroke-width:1px
    style Memory fill:#fdf4ff,stroke:#a855f7,stroke-width:1px

Agent State Machine

The agent follows a robust execution workflow with error handling and recovery states:

stateDiagram-v2
    [*] --> Init: Start

    Init --> ClarifyRequirements: Extract requirements

    ClarifyRequirements --> PlanTools: Requirements clear
    ClarifyRequirements --> AwaitingUserClarification: Need more info
    AwaitingUserClarification --> ClarifyRequirements: User responds

    PlanTools --> ExecuteTools: Tools selected

    ExecuteTools --> ValidatingResults: Tools complete
    ExecuteTools --> HandlingToolError: Tool failed

    HandlingToolError --> RetryingTools: Can retry
    HandlingToolError --> EscalatingError: Max retries
    RetryingTools --> ExecuteTools: Retry

    ValidatingResults --> Synthesize: Validation passed
    ValidatingResults --> RetryingTools: Validation failed

    Synthesize --> Done: Plan generated

    EscalatingError --> Done: Error response

    Done --> [*]: Complete

    note right of Init: Initialize agent\nand validate config
    note right of ExecuteTools: Run Weather, FX,\nSearch, Card tools
    note right of Synthesize: Generate TripPlan\nJSON response

Data Flow Diagram

flowchart LR
    subgraph Input
        A["🗣️ User Query<br/>'Plan a trip to Paris...'"]
    end

    subgraph Processing["Agent Processing Pipeline"]
        B["📝 Extract<br/>Requirements"]
        C["🔧 Execute<br/>Tools"]
        D["🔀 Aggregate<br/>Results"]
        E["✨ Synthesize<br/>Response"]
    end

    subgraph ToolCalls["Parallel Tool Execution"]
        T1["☀️ Weather API"]
        T2["💱 FX API"]
        T3["🔍 Bing Search"]
        T4["💳 Card Rules"]
    end

    subgraph Output
        F["📋 TripPlan JSON"]
    end

    A --> B
    B --> C
    C --> T1 & T2 & T3 & T4
    T1 & T2 & T3 & T4 --> D
    D --> E
    E --> F

    style Processing fill:#dbeafe,stroke:#3b82f6
    style ToolCalls fill:#dcfce7,stroke:#22c55e

Tool Orchestration

flowchart TB
    subgraph SK["Semantic Kernel"]
        Kernel["Kernel Instance"]

        subgraph Plugins["Registered Plugins"]
            P1["WeatherTools"]
            P2["FxTools"]
            P3["SearchTools"]
            P4["CardTools"]
            P5["KnowledgeTools"]
            P6["CalendarTools"]
            P7["TranslationTools"]
        end
    end

    subgraph Functions["@kernel_function Decorators"]
        F1["get_weather(lat, lon)"]
        F2["convert_fx(amount, base, target)"]
        F3["web_search(query, max_results)"]
        F4["recommend_card(mcc, amount, country)"]
        F5["get_card_recommendation(mcc, country)"]
        F6["check_availability(start, end)"]
        F7["translate_text(text, target_lang)"]
    end

    subgraph APIs["External Services"]
        A1["Open-Meteo"]
        A2["Frankfurter"]
        A3["Azure AI + Bing"]
        A4["Rules Engine"]
        A5["Cosmos DB RAG"]
        A6["In-Memory"]
        A7["Azure Translator"]
    end

    Kernel --> Plugins
    P1 --> F1 --> A1
    P2 --> F2 --> A2
    P3 --> F3 --> A3
    P4 --> F4 --> A4
    P5 --> F5 --> A5
    P6 --> F6 --> A6
    P7 --> F7 --> A7

    style SK fill:#f3e8ff,stroke:#9333ea,stroke-width:2px
    style Plugins fill:#fef3c7,stroke:#f59e0b
    style Functions fill:#dbeafe,stroke:#3b82f6

Memory Architecture

flowchart TB
    subgraph STM["Short-Term Memory"]
        direction TB
        S1["Conversation History"]
        S2["Tool Call Records"]
        S3["Session Context"]
        S4["Sliding Window<br/>Eviction"]
    end

    subgraph LTM["Long-Term Memory"]
        direction TB
        L1["MemoryItem Storage"]
        L2["Importance Scoring"]
        L3["Access Tracking"]
        L4["Pruning Strategies"]
    end

    subgraph Pruning["Pruning Strategies"]
        direction LR
        PR1["By Importance"]
        PR2["By Age"]
        PR3["By Access Freq"]
        PR4["Hybrid + AI"]
    end

    subgraph Storage["Cosmos DB"]
        DB[("Vector Database<br/>with Embeddings")]
    end

    User["User Session"] --> STM
    STM --> |"Important memories"| LTM
    LTM --> L4
    L4 --> Pruning
    LTM <--> DB

    style STM fill:#fef3c7,stroke:#f59e0b,stroke-width:2px
    style LTM fill:#dbeafe,stroke:#3b82f6,stroke-width:2px
    style Storage fill:#dcfce7,stroke:#22c55e,stroke-width:2px

Technology Stack

Core Technologies

| Technology | Purpose | Why This Choice |
| --- | --- | --- |
| Python 3.11+ | Primary language | Modern async support, type hints |
| Microsoft Semantic Kernel | AI orchestration | Enterprise-grade tool orchestration, native Azure integration |
| Azure OpenAI (GPT-4) | LLM backbone | Production reliability, enterprise security |
| Azure Cosmos DB | Vector database | Scalable NoSQL with vector search, global distribution |
| Pydantic | Data validation | Type-safe models, JSON serialization |

Azure Services

  • Azure OpenAI Service - GPT-4 for chat, text-embedding-3-small for embeddings
  • Azure Cosmos DB - Vector storage for RAG and long-term memory
  • Azure AI Foundry - Bing grounding for web search
  • Azure Translator - Multi-language translation support

External APIs

  • Open-Meteo - Weather forecasts (free, no key required)
  • Frankfurter - Real-time currency exchange rates (free)
  • Bing Search - Web search via Azure AI Foundry Agent

Key Features

1. Intelligent Tool Orchestration

Seven specialized tools that work together to provide comprehensive travel assistance:

| Tool | Function | API/Source |
| --- | --- | --- |
| WeatherTools | 7-day weather forecasts | Open-Meteo API |
| FxTools | Real-time currency conversion | Frankfurter API |
| SearchTools | Restaurant & attraction discovery | Azure AI + Bing |
| CardTools | Credit card optimization | Rules-based engine |
| KnowledgeTools | RAG-based knowledge retrieval | Cosmos DB vectors |
| CalendarTools | Availability checking & scheduling | Built-in scheduler |
| TranslationTools | Multi-language phrasebook | Azure Translator |
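To illustrate the rules-based approach behind CardTools, here is a minimal sketch; the card data and matching logic are hypothetical, not the project's actual rules engine:

```python
# Minimal rules-based card recommendation sketch (hypothetical card data
# and matching logic, not the project's actual CardTools rules).
CARD_RULES = [
    {"card": "Chase Sapphire Reserve", "mccs": {"travel", "dining"},
     "benefit": "3x points on travel and dining", "fx_fee": "No FX fees"},
    {"card": "BankGold", "mccs": {"grocery"},
     "benefit": "2x points on groceries", "fx_fee": "3% FX fee"},
]

def recommend_card(mcc: str, country: str = "US") -> dict:
    """Return the first card whose rules match the merchant category."""
    for rule in CARD_RULES:
        if mcc in rule["mccs"]:
            return rule
    # Fall back to the first card when no rule matches
    return CARD_RULES[0]
```

A deterministic rules engine like this keeps card recommendations auditable and free of LLM hallucination, which matters for financial advice.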

2. Dual-Layer Memory System

Short-Term Memory (app/memory.py)

  • Session-based conversation context
  • Sliding window eviction (by items and tokens)
  • Tool call tracking and search
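The sliding-window eviction above can be sketched in a few lines; this is an illustrative toy, not the actual `app/memory.py`, and the token estimate is a deliberately crude word count:

```python
from collections import deque

class ShortTermMemory:
    """Sliding-window session memory (illustrative sketch, not app/memory.py).

    Evicts the oldest turns when either the item count or a rough
    token budget is exceeded.
    """
    def __init__(self, max_items: int = 20, max_tokens: int = 2000):
        self.max_items = max_items
        self.max_tokens = max_tokens
        self.turns: deque = deque()

    def add(self, role: str, text: str) -> None:
        self.turns.append({"role": role, "text": text})
        # Evict from the left until both limits are satisfied
        while len(self.turns) > self.max_items or self._tokens() > self.max_tokens:
            self.turns.popleft()

    def _tokens(self) -> int:
        # Crude token estimate: whitespace-separated words
        return sum(len(t["text"].split()) for t in self.turns)
```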

Long-Term Memory (app/rag/long_term_memory/)

  • Persistent cross-session memory in Cosmos DB
  • Importance-based pruning strategies
  • AI-optimized memory reordering
  • Access frequency tracking

3. RAG Pipeline with Vector Search

  • Semantic search using Azure OpenAI embeddings
  • Cosmos DB vector indexing for fast retrieval
  • Knowledge base with credit card benefits data
  • Cosine similarity scoring for relevance ranking
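Relevance ranking reduces to cosine similarity between the query embedding and each stored vector; a minimal pure-Python version of the scoring step:

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)
```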

4. LLM-as-Judge Evaluation System

Comprehensive evaluation with 6 criteria:

  • Accuracy (25%) - Correctness of information
  • Completeness (20%) - Coverage of user needs
  • Relevance (20%) - Appropriateness to query
  • Tool Usage (15%) - Effective tool orchestration
  • Structure (10%) - Response organization
  • Citations (10%) - Source attribution

Enhanced Features:

  • Automatic correction generation for low-scoring responses
  • Tool usage suggestions and optimization
  • Debugging insights for workflow issues
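Combining the six criteria into one score is a weighted sum; the weights below come from the list above, while the 0-10 criterion scale is illustrative (the judge's actual scale may differ):

```python
# Weights from the six evaluation criteria above; the 0-10 criterion
# scale used here is illustrative, not necessarily the judge's own.
WEIGHTS = {
    "accuracy": 0.25, "completeness": 0.20, "relevance": 0.20,
    "tool_usage": 0.15, "structure": 0.10, "citations": 0.10,
}

def overall_score(scores: dict[str, float]) -> float:
    """Combine per-criterion judge scores into one weighted score."""
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)
```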

5. Structured Output with Pydantic Models

Type-safe data models ensuring consistent, validated responses:

from typing import List, Optional

from pydantic import BaseModel

class TripPlan(BaseModel):
    destination: str
    travel_dates: str
    weather: Optional[Weather]
    results: Optional[List[SearchResult]]
    card_recommendation: CardRecommendation
    currency_info: CurrencyInfo
    citations: Optional[List[str]]
    next_steps: List[str]

Project Structure

AI-Travel-Concierge/
├── app/
│   ├── main.py                 # Entry point - Semantic Kernel orchestration
│   ├── models.py               # Pydantic data models (TripPlan, Weather, etc.)
│   ├── state.py                # Agent state machine (12 phases)
│   ├── memory.py               # Short-term memory system
│   ├── synthesis.py            # Response synthesis and formatting
│   ├── filters.py              # Kernel filters (logging, telemetry, guardrails)
│   │
│   ├── tools/                  # Tool plugins
│   │   ├── weather.py          # Weather forecasts (Open-Meteo)
│   │   ├── fx.py               # Currency conversion (Frankfurter)
│   │   ├── search.py           # Web search (Azure AI + Bing)
│   │   ├── card.py             # Credit card recommendations
│   │   ├── knowledge.py        # RAG knowledge retrieval
│   │   ├── calendar.py         # Calendar scheduling
│   │   └── translation.py      # Multi-language translation
│   │
│   ├── rag/                    # RAG pipeline
│   │   ├── ingest.py           # Data ingestion with embeddings
│   │   ├── retriever.py        # Vector search retrieval
│   │   └── long_term_memory/   # Long-term memory module
│   │       ├── core.py         # LongTermMemory class
│   │       ├── models.py       # MemoryItem dataclass
│   │       ├── db.py           # Cosmos DB connection
│   │       ├── pruning.py      # Memory pruning strategies
│   │       ├── reordering.py   # Memory reordering
│   │       └── optimization.py # AI-optimized performance
│   │
│   ├── eval/                   # Evaluation system
│   │   ├── judge.py            # Simple rule-based evaluation
│   │   └── llm_judge.py        # LLM-as-Judge with corrections
│   │
│   └── utils/                  # Utilities
│       ├── config.py           # Configuration management
│       └── logger.py           # Structured logging
│
├── tests/                      # Test suite
│   ├── test_models.py          # Data model tests
│   ├── test_tools.py           # Tool function tests
│   ├── test_state.py           # State machine tests
│   ├── test_memory.py          # Memory system tests
│   └── test_memory_integration.py  # Integration tests
│
├── scripts/                    # Utility scripts
│   └── ingest_knowledge.py     # Knowledge base ingestion
│
├── requirements.txt            # Python dependencies
├── .env.example                # Environment variables template
└── README.md                   # This file

Installation

Prerequisites

  • Python 3.11+
  • Azure subscription with:
    • Azure OpenAI Service (GPT-4 and embedding deployments)
    • Azure Cosmos DB account
    • Azure AI Foundry project (for Bing search)

Setup

  1. Clone the repository

    git clone https://github.com/yourusername/AI-Travel-Concierge.git
    cd AI-Travel-Concierge
  2. Create virtual environment

    python -m venv .venv
    source .venv/bin/activate  # On Windows: .venv\Scripts\activate
  3. Install dependencies

    pip install -r requirements.txt
  4. Configure environment variables

    cp .env.example .env
    # Edit .env with your Azure credentials

    Required environment variables:

    # Azure OpenAI
    AZURE_OPENAI_ENDPOINT=https://your-resource.openai.azure.com/
    AZURE_OPENAI_KEY=your_key_here
    AZURE_OPENAI_API_VERSION=2025-01-01-preview
    AZURE_OPENAI_CHAT_DEPLOYMENT=gpt-4.1
    AZURE_OPENAI_EMBED_DEPLOYMENT=text-embedding-3-small
    
    # Azure Cosmos DB
    COSMOS_ENDPOINT=https://your-cosmos.documents.azure.com:443/
    COSMOS_KEY=your_cosmos_key_here
    COSMOS_DB=ragdb
    COSMOS_CONTAINER=snippets
    
    # Azure AI Foundry (for Bing Search)
    PROJECT_ENDPOINT=https://your-project.services.ai.azure.com/api/projects/proj-default
    AGENT_ID=your_agent_id
  5. Ingest knowledge base (optional)

    python -m app.rag.ingest
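Fail-fast validation of the required settings can be sketched as below; the variable names come from the `.env` template above, but the loader itself is an assumption, not the project's `app/utils/config.py`:

```python
import os

# Variable names taken from the .env template; this loader is a sketch,
# not the project's actual app/utils/config.py.
REQUIRED_VARS = ["AZURE_OPENAI_ENDPOINT", "AZURE_OPENAI_KEY", "COSMOS_ENDPOINT"]

def load_config(required: list[str] = REQUIRED_VARS) -> dict[str, str]:
    """Read required settings from the environment, failing fast if any are missing."""
    missing = [name for name in required if not os.environ.get(name)]
    if missing:
        raise RuntimeError(f"Missing environment variables: {', '.join(missing)}")
    return {name: os.environ[name] for name in required}
```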

Usage

Run the Agent

python -m app.main

Programmatic Usage

from app.main import run_request
import json

# Natural language travel planning
result = run_request(
    "I want to go to Paris from 2026-06-01 to 2026-06-08 with my BankGold card"
)
plan = json.loads(result)

print(f"Destination: {plan['destination']}")
print(f"Weather: {plan['weather']['conditions']}")
print(f"Best Card: {plan['card_recommendation']['card']}")

Interactive Chat

python chat.py

Testing

Run All Tests

python -m pytest tests/ -v

Run Specific Test Categories

# Data models
python -m pytest tests/test_models.py -v

# Tool functions
python -m pytest tests/test_tools.py -v

# State machine
python -m pytest tests/test_state.py -v

# Memory systems
python -m pytest tests/test_memory.py -v

Run Evaluation

python -m app.eval.judge

Technical Deep Dive

State Machine Implementation

The agent uses a sophisticated state machine with 12 phases for robust execution:

from enum import Enum

class Phase(Enum):
    # Core workflow
    Init = "Init"
    ClarifyRequirements = "ClarifyRequirements"
    PlanTools = "PlanTools"
    ExecuteTools = "ExecuteTools"
    Synthesize = "Synthesize"
    Done = "Done"

    # Error handling
    AWAITING_USER_CLARIFICATION = "AwaitingUserClarification"
    HANDLING_TOOL_ERROR = "HandlingToolError"
    VALIDATING_RESULTS = "ValidatingResults"
    RETRYING_TOOLS = "RetryingTools"
    ESCALATING_ERROR = "EscalatingError"
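The legal phase changes from the state diagram can be captured as an adjacency map and checked before every transition; this is a sketch that follows the diagram's names, not necessarily how `app/state.py` structures it:

```python
# Allowed phase transitions, mirroring the state diagram above
# (a sketch; names follow the diagram, not necessarily app/state.py).
TRANSITIONS = {
    "Init": {"ClarifyRequirements"},
    "ClarifyRequirements": {"PlanTools", "AwaitingUserClarification"},
    "AwaitingUserClarification": {"ClarifyRequirements"},
    "PlanTools": {"ExecuteTools"},
    "ExecuteTools": {"ValidatingResults", "HandlingToolError"},
    "HandlingToolError": {"RetryingTools", "EscalatingError"},
    "RetryingTools": {"ExecuteTools"},
    "ValidatingResults": {"Synthesize", "RetryingTools"},
    "Synthesize": {"Done"},
    "EscalatingError": {"Done"},
    "Done": set(),
}

def can_transition(current: str, nxt: str) -> bool:
    """Return True when the state machine permits moving current -> nxt."""
    return nxt in TRANSITIONS.get(current, set())
```

Guarding every phase change this way turns an illegal transition into an explicit error instead of a silent workflow bug.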

Memory Pruning Strategies

Long-term memory supports multiple pruning strategies:

  1. Importance-based - Remove low-importance memories first
  2. Age-based - Remove oldest memories
  3. Access frequency - Remove least-accessed memories
  4. Hybrid - Combination of all strategies with AI scoring

RAG Implementation

flowchart LR
    subgraph Query["Query Processing"]
        Q1["User Query"]
        Q2["Generate Embedding<br/>text-embedding-3-small"]
    end

    subgraph Search["Vector Search"]
        S1[("Cosmos DB<br/>Vector Index")]
        S2["Cosine Similarity<br/>Calculation"]
        S3["Top-K Ranking"]
    end

    subgraph Results["Results"]
        R1["Retrieved Documents"]
        R2["Context for LLM"]
    end

    Q1 --> Q2
    Q2 --> S1
    S1 --> S2
    S2 --> S3
    S3 --> R1
    R1 --> R2

    style Query fill:#fef3c7,stroke:#f59e0b
    style Search fill:#dbeafe,stroke:#3b82f6
    style Results fill:#dcfce7,stroke:#22c55e
# Vector search with cosine similarity
def retrieve(query: str, k: int = 5) -> List[Dict]:
    # 1. Generate the query embedding
    query_embedding = embed_texts([query])[0]

    # 2. Fetch candidate documents from Cosmos DB
    documents = container.query_items(...)

    # 3. Score each document by cosine similarity
    scored = [
        (cosine_similarity(query_embedding, doc["embedding"]), doc)
        for doc in documents
    ]

    # 4. Return the top-k documents by similarity
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[:k]]

What This Project Demonstrates

Software Engineering Skills

  • Clean Architecture - Separation of concerns, modular design
  • Design Patterns - State machine, plugin architecture, factory pattern
  • Error Handling - Graceful degradation, retry logic, fallbacks
  • Type Safety - Pydantic models, type hints throughout
  • Testing - Unit tests, integration tests, evaluation harness

AI/ML Engineering Skills

  • LLM Integration - Azure OpenAI with Semantic Kernel
  • RAG Pipeline - Vector embeddings, semantic search
  • Prompt Engineering - Structured prompts, JSON extraction
  • Agent Design - Multi-tool orchestration, state management
  • Evaluation - LLM-as-Judge, automated quality assessment

Cloud & DevOps Skills

  • Azure Services - OpenAI, Cosmos DB, AI Foundry
  • Configuration Management - Environment variables, validation
  • Logging & Monitoring - Structured logging, telemetry hooks
  • Security - Credential management, no hardcoded secrets

Future Enhancements

  • Flight Booking Integration - Amadeus/Skyscanner API
  • Hotel Recommendations - Booking.com/Expedia integration
  • Interactive Maps - Google Maps visualization
  • Voice Interface - Azure Speech Services
  • Mobile App - React Native frontend
  • Multi-user Support - User authentication and profiles
  • Itinerary Generation - Day-by-day planning with AI

Author

Saurabh Bhardwaj

  • Building production-ready AI agents with Microsoft Semantic Kernel
  • Expertise in Azure AI services and cloud architecture
  • Passionate about creating intelligent, user-friendly applications

License

This project is licensed under the MIT License - see the LICENSE file for details.



About

A sophisticated AI agent that delivers an exclusive travel planning service for premium credit card holders, reinforcing the value of their cards and enhancing customer loyalty. Built with Azure AI Foundry and Semantic Kernel.
