Guidance on deploying a generative AI document analysis solution with Amazon Bedrock AgentCore. It auto-classifies, enhances, and aggregates multi-type documents using Gestalt-informed vision prompts, and includes a custom analyzer creation wizard, scripted CDK deployment, and a Gradio frontend.


🧭 Navigation:
πŸ”΅ Home | Vision LLM Theory | Frontend | Deployment | CDK Stacks | Runtime | S3 Files | Lambda Analyzers | Prompting System | Analyzer Wizard | Pricing Calculator


🦑 BADGERS

Broad Agentic Document Generative Extraction & Recognition System

BADGERS transforms document processing through vision-enabled AI and deep layout analysis. Unlike traditional text extraction tools, BADGERS understands document structure and meaning by recognizing visual hierarchies, reading patterns, and contextual relationships between elements.

πŸ€” Why BADGERS?

Traditional document processing tools extract text but lose context. They can't distinguish a header from body text, understand table relationships, or recognize that a diagram explains the adjacent paragraph. BADGERS solves this by:

  • πŸ—οΈ Preserving semantic structure - Maintains document hierarchy and element relationships
  • πŸ‘οΈ Understanding visual context - Recognizes how layout conveys meaning
  • πŸ“š Processing diverse content - Handles 21+ element types from handwriting to equations
  • πŸ€– Automating complex workflows - Orchestrates multiple specialized analyzers via an AI agent

Use cases: research acceleration, compliance automation, content management, accessibility remediation.

βš™οΈ How It Works

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                           AgentCore Runtime                                 β”‚
β”‚   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”   β”‚
β”‚   β”‚  PDF Analysis Agent (Strands)                                       β”‚   β”‚
β”‚   β”‚  - Claude Sonnet 4.5 with Extended Thinking                         β”‚   β”‚
β”‚   β”‚  - Session state management                                         β”‚   β”‚
β”‚   β”‚  - MCP tool orchestration                                           β”‚   β”‚
β”‚   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜   β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                                      β”‚
                                      β–Ό
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                           AgentCore Gateway                                 β”‚
β”‚   - MCP Protocol (2025-03-26)                                               β”‚
β”‚   - Cognito JWT Authentication                                              β”‚
β”‚   - Semantic tool search                                                    β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                                      β”‚
                   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
                   β”‚                  β”‚                  β”‚
                   β–Ό                  β–Ό                  β–Ό
            β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
            β”‚   Lambda    β”‚    β”‚   Lambda    β”‚    β”‚   Lambda    β”‚
            β”‚  Analyzer   β”‚    β”‚  Analyzer   β”‚    β”‚  Analyzer   β”‚
            β”‚ (29 tools)  β”‚    β”‚             β”‚    β”‚             β”‚
            β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                   β”‚                  β”‚                  β”‚
                   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                                      β–Ό
                               β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
                               β”‚   Bedrock   β”‚
                               β”‚   Claude    β”‚
                               β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
  1. πŸ“„ User submits a document with analysis instructions
  2. 🧠 Strands Agent (running in AgentCore Runtime) interprets the request
  3. πŸ”§ Agent selects tools from 29 specialized analyzers via MCP Gateway
  4. ⚑ Lambda analyzers (29 functions) process document elements using Claude vision models
  5. πŸ“Š Results aggregate with preserved structure and semantic relationships
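Once deployed, the agent is reachable through the AgentCore Runtime API. Below is a minimal invocation sketch, assuming boto3 credentials are configured; the runtime ARN placeholder and the payload shape shown here are assumptions, not the exact contract the Gradio frontend uses:

import json
import uuid
import boto3

# us-west-2 matches the default aws_region in the global settings later in this README
client = boto3.client("bedrock-agentcore", region_name="us-west-2")

response = client.invoke_agent_runtime(
    agentRuntimeArn="arn:aws:bedrock-agentcore:us-west-2:111122223333:runtime/EXAMPLE",  # placeholder ARN
    runtimeSessionId=str(uuid.uuid4()),  # session IDs must be long enough and unique per conversation
    payload=json.dumps({"prompt": "Classify this PDF and extract all of its tables"}),
)
print(response["response"].read().decode())  # the agent's aggregated answer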

πŸ› οΈ Tech Stack

Component Technology
πŸ€– Agent Framework Strands Agents
🏠 Agent Hosting Amazon Bedrock AgentCore Runtime
πŸšͺ Tool Gateway Amazon Bedrock AgentCore Gateway (MCP Protocol)
🧠 Foundation Model Claude Sonnet 4.5 (via Amazon Bedrock)
⚑ Compute AWS Lambda (29 analyzer functions)
πŸ“¦ Storage Amazon S3 (configs, prompts, outputs)
πŸ” Auth Amazon Cognito (OAuth 2.0 client credentials)
πŸ—οΈ IaC AWS CDK (Python)
πŸ“ˆ Observability CloudWatch Logs, X-Ray
πŸ“Š Cost Tracking Bedrock Application Inference Profiles

πŸ”¬ Analyzers

Analyzer Purpose
πŸ“Έ pdf_to_images_converter Convert PDF pages to images
🏷️ classify_pdf_content Classify document content type
πŸ“ full_text_analyzer Extract all text content
πŸ“Š table_analyzer Extract and structure tables
πŸ“ˆ charts_analyzer Analyze charts and graphs
πŸ”€ diagram_analyzer Process diagrams and flowcharts
πŸ“ layout_analyzer Document structure analysis
β™Ώ accessibility_analyzer Generate accessibility metadata
πŸ₯ decision_tree_analyzer Medical/clinical document analysis
πŸ”¬ scientific_analyzer Scientific paper analysis
✍️ handwriting_analyzer Handwritten text recognition
πŸ’» code_block_analyzer Extract code snippets
πŸ—‚οΈ metadata_generic_analyzer Generic metadata extraction
πŸ—‚οΈ metadata_mads_analyzer MADS metadata format extraction
πŸ—‚οΈ metadata_mods_analyzer MODS metadata format extraction
πŸ”‘ keyword_topic_analyzer Extract keywords and topics
πŸ”§ remediation_analyzer Analyze remediation requirements
πŸ“„ page_analyzer Single page content analysis
🧱 elements_analyzer Document element detection
🧱 robust_elements_analyzer Enhanced element detection with fallbacks
πŸ‘οΈ general_visual_analysis_analyzer General-purpose visual content analysis
✏️ editorial_analyzer Editorial content and markup analysis
πŸ—ΊοΈ war_map_analyzer Historical war map analysis
πŸŽ“ edu_transcript_analyzer Educational transcript analysis
πŸ”— correlation_analyzer Correlate multi-analyzer results per page
πŸ–ΌοΈ image_enhancer Image enhancement and preprocessing

πŸš€ Deployment

Prerequisites

See deployment/DEPLOYMENT_README.md for the required tooling and account setup.

Quick Start

cd deployment
./deploy_from_scratch.sh

This deploys 10 CloudFormation stacks:

  1. πŸ“¦ S3 (config + output buckets)
  2. πŸ” Cognito (OAuth authentication)
  3. πŸ‘€ IAM (execution roles)
  4. 🐳 ECR (container registry)
  5. ⚑ Lambda (29 analyzer functions)
  6. πŸšͺ Gateway (MCP endpoint)
  7. 🧠 Memory (session persistence)
  8. πŸ“Š Inference Profiles (cost tracking)
  9. πŸƒ Runtime (Strands agent container)
  10. 🧩 Custom Analyzers (optional, wizard-created)
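deployment/app.py wires these stacks together. The sketch below illustrates the pattern only; the stack classes and constructor parameters are hypothetical, though the badgers-lambda and badgers-gateway stack IDs reappear in the redeploy command later in this README:

# Illustrative only -- the real stack classes live in deployment/stacks/
import aws_cdk as cdk
from stacks.s3_stack import S3Stack            # hypothetical module and class names
from stacks.lambda_stack import LambdaStack
from stacks.gateway_stack import GatewayStack

app = cdk.App()
s3 = S3Stack(app, "badgers-s3")
lambdas = LambdaStack(app, "badgers-lambda", config_bucket=s3.config_bucket)
gateway = GatewayStack(app, "badgers-gateway", analyzer_functions=lambdas.functions)
gateway.add_dependency(lambdas)  # analyzer Lambdas must exist before Gateway registration
app.synth()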

Manual Steps

See deployment/DEPLOYMENT_README.md for step-by-step instructions.

Cleanup

cd deployment
./destroy.sh

πŸ“ Project Structure

β”œβ”€β”€ deployment/
β”‚   β”œβ”€β”€ app.py                 # CDK app entry point
β”‚   β”œβ”€β”€ stacks/                # CDK stack definitions
β”‚   β”œβ”€β”€ lambdas/code/          # Analyzer Lambda functions
β”‚   β”œβ”€β”€ runtime/               # AgentCore Runtime container
β”‚   β”œβ”€β”€ s3_files/              # Prompts, schemas, manifests
β”‚   └── badgers-foundation/    # Shared analyzer framework
β”œβ”€β”€ frontend/
β”‚   β”œβ”€β”€ main.py                # Multi-page Gradio app entry point
β”‚   └── pages/                 # UI modules (chat, wizard, editor, etc.)
└── pyproject.toml

πŸ“„ License

MIT


πŸ” Technical Deep Dive

πŸ“¦ Lambda Layers

BADGERS uses Lambda layers shared across analyzer functions:

πŸ—οΈ Foundation Layer (layer.zip)

  • Built via deployment/lambdas/build_foundation_layer.sh
  • Contains the analyzer framework (7 Python modules)
  • Includes dependencies: boto3, botocore
  • Includes core system prompts used by all analyzers
layer/python/
β”œβ”€β”€ foundation/
β”‚   β”œβ”€β”€ analyzer_foundation.py    # 🎯 Main orchestration class
β”‚   β”œβ”€β”€ bedrock_client.py         # πŸ”„ Bedrock API with retry/fallback
β”‚   β”œβ”€β”€ configuration_manager.py  # βš™οΈ Config loading/validation
β”‚   β”œβ”€β”€ image_processor.py        # πŸ–ΌοΈ Image optimization
β”‚   β”œβ”€β”€ message_chain_builder.py  # πŸ’¬ Claude message formatting
β”‚   β”œβ”€β”€ prompt_loader.py          # πŸ“œ Prompt file loading (local/S3)
β”‚   └── response_processor.py     # πŸ“€ Response extraction
β”œβ”€β”€ config/
β”‚   └── config.py
└── prompts/core_system_prompts/
    └── *.xml

πŸ“„ Poppler Layer (poppler-layer.zip)

  • PDF rendering library for pdf_to_images_converter
  • Built via deployment/lambdas/build_poppler_layer.sh

πŸ”¬ How an Analyzer Works

Each analyzer follows the same pattern using AnalyzerFoundation:

# Lambda handler (simplified)
# AnalyzerFoundation ships in the shared foundation layer (layer/python/foundation/)
from foundation.analyzer_foundation import AnalyzerFoundation

def lambda_handler(event, context):
    # Tool inputs per the manifest schema: image_path, session_id, audit_mode
    image_path = event["image_path"]
    session_id = event["session_id"]

    # 1️⃣ Load config from S3 manifest
    config = load_manifest_from_s3(bucket, "full_text_analyzer")

    # 2️⃣ Initialize foundation with S3-aware prompt loader
    analyzer = AnalyzerFoundation(...)

    # 3️⃣ Run analysis pipeline
    result = analyzer.analyze(image_data)

    # 4️⃣ Save result to S3 and return
    save_result_to_s3(result, session_id)
    return {"result": result}

The analyze() method orchestrates:

  1. πŸ–ΌοΈ Image processing - Resize/optimize for Claude's vision API
  2. πŸ“œ Prompt loading - Combine wrapper + analyzer prompts from S3
  3. πŸ’¬ Message building - Format for Bedrock Converse API
  4. πŸ€– Model invocation - Call Claude with retry/fallback logic
  5. βœ… Response processing - Extract and validate result
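Mapped onto the foundation modules listed earlier, the pipeline plausibly reduces to the following sketch (the attribute and method names beyond analyze() are illustrative):

class AnalyzerFoundation:
    def analyze(self, image_data: bytes) -> str:
        # Illustrative pipeline only -- the real steps span the foundation modules above
        image = self.image_processor.optimize(image_data)                  # 1️⃣ resize/optimize
        system_prompt = self.prompt_loader.compose(self.config)           # 2️⃣ wrapper + analyzer prompts
        messages = self.message_chain_builder.build(system_prompt, image) # 3️⃣ Converse format
        response = self.bedrock_client.invoke_with_fallback(messages)     # 4️⃣ retry/fallback
        return self.response_processor.extract(response)                  # 5️⃣ validate and return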

πŸ“œ Prompting System

Prompts are modular XML files composed at runtime:

s3://config-bucket/
β”œβ”€β”€ core_system_prompts/
β”‚   β”œβ”€β”€ prompt_system_wrapper.xml   # 🎁 Main template with placeholders
β”‚   β”œβ”€β”€ core_rules/rules.xml        # πŸ“ Shared rules for all analyzers
β”‚   └── error_handling/*.xml        # ⚠️ Error response templates
β”œβ”€β”€ prompts/{analyzer_name}/
β”‚   β”œβ”€β”€ {analyzer}_job_role.xml     # πŸ‘€ Role definition
β”‚   β”œβ”€β”€ {analyzer}_context.xml      # 🌍 Domain context
β”‚   β”œβ”€β”€ {analyzer}_rules.xml        # πŸ“ Analyzer-specific rules
β”‚   β”œβ”€β”€ {analyzer}_tasks.xml        # βœ… Task instructions
β”‚   └── {analyzer}_format.xml       # πŸ“‹ Output format spec
└── wrappers/
    └── prompt_system_wrapper.xml

The PromptLoader composes the final system prompt:

<!-- prompt_system_wrapper.xml -->
<system_prompt>
    {core_rules}           <!-- πŸ“ Injected from core_rules/rules.xml -->
    {composed_prompt}      <!-- 🧩 Injected from analyzer prompt files -->
    {error_handler_general}
    {error_handler_not_found}
</system_prompt>

Placeholders like [[PIXEL_WIDTH]] and [[PIXEL_HEIGHT]] are replaced with actual image dimensions at runtime.
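A sketch of that composition step, assuming straightforward string substitution (the actual PromptLoader may differ):

def compose_system_prompt(wrapper: str, core_rules: str, fragments: list[str],
                          width: int, height: int) -> str:
    """Illustrative sketch: fill the wrapper placeholders, then the runtime ones."""
    prompt = wrapper.replace("{core_rules}", core_rules)
    prompt = prompt.replace("{composed_prompt}", "\n".join(fragments))
    # Error handler templates ({error_handler_general}, etc.) are injected the same way.
    prompt = prompt.replace("[[PIXEL_WIDTH]]", str(width))
    prompt = prompt.replace("[[PIXEL_HEIGHT]]", str(height))
    return prompt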

βš™οΈ Configuration System

Each analyzer has a manifest file in S3:

// s3://config-bucket/manifests/full_text_analyzer.json
{
    "tool": {
        "name": "analyze_full_text_tool",
        "description": "Extracts text content maintaining reading order...",
        "inputSchema": {
            "type": "object",
            "properties": {
                "image_path": { "type": "string" },
                "session_id": { "type": "string" },
                "audit_mode": { "type": "boolean" }
            },
            "required": ["image_path", "session_id"]
        }
    },
    "analyzer": {
        "name": "full_text_analyzer",
        "enhancement_eligible": true,
        "model_selections": {
            "primary": "global.anthropic.claude-sonnet-4-5-20250929-v1:0",
            "fallback_list": [
                "us.anthropic.claude-haiku-4-5-20251001-v1:0",
                "us.amazon.nova-premier-v1:0"
            ]
        },
        "max_retries": 3,
        "prompt_files": [
            "full_text_job_role.xml",
            "full_text_context.xml",
            "full_text_rules.xml",
            "full_text_tasks_extraction.xml",
            "full_text_format.xml"
        ],
        "max_examples": 0,
        "analysis_text": "full text content",
        "expected_output_tokens": 6000,
        "output_extension": "xml"
    }
}
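A plausible shape for the load_manifest_from_s3 helper used by the handlers, assuming manifests live under manifests/ in the config bucket as shown above (a sketch, not the actual implementation):

import json
import boto3

s3 = boto3.client("s3")

def load_manifest_from_s3(bucket: str, analyzer_name: str) -> dict:
    """Fetch and parse an analyzer manifest from the config bucket."""
    key = f"manifests/{analyzer_name}.json"
    obj = s3.get_object(Bucket=bucket, Key=key)
    return json.loads(obj["Body"].read())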

Key configuration features:

  • πŸ”„ Model fallback chain - Primary model with ordered fallbacks
  • πŸ” Retry logic - Configurable retry count per analyzer
  • 🧩 Prompt composition - List of XML files to combine
  • πŸ“‹ Tool schema - MCP-compatible input schema for Gateway
  • πŸ–ΌοΈ Enhancement eligible - Flag indicating analyzer benefits from image preprocessing (used by image_enhancer tool)

Global settings (from environment or defaults):

{
    "max_tokens": 8000,
    "temperature": 0.1,
    "max_image_size": 20971520,  # 20MB
    "max_dimension": 2048,
    "jpeg_quality": 85,
    "throttle_delay": 1.0,
    "aws_region": "us-west-2"
}
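As one example of how these settings are applied, image_processor.py presumably enforces max_dimension and jpeg_quality along these lines (an illustrative sketch using Pillow, not the actual module):

from io import BytesIO
from PIL import Image

def optimize_for_vision(raw: bytes, max_dimension: int = 2048, jpeg_quality: int = 85) -> bytes:
    """Downscale and re-encode an image to respect the global size limits."""
    img = Image.open(BytesIO(raw))
    img.thumbnail((max_dimension, max_dimension))  # preserves aspect ratio
    buf = BytesIO()
    img.convert("RGB").save(buf, format="JPEG", quality=jpeg_quality)
    return buf.getvalue()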

πŸ“Š Inference Profiles for Cost Tracking

BADGERS uses Application Inference Profiles to enable cost allocation and usage monitoring. The system maps model IDs to profile ARNs at runtime:

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                        Inference Profile Flow                               β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚                                                                             β”‚
β”‚  1. CDK deploys InferenceProfilesStack                                      β”‚
β”‚     └─> Creates ApplicationInferenceProfile for each model                  β”‚
β”‚         β€’ badgers-claude-sonnet-{id}  (Global)                              β”‚
β”‚         β€’ badgers-claude-haiku-{id}   (Global)                              β”‚
β”‚         β€’ badgers-claude-opus-{id}    (Global)                              β”‚
β”‚         β€’ badgers-nova-premier-{id}   (US)                                  β”‚
β”‚                                                                             β”‚
β”‚  2. Runtime receives profile ARNs as environment variables                  β”‚
β”‚     └─> CLAUDE_SONNET_PROFILE_ARN, CLAUDE_HAIKU_PROFILE_ARN, etc.           β”‚
β”‚                                                                             β”‚
β”‚  3. At invocation, bedrock_client.py maps model_id β†’ profile ARN            β”‚
β”‚     └─> "global.anthropic.claude-sonnet-4-5-*" β†’ $CLAUDE_SONNET_PROFILE_ARN β”‚
β”‚                                                                             β”‚
β”‚  4. Bedrock invoked with profile ARN (enables cost tracking)                β”‚
β”‚     └─> Falls back to model ID if no profile configured                     β”‚
β”‚                                                                             β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Model ID to environment variable mapping:

Model Pattern Environment Variable
*claude-sonnet-4-5* CLAUDE_SONNET_PROFILE_ARN
*claude-haiku-4-5* CLAUDE_HAIKU_PROFILE_ARN
*claude-opus-4-6* CLAUDE_OPUS_PROFILE_ARN
*nova-premier* NOVA_PREMIER_PROFILE_ARN
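A sketch of the mapping logic; the helper name is illustrative, while the patterns and variables are the ones in the table above:

import os
from fnmatch import fnmatch

# Pattern → environment variable, as in the table above
PROFILE_ENV_VARS = {
    "*claude-sonnet-4-5*": "CLAUDE_SONNET_PROFILE_ARN",
    "*claude-haiku-4-5*": "CLAUDE_HAIKU_PROFILE_ARN",
    "*claude-opus-4-6*": "CLAUDE_OPUS_PROFILE_ARN",
    "*nova-premier*": "NOVA_PREMIER_PROFILE_ARN",
}

def resolve_invocation_target(model_id: str) -> str:
    """Return the inference profile ARN if configured, else the raw model ID."""
    for pattern, env_var in PROFILE_ENV_VARS.items():
        arn = os.environ.get(env_var)
        if arn and fnmatch(model_id, pattern):
            return arn
    return model_id  # falls back to the plain model ID (step 4 in the flow above)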

βž• Adding a New Analyzer

Option 1: Use the Wizard (Recommended)

cd frontend
uv run python main.py

The Analyzer Creation Wizard is available as a tab in the multi-page Gradio app.

Option 2: Manual Creation

  1. πŸ“œ Create prompt files in deployment/s3_files/prompts/{analyzer_name}/
  2. πŸ“‹ Create manifest in deployment/s3_files/manifests/{analyzer_name}.json
  3. πŸ“ Create schema in deployment/s3_files/schemas/{analyzer_name}.json
  4. ⚑ Create Lambda code in deployment/lambdas/code/{analyzer_name}/lambda_handler.py
  5. πŸ“ Register in deployment/stacks/lambda_stack.py
  6. πŸš€ Redeploy: cdk deploy badgers-lambda badgers-gateway

Notices

Customers are responsible for making their own independent assessment of the information in this Guidance. This Guidance: (a) is for informational purposes only, (b) represents AWS current product offerings and practices, which are subject to change without notice, and (c) does not create any commitments or assurances from AWS and its affiliates, suppliers or licensors. AWS products or services are provided "as is" without warranties, representations, or conditions of any kind, whether express or implied. AWS responsibilities and liabilities to its customers are controlled by AWS agreements, and this Guidance is not part of, nor does it modify, any agreement between AWS and its customers.


Authors

  • Randall Potter

πŸ“– Further Reading

  β€’ πŸ€– Amazon Bedrock & Foundation Models
  β€’ πŸš€ Amazon Bedrock AgentCore
  β€’ ⚑ AWS Lambda
  β€’ πŸ” Amazon Cognito
  β€’ πŸ“ˆ Observability
  β€’ πŸ“¦ Amazon S3
  β€’ πŸ’» Amazon Kiro IDE
