OpenAI-Compatible Proxy Server for Amazon Bedrock Agents

This component provides an OpenAI-compatible chat completions API that internally uses Amazon Bedrock Agents.

Features

  • OpenAI-compatible /v1/chat/completions endpoint
  • Support for both streaming and non-streaming responses
  • Session management for maintaining conversation context
  • Exact matching of OpenAI's response format
  • Comprehensive error handling and logging

Server Components

Main Application (app.py)

  • Flask server implementation
  • Request/response handling
  • Bedrock Agent integration
  • Format conversion
  • Error handling
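
The Bedrock Agent integration boils down to the bedrock-agent-runtime client's invoke_agent call. A minimal sketch of that call (not necessarily the exact code in app.py), using the AGENT_ID and AGENT_ALIAS_ID variables from .env:

import os
import boto3

# Runtime client for invoking Bedrock Agents; credentials come from the
# AWS_* environment variables
client = boto3.client("bedrock-agent-runtime", region_name=os.environ["AWS_REGION"])

def invoke_agent(session_id, user_text):
    """Send one user turn to the agent and collect the completion text."""
    response = client.invoke_agent(
        agentId=os.environ["AGENT_ID"],
        agentAliasId=os.environ["AGENT_ALIAS_ID"],
        sessionId=session_id,  # reusing the same ID preserves conversation context
        inputText=user_text,
    )
    # The completion arrives as an event stream of chunk events
    parts = []
    for event in response["completion"]:
        if "chunk" in event:
            parts.append(event["chunk"]["bytes"].decode("utf-8"))
    return "".join(parts)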

Streaming Support

The server implements Server-Sent Events (SSE) streaming that:

  • Matches OpenAI's chunk format exactly
  • Provides word-by-word streaming
  • Handles role and content deltas
  • Processes trace events and completion chunks
  • Maintains consistent message IDs
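
For illustration, the SSE framing for these chunks might look like the following in Flask (a sketch, not the literal implementation):

import json
import time
import uuid

from flask import Response

def stream_words(words):
    """Yield OpenAI-style chat.completion.chunk events, word by word."""
    msg_id = "chatcmpl-" + uuid.uuid4().hex  # one consistent ID for every chunk
    created = int(time.time())

    def chunk(delta, finish_reason=None):
        payload = {
            "id": msg_id,
            "object": "chat.completion.chunk",
            "created": created,
            "model": "bedrock-agent",
            "choices": [{"index": 0, "delta": delta, "finish_reason": finish_reason}],
        }
        return "data: " + json.dumps(payload) + "\n\n"

    def generate():
        yield chunk({"role": "assistant"})        # role delta first
        for word in words:
            yield chunk({"content": word + " "})  # then word-by-word content deltas
        yield chunk({}, finish_reason="stop")     # completion chunk
        yield "data: [DONE]\n\n"

    return Response(generate(), mimetype="text/event-stream")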

Streaming Test Tool (test_streaming.py)

A validation tool that:

  • Compares responses with OpenAI's API
  • Verifies streaming format compatibility
  • Checks chunk formatting and timing
  • Validates role and content handling
  • Measures streaming performance
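
The core of such a format check can be as simple as comparing the structural signature of chunks from both sources. A sketch of the idea (the actual tool also checks timing and content handling):

def chunk_shape(chunk):
    """Reduce a chat.completion.chunk dict to its structural signature."""
    choice = chunk["choices"][0]
    return (
        chunk["object"],
        tuple(sorted(choice["delta"].keys())),
        choice["finish_reason"],
    )

def formats_compatible(openai_chunks, proxy_chunks):
    """True if both streams produce the same set of chunk structures."""
    return {chunk_shape(c) for c in openai_chunks} == {chunk_shape(c) for c in proxy_chunks}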

Setup

  1. Install dependencies:
pip install -r requirements.txt
  2. Configure environment:
cp .env.example .env

Required variables in .env:

# AWS Credentials
AWS_ACCESS_KEY_ID=your_aws_access_key_id
AWS_SECRET_ACCESS_KEY=your_aws_secret_access_key
AWS_REGION=us-east-1

# Agent Configuration
AGENT_ID=your_bedrock_agent_id
AGENT_ALIAS_ID=your_bedrock_agent_alias_id

# Optional: For streaming comparison tests
OPENAI_API_KEY=your_openai_api_key

  3. Start the server:
python app.py
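
Once the server is running (Flask's default port is 5000), you can sanity-check the endpoint with a quick request, for example using the requests library:

import requests

resp = requests.post(
    "http://localhost:5000/v1/chat/completions",
    json={
        "model": "bedrock-agent",
        "messages": [{"role": "user", "content": "Hello"}],
    },
)
print(resp.json()["choices"][0]["message"]["content"])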

Running Locally with ngrok

To make your local server accessible over the internet (useful for testing with external tools):

  1. Install ngrok:
# On Ubuntu/Debian
curl -s https://ngrok-agent.s3.amazonaws.com/ngrok.asc | sudo tee /etc/apt/trusted.gpg.d/ngrok.asc >/dev/null && echo "deb https://ngrok-agent.s3.amazonaws.com buster main" | sudo tee /etc/apt/sources.list.d/ngrok.list && sudo apt update && sudo apt install ngrok

# On macOS with Homebrew
brew install ngrok

# Or download from https://ngrok.com/downloads
  2. Sign up at https://ngrok.com and get your authtoken
  3. Configure ngrok:
ngrok config add-authtoken your_auth_token
  4. Start the Flask server:
python app.py
  5. In a new terminal, start ngrok:
ngrok http 5000
  6. Use the provided URL:
  • ngrok will display a URL like https://xxxx-xx-xx-xxx-xx.ngrok-free.app
  • Your OpenAI-compatible endpoint will be available at https://xxxx-xx-xx-xxx-xx.ngrok-free.app/v1/chat/completions
  • You can use this URL in any OpenAI-compatible client by setting the base URL

Example using OpenAI Python client:

from openai import OpenAI

client = OpenAI(
    base_url="https://xxxx-xx-xx-xxx-xx.ngrok-free.app/v1",
    api_key="not-needed"  # The proxy doesn't check API keys
)

response = client.chat.completions.create(
    model="bedrock-agent",  # Model name doesn't matter, will use Bedrock
    messages=[
        {"role": "user", "content": "Hello, how can you help me?"}
    ],
    stream=True  # Supports both streaming and non-streaming
)

for chunk in response:
    print(chunk.choices[0].delta.content or "", end="")
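
The same client works without streaming; with stream=False the full message comes back in one response:

response = client.chat.completions.create(
    model="bedrock-agent",
    messages=[{"role": "user", "content": "Hello, how can you help me?"}],
    stream=False,
)
print(response.choices[0].message.content)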

Note: On the ngrok free tier:

  • URLs are random and change each time you start ngrok
  • Rate limits apply, but they should be fine for testing
  • For production use, consider ngrok's paid tiers or a proper deployment

API Reference

Chat Completions

POST /v1/chat/completions

Request Format

{
    "model": "bedrock-agent",
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello, how can you help me?"}
    ],
    "stream": false,
    "session_id": "optional-session-id"
}
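
Note that session_id is a proxy-specific extension rather than a standard OpenAI parameter. With the official Python SDK it can be passed via extra_body, which merges extra fields into the top level of the request body:

response = client.chat.completions.create(
    model="bedrock-agent",
    messages=[{"role": "user", "content": "Remember that my name is Alice."}],
    extra_body={"session_id": "my-conversation-1"},  # reuse the ID to keep context
)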

Response Format (Non-Streaming)

{
    "id": "chatcmpl-123abc...",
    "object": "chat.completion",
    "created": 1677858242,
    "model": "bedrock-agent",
    "choices": [
        {
            "index": 0,
            "message": {
                "role": "assistant",
                "content": "The response from the agent..."
            },
            "finish_reason": "stop"
        }
    ],
    "usage": {
        "prompt_tokens": -1,
        "completion_tokens": -1,
        "total_tokens": -1
    }
}

Streaming Response Format

When stream: true, responses are sent as Server-Sent Events:

data: {"id":"chatcmpl-123","object":"chat.completion.chunk","created":1234567890,"model":"bedrock-agent","choices":[{"index":0,"delta":{"role":"assistant"},"finish_reason":null}]}

data: {"id":"chatcmpl-123","object":"chat.completion.chunk","created":1234567890,"model":"bedrock-agent","choices":[{"index":0,"delta":{"content":"Hello"},"finish_reason":null}]}

data: {"id":"chatcmpl-123","object":"chat.completion.chunk","created":1234567890,"model":"bedrock-agent","choices":[{"index":0,"delta":{},"finish_reason":"stop"}]}

data: [DONE]
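
Any SSE-capable client can consume this stream. For illustration, a minimal reader using the requests library:

import json

import requests

with requests.post(
    "http://localhost:5000/v1/chat/completions",
    json={
        "model": "bedrock-agent",
        "messages": [{"role": "user", "content": "Hello"}],
        "stream": True,
    },
    stream=True,
) as resp:
    for line in resp.iter_lines(decode_unicode=True):
        if not line or not line.startswith("data: "):
            continue
        data = line[len("data: "):]
        if data == "[DONE]":
            break
        delta = json.loads(data)["choices"][0]["delta"]
        print(delta.get("content", ""), end="", flush=True)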

Testing

Run Streaming Format Test

python test_streaming.py

This will:

  1. Start a local server instance
  2. Send identical requests to both OpenAI and the proxy
  3. Compare the streaming responses
  4. Validate format compatibility
  5. Generate a detailed comparison report

Implementation Notes

  • Token usage information is not available from Bedrock and will return -1
  • Session IDs are generated using UUID4 if not provided
  • Error responses follow OpenAI's format for compatibility
  • The server sanitizes error messages to prevent information leakage
  • AWS credentials are loaded securely from environment variables
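
For reference, OpenAI-style error responses follow this general envelope (the values below are illustrative; the actual type and message depend on the error, and messages are sanitized as noted above):

{
    "error": {
        "message": "An internal error occurred.",
        "type": "server_error",
        "param": null,
        "code": null
    }
}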
