Skip to content

jwadow/kiro-openai-gateway

Repository files navigation

πŸš€ Kiro OpenAI Gateway

OpenAI-compatible proxy gateway for Kiro IDE API (AWS CodeWhisperer)

License: AGPL v3 Python 3.10+ FastAPI

Use Claude models through any tools that support the OpenAI API

Features β€’ Quick Start β€’ Configuration β€’ API Reference β€’ License


✨ Features

Feature Description
πŸ”Œ OpenAI-compatible API Works with any OpenAI client out of the box
πŸ’¬ Full message history Passes complete conversation context
πŸ› οΈ Tool Calling Supports function calling in OpenAI format
πŸ“‘ Streaming Full SSE streaming support
πŸ”„ Retry Logic Automatic retries on errors (403, 429, 5xx)
πŸ“‹ Extended model list Including versioned models
πŸ” Smart token management Automatic refresh before expiration
🧩 Modular architecture Easy to extend with new providers

πŸš€ Quick Start

Prerequisites

  • Python 3.10+
  • Kiro IDE with logged in account

Installation

# Clone the repository
git clone https://github.com/Jwadow/kiro-openai-gateway.git
cd kiro-openai-gateway

# Install dependencies
pip install -r requirements.txt

# Configure (see Configuration section)
cp .env.example .env
# Edit .env with your credentials

# Start the server
python main.py

The server will be available at http://localhost:8000


βš™οΈ Configuration

Option 1: JSON Credentials File

Specify the path to the credentials file:

KIRO_CREDS_FILE="~/.aws/sso/cache/kiro-auth-token.json"

# Password to protect YOUR proxy server (make up any secure string)
# You'll use this as api_key when connecting to your gateway
PROXY_API_KEY="my-super-secret-password-123"
πŸ“„ JSON file format
{
  "accessToken": "eyJ...",
  "refreshToken": "eyJ...",
  "expiresAt": "2025-01-12T23:00:00.000Z",
  "profileArn": "arn:aws:codewhisperer:us-east-1:...",
  "region": "us-east-1"
}

Option 2: Environment Variables (.env file)

Create a .env file in the project root:

# Required
REFRESH_TOKEN="your_kiro_refresh_token"

# Password to protect YOUR proxy server (make up any secure string)
PROXY_API_KEY="my-super-secret-password-123"

# Optional
PROFILE_ARN="arn:aws:codewhisperer:us-east-1:..."
KIRO_REGION="us-east-1"

Getting the Refresh Token

The refresh token can be obtained by intercepting Kiro IDE traffic. Look for requests to:

  • prod.us-east-1.auth.desktop.kiro.dev/refreshToken

πŸ“‘ API Reference

Endpoints

Endpoint Method Description
/ GET Health check
/health GET Detailed health check
/v1/models GET List available models
/v1/chat/completions POST Chat completions

Available Models

Model Description
claude-opus-4-5 Top-tier model
claude-opus-4-5-20251101 Top-tier model (versioned)
claude-sonnet-4-5 Enhanced model
claude-sonnet-4-5-20250929 Enhanced model (versioned)
claude-sonnet-4 Balanced model
claude-sonnet-4-20250514 Balanced model (versioned)
claude-haiku-4-5 Fast model
claude-3-7-sonnet-20250219 Legacy model

πŸ’‘ Usage Examples

πŸ”Ή Simple cURL Request
curl http://localhost:8000/v1/chat/completions \
  -H "Authorization: Bearer my-super-secret-password-123" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-sonnet-4-5",
    "messages": [{"role": "user", "content": "Hello!"}],
    "stream": true
  }'

Note: Replace my-super-secret-password-123 with the PROXY_API_KEY you set in your .env file.

πŸ”Ή Streaming Request
curl http://localhost:8000/v1/chat/completions \
  -H "Authorization: Bearer my-super-secret-password-123" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-sonnet-4-5",
    "messages": [
      {"role": "system", "content": "You are a helpful assistant."},
      {"role": "user", "content": "What is 2+2?"}
    ],
    "stream": true
  }'
πŸ”Ή With Tool Calling
curl http://localhost:8000/v1/chat/completions \
  -H "Authorization: Bearer my-super-secret-password-123" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-sonnet-4-5",
    "messages": [{"role": "user", "content": "What is the weather in London?"}],
    "tools": [{
      "type": "function",
      "function": {
        "name": "get_weather",
        "description": "Get weather for a location",
        "parameters": {
          "type": "object",
          "properties": {
            "location": {"type": "string", "description": "City name"}
          },
          "required": ["location"]
        }
      }
    }]
  }'
🐍 Python OpenAI SDK
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",
    api_key="my-super-secret-password-123"  # Your PROXY_API_KEY from .env
)

response = client.chat.completions.create(
    model="claude-sonnet-4-5",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"}
    ],
    stream=True
)

for chunk in response:
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")
🦜 LangChain
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(
    base_url="http://localhost:8000/v1",
    api_key="my-super-secret-password-123",  # Your PROXY_API_KEY from .env
    model="claude-sonnet-4-5"
)

response = llm.invoke("Hello, how are you?")
print(response.content)

πŸ“ Project Structure

kiro-openai-gateway/
β”œβ”€β”€ main.py                    # Entry point, FastAPI app creation
β”œβ”€β”€ requirements.txt           # Python dependencies
β”œβ”€β”€ .env.example               # Environment configuration example
β”‚
β”œβ”€β”€ kiro_gateway/              # Main package
β”‚   β”œβ”€β”€ __init__.py            # Package exports
β”‚   β”œβ”€β”€ config.py              # Configuration and constants
β”‚   β”œβ”€β”€ models.py              # Pydantic models for OpenAI API
β”‚   β”œβ”€β”€ auth.py                # KiroAuthManager - token management
β”‚   β”œβ”€β”€ cache.py               # ModelInfoCache - model caching
β”‚   β”œβ”€β”€ utils.py               # Helper utilities
β”‚   β”œβ”€β”€ converters.py          # OpenAI <-> Kiro conversion
β”‚   β”œβ”€β”€ parsers.py             # AWS SSE stream parsers
β”‚   β”œβ”€β”€ streaming.py           # Response streaming logic
β”‚   β”œβ”€β”€ http_client.py         # HTTP client with retry logic
β”‚   β”œβ”€β”€ debug_logger.py        # Debug logging (optional)
β”‚   └── routes.py              # FastAPI routes
β”‚
β”œβ”€β”€ tests/                     # Tests
β”‚   β”œβ”€β”€ unit/                  # Unit tests
β”‚   └── integration/           # Integration tests
β”‚
└── debug_logs/                # Debug logs (generated when enabled)

πŸ”§ Debugging

Debug logging is disabled by default. To enable, add to your .env:

# Debug logging mode:
# - off: disabled (default)
# - errors: save logs only for failed requests (4xx, 5xx) - recommended for troubleshooting
# - all: save logs for every request (overwrites on each request)
DEBUG_MODE=errors

Debug Modes

Mode Description Use Case
off Disabled (default) Production
errors Save logs only for failed requests (4xx, 5xx) Recommended for troubleshooting
all Save logs for every request Development/debugging

Debug Files

When enabled, requests are logged to the debug_logs/ folder:

File Description
request_body.json Incoming request from client (OpenAI format)
kiro_request_body.json Request sent to Kiro API
response_stream_raw.txt Raw stream from Kiro
response_stream_modified.txt Transformed stream (OpenAI format)
app_logs.txt Application logs for the request
error_info.json Error details (only on errors)

πŸ§ͺ Testing

# Run all tests
pytest

# Run unit tests only
pytest tests/unit/

# Run with coverage
pytest --cov=kiro_gateway

πŸ”Œ Extending with New Providers

The modular architecture makes it easy to add support for other providers:

  1. Create a new module kiro_gateway/providers/new_provider.py
  2. Implement the required classes:
    • NewProviderAuthManager β€” token management
    • NewProviderConverter β€” format conversion
    • NewProviderParser β€” response parsing
  3. Add routes to routes.py or create a separate router

πŸ“œ License

This project is licensed under the GNU Affero General Public License v3.0 (AGPL-3.0).

This means:

  • βœ… You can use, modify, and distribute this software
  • βœ… You can use it for commercial purposes
  • ⚠️ You must disclose source code when you distribute the software
  • ⚠️ Network use is distribution β€” if you run a modified version on a server and let others interact with it, you must make the source code available to them
  • ⚠️ Modifications must be released under the same license

See the LICENSE file for the full license text.

Why AGPL-3.0?

AGPL-3.0 ensures that improvements to this software benefit the entire community. If you modify this gateway and deploy it as a service, you must share your improvements with your users.

Contributor License Agreement (CLA)

By submitting a contribution to this project, you agree to the terms of our Contributor License Agreement (CLA). This ensures that:

  • You have the right to submit the contribution
  • You grant the maintainer rights to use and relicense your contribution
  • The project remains legally protected

πŸ‘€ Author

Jwadow β€” @Jwadow


⚠️ Disclaimer

This project is not affiliated with, endorsed by, or sponsored by Amazon Web Services (AWS), Anthropic, or Kiro IDE. Use at your own risk and in compliance with the terms of service of the underlying APIs.