@gsalberto

Summary

Add Latitude AI as a new provider in the Portkey gateway.

Latitude AI is a GPU cloud provider offering OpenAI-compatible inference APIs for open-source LLMs.

Features

  • Chat completions with streaming support
  • Tool calling support
  • JSON mode / structured output
  • Vision model support (qwen-2.5-vl-7b)
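
As a rough illustration of what streaming support involves, the sketch below parses OpenAI-style server-sent event lines and concatenates the content deltas. It assumes Latitude's stream follows the OpenAI chunk format (`data: {...}` lines ending with `data: [DONE]`), which the PR implies but does not show. The names `parseSseLine` and `collectContent` are hypothetical and are not taken from `chatComplete.ts`.

```typescript
// Minimal shape of an OpenAI-style streaming chunk (assumed, simplified).
interface StreamDelta {
  content?: string;
}

interface StreamChunk {
  choices: { delta: StreamDelta; finish_reason: string | null }[];
}

// Parse one SSE line. Returns 'done' for the terminator, null for lines
// that carry no data (blank lines, comments), or the decoded chunk.
function parseSseLine(line: string): StreamChunk | 'done' | null {
  const trimmed = line.trim();
  if (!trimmed.startsWith('data:')) return null;
  const payload = trimmed.slice('data:'.length).trim();
  if (payload === '[DONE]') return 'done';
  return JSON.parse(payload) as StreamChunk;
}

// Concatenate the content deltas of all chunks up to the terminator.
function collectContent(lines: string[]): string {
  let text = '';
  for (const line of lines) {
    const parsed = parseSseLine(line);
    if (parsed === 'done') break;
    if (parsed === null) continue;
    text += parsed.choices[0]?.delta.content ?? '';
  }
  return text;
}
```

A real handler would process chunks as they arrive rather than from an array, and would also map `finish_reason` values to the gateway's own stop reasons.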

Available Models

| Model                   | Context | Features                    |
|-------------------------|---------|-----------------------------|
| qwen-2.5-7b             | 131K    | Tools, JSON mode            |
| llama-3.1-8b            | 128K    | Tools, JSON mode            |
| qwen3-32b               | 131K    | Tools, JSON mode            |
| gemma-2-27b             | 8K      | Tools, JSON mode            |
| deepseek-r1-distill-14b | 64K     | Tools, JSON mode, Reasoning |
| qwen2.5-coder-32b       | 131K    | Tools, JSON mode            |
| qwen-2.5-vl-7b          | 32K     | Tools, JSON mode, Vision    |

Files Changed

  • src/globals.ts - Added LATITUDE constant
  • src/providers/index.ts - Registered Latitude provider
  • src/providers/types.ts - Added LATITUDE_STOP_REASON to union type
  • src/providers/latitude/ - New provider implementation
    • api.ts - API configuration (base URL, headers, endpoints)
    • chatComplete.ts - Chat completion handler with streaming
    • index.ts - Provider configuration export
    • types.ts - Stop reason enum
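
For readers unfamiliar with the provider layout, here is a minimal sketch of what an `api.ts` configuration of this kind might look like. The interface below is a simplified stand-in defined locally for illustration; the gateway's actual config type, the split of the URL into base and endpoint, and the Bearer authorization header are all assumptions, not taken from the PR diff.

```typescript
// Hypothetical, simplified shapes; the gateway's real types may differ.
interface ProviderOptionsSketch {
  apiKey?: string;
}

interface ApiConfigSketch {
  getBaseURL: () => string;
  headers: (args: { providerOptions: ProviderOptionsSketch }) => Record<string, string>;
  getEndpoint: (args: { fn: string }) => string;
}

const latitudeApiConfigSketch: ApiConfigSketch = {
  // Assumed base URL, derived from the chat completions URL in the PR description.
  getBaseURL: () => 'https://api.lsh.ai/v1',
  // Assumed OpenAI-style Bearer authentication.
  headers: ({ providerOptions }) => ({
    Authorization: `Bearer ${providerOptions.apiKey ?? ''}`,
  }),
  // Map a gateway function name to a path; only chat completions is covered here.
  getEndpoint: ({ fn }) => {
    switch (fn) {
      case 'chatComplete':
        return '/chat/completions';
      default:
        return '';
    }
  },
};
```

Joining `getBaseURL()` with `getEndpoint({ fn: 'chatComplete' })` yields the chat completions URL given under API Compatibility.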

Testing

  • Build passes with npm run build
  • Format passes with npm run format:check

API Compatibility

Latitude's API is fully OpenAI-compatible at https://api.lsh.ai/v1/chat/completions.
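
To make the compatibility claim concrete, the sketch below builds an OpenAI-style chat completions request for the URL above. The helper name `buildLatitudeChatRequest` is hypothetical, and Bearer authentication is an assumption based on OpenAI's convention rather than something the PR states.

```typescript
interface ChatMessage {
  role: 'system' | 'user' | 'assistant';
  content: string;
}

interface ChatRequestSketch {
  url: string;
  method: 'POST';
  headers: Record<string, string>;
  body: string;
}

const LATITUDE_CHAT_URL = 'https://api.lsh.ai/v1/chat/completions';

// Build (but do not send) an OpenAI-style chat completions request.
function buildLatitudeChatRequest(
  apiKey: string,
  model: string,
  messages: ChatMessage[],
  stream = false
): ChatRequestSketch {
  return {
    url: LATITUDE_CHAT_URL,
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      // Assumed: OpenAI-style Bearer token authentication.
      Authorization: `Bearer ${apiKey}`,
    },
    body: JSON.stringify({ model, messages, stream }),
  };
}
```

The result can be passed to `fetch(req.url, { method: req.method, headers: req.headers, body: req.body })`. Keeping request construction separate from the network call makes the payload easy to inspect and test.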

Latitude AI (https://ai.latitude.sh) is a GPU cloud provider offering
OpenAI-compatible inference APIs for open-source LLMs.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
@gsalberto
Author

@narengogi Thanks for approving. When do you expect to merge?
