|
| 1 | +--- |
| 2 | +sidebar_label: DeepInfra |
| 3 | +description: Configure DeepInfra's high-performance AI models in Roo Code. Access Qwen Coder, Llama, and other open-source models with prompt caching and vision capabilities. |
| 4 | +keywords: |
| 5 | + - deepinfra |
| 6 | + - deep infra |
| 7 | + - roo code |
| 8 | + - api provider |
| 9 | + - qwen coder |
| 10 | + - llama models |
| 11 | + - prompt caching |
| 12 | + - vision models |
| 13 | + - open source ai |
| 14 | +image: /img/social-share.jpg |
| 15 | +--- |
| 16 | + |
| 17 | +# Using DeepInfra With Roo Code |
| 18 | + |
| 19 | +DeepInfra provides cost-effective access to high-performance open-source models with features like prompt caching, vision support, and specialized coding models. Their infrastructure offers low latency and automatic load balancing across global edge locations. |
| 20 | + |
| 21 | +**Website:** [https://deepinfra.com/](https://deepinfra.com/) |
| 22 | + |
| 23 | +--- |
| 24 | + |
| 25 | +## Getting an API Key |
| 26 | + |
| 27 | +1. **Sign Up/Sign In:** Go to [DeepInfra](https://deepinfra.com/). Create an account or sign in. |
| 28 | +2. **Navigate to API Keys:** Access the API keys section in your dashboard. |
| 29 | +3. **Create a Key:** Generate a new API key. Give it a descriptive name (e.g., "Roo Code"). |
| 30 | +4. **Copy the Key:** **Important:** Copy the API key immediately. Store it securely. |
| 31 | + |
| 32 | +--- |
| 33 | + |
| 34 | +## Supported Models |
| 35 | + |
| 36 | +Roo Code dynamically fetches available models from DeepInfra's API. The default model is: |
| 37 | + |
| 38 | +* `Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo` (256K context, optimized for coding) |
| 39 | + |
| 40 | +Common models available include: |
| 41 | + |
| 42 | +* **Coding Models:** Qwen Coder series, specialized for programming tasks |
| 43 | +* **General Models:** Llama 3.1, Mixtral, and other open-source models |
| 44 | +* **Vision Models:** Models with image understanding capabilities |
| 45 | +* **Reasoning Models:** Models with advanced reasoning support |
| 46 | + |
| 47 | +Browse the full catalog at [deepinfra.com/models](https://deepinfra.com/models). |
| 48 | + |
| 49 | +--- |
| 50 | + |
| 51 | +## Configuration in Roo Code |
| 52 | + |
| 53 | +1. **Open Roo Code Settings:** Click the gear icon (<Codicon name="gear" />) in the Roo Code panel. |
| 54 | +2. **Select Provider:** Choose "DeepInfra" from the "API Provider" dropdown. |
| 55 | +3. **Enter API Key:** Paste your DeepInfra API key into the "DeepInfra API Key" field. |
| 56 | +4. **Select Model:** Choose your desired model from the "Model" dropdown. |
| 57 | + - Models will auto-populate after entering a valid API key |
| 58 | + - Click "Refresh Models" to update the list |
| 59 | + |
| 60 | +--- |
| 61 | + |
| 62 | +## Advanced Features |
| 63 | + |
| 64 | +### Prompt Caching |
| 65 | + |
| 66 | +DeepInfra supports prompt caching for eligible models, which: |
| 67 | +- Reduces costs for repeated contexts |
| 68 | +- Improves response times for similar queries |
| 69 | +- Automatically manages cache based on task IDs |
| 70 | + |
| 71 | +### Vision Support |
| 72 | + |
| 73 | +Models with vision capabilities can: |
| 74 | +- Process images alongside text |
| 75 | +- Understand visual content for coding tasks |
| 76 | +- Analyze screenshots and diagrams |
| 77 | + |
| 78 | +### Custom Base URL |
| 79 | + |
| 80 | +For enterprise deployments, you can configure a custom base URL in the advanced settings. |
| 81 | + |
| 82 | +--- |
| 83 | + |
| 84 | +## Tips and Notes |
| 85 | + |
| 86 | +* **Performance:** DeepInfra offers low latency with automatic load balancing across global locations. |
| 87 | +* **Cost Efficiency:** Competitive pricing with prompt caching to reduce costs for repeated contexts. |
| 88 | +* **Model Variety:** Access to the latest open-source models including specialized coding models. |
| 89 | +* **Context Windows:** Models support context windows up to 256K tokens for large codebases. |
| 90 | +* **Pricing:** Pay-per-use model with no minimums. Check [deepinfra.com](https://deepinfra.com/) for current pricing. |
0 commit comments