Skip to content

Commit 8bc0150

Browse files
committed
docs: add DeepInfra provider documentation
1 parent 139eebf commit 8bc0150

File tree

1 file changed

+90
-0
lines changed

1 file changed

+90
-0
lines changed

docs/providers/deepinfra.md

Lines changed: 90 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,90 @@
1+
---
2+
sidebar_label: DeepInfra
3+
description: Configure DeepInfra's high-performance AI models in Roo Code. Access Qwen Coder, Llama, and other open-source models with prompt caching and vision capabilities.
4+
keywords:
5+
- deepinfra
6+
- deep infra
7+
- roo code
8+
- api provider
9+
- qwen coder
10+
- llama models
11+
- prompt caching
12+
- vision models
13+
- open source ai
14+
image: /img/social-share.jpg
15+
---
16+
17+
# Using DeepInfra With Roo Code
18+
19+
DeepInfra provides cost-effective access to high-performance open-source models with features like prompt caching, vision support, and specialized coding models. Their infrastructure offers low latency and automatic load balancing across global edge locations.
20+
21+
**Website:** [https://deepinfra.com/](https://deepinfra.com/)
22+
23+
---
24+
25+
## Getting an API Key
26+
27+
1. **Sign Up/Sign In:** Go to [DeepInfra](https://deepinfra.com/). Create an account or sign in.
28+
2. **Navigate to API Keys:** Access the API keys section in your dashboard.
29+
3. **Create a Key:** Generate a new API key. Give it a descriptive name (e.g., "Roo Code").
30+
4. **Copy the Key:** **Important:** Copy the API key immediately. Store it securely.
31+
32+
---
33+
34+
## Supported Models
35+
36+
Roo Code dynamically fetches available models from DeepInfra's API. The default model is:
37+
38+
* `Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo` (256K context, optimized for coding)
39+
40+
Common models available include:
41+
42+
* **Coding Models:** Qwen Coder series, specialized for programming tasks
43+
* **General Models:** Llama 3.1, Mixtral, and other open-source models
44+
* **Vision Models:** Models with image understanding capabilities
45+
* **Reasoning Models:** Models with advanced reasoning support
46+
47+
Browse the full catalog at [deepinfra.com/models](https://deepinfra.com/models).
48+
49+
---
50+
51+
## Configuration in Roo Code
52+
53+
1. **Open Roo Code Settings:** Click the gear icon (<Codicon name="gear" />) in the Roo Code panel.
54+
2. **Select Provider:** Choose "DeepInfra" from the "API Provider" dropdown.
55+
3. **Enter API Key:** Paste your DeepInfra API key into the "DeepInfra API Key" field.
56+
4. **Select Model:** Choose your desired model from the "Model" dropdown.
57+
- Models will auto-populate after entering a valid API key
58+
- Click "Refresh Models" to update the list
59+
60+
---
61+
62+
## Advanced Features
63+
64+
### Prompt Caching
65+
66+
DeepInfra supports prompt caching for eligible models, which:
67+
- Reduces costs for repeated contexts
68+
- Improves response times for similar queries
69+
- Automatically manages cache based on task IDs
70+
71+
### Vision Support
72+
73+
Models with vision capabilities can:
74+
- Process images alongside text
75+
- Understand visual content for coding tasks
76+
- Analyze screenshots and diagrams
77+
78+
### Custom Base URL
79+
80+
For enterprise deployments, you can configure a custom base URL in the advanced settings.
81+
82+
---
83+
84+
## Tips and Notes
85+
86+
* **Performance:** DeepInfra offers low latency with automatic load balancing across global locations.
87+
* **Cost Efficiency:** Competitive pricing with prompt caching to reduce costs for repeated contexts.
88+
* **Model Variety:** Access to the latest open-source models including specialized coding models.
89+
* **Context Windows:** Models support context windows up to 256K tokens for large codebases.
90+
* **Pricing:** Pay-per-use model with no minimums. Check [deepinfra.com](https://deepinfra.com/) for current pricing.

0 commit comments

Comments
 (0)