|
1 | 1 | --- |
2 | 2 | sidebar_label: Fireworks AI |
3 | | -description: Configure Fireworks AI with Roo Code for fast access to open-source and proprietary AI models. Get started with Llama, Qwen, DeepSeek, and more on a high-performance platform. |
| 3 | +description: Use Fireworks AI with Roo Code to access state-of-the-art open-source models like Kimi K2, Qwen3, and DeepSeek with large context windows up to 256K tokens. |
4 | 4 | keywords: |
5 | 5 | - fireworks ai |
6 | | - - kimi models |
7 | | - - qwen models |
8 | | - - deepseek models |
| 6 | + - fireworks |
| 7 | + - kimi k2 |
| 8 | + - qwen3 |
| 9 | + - deepseek |
9 | 10 | - roo code |
10 | 11 | - api provider |
11 | | - - ai models |
12 | | - - language models |
13 | | - - llm api |
14 | 12 | - open source models |
15 | | - - serverless models |
| 13 | + - ai models |
| 14 | + - openai compatible |
16 | 15 | image: /img/social-share.jpg |
17 | 16 | --- |
18 | 17 |
|
19 | 18 | # Using Fireworks AI With Roo Code |
20 | 19 |
|
21 | | -Fireworks AI is a high-performance platform for running AI models that offers fast access to a wide range of open-source and proprietary language models. Built for speed and reliability, Fireworks AI provides both serverless and dedicated deployment options with OpenAI-compatible APIs. |
| 20 | +Fireworks AI provides access to state-of-the-art open-source language models through their optimized infrastructure. The provider offers cost-effective alternatives to proprietary AI services with context windows up to 256,000 tokens. |
22 | 21 |
|
23 | 22 | **Website:** [https://fireworks.ai/](https://fireworks.ai/) |
24 | 23 |
|
25 | 24 | --- |
26 | 25 |
|
27 | 26 | ## Getting an API Key |
28 | 27 |
|
29 | | -1. **Sign Up/Sign In:** Go to [Fireworks AI](https://fireworks.ai/) and create an account or sign in. |
30 | | -2. **Navigate to API Keys:** After logging in, go to the [API Keys page](https://app.fireworks.ai/settings/users/api-keys) in the account settings. |
31 | | -3. **Create a Key:** Click "Create API key" and give your key a descriptive name (e.g., "Roo Code"). |
32 | | -4. **Copy the Key:** Copy the API key *immediately* and store it securely. You will not be able to see it again. |
| 28 | +1. **Sign Up/Sign In:** Go to [Fireworks AI](https://fireworks.ai/). Create an account or sign in. |
| 29 | +2. **Navigate to Dashboard:** Access your account dashboard. |
| 30 | +3. **Generate API Key:** Create a new API key from the dashboard. |
| 31 | +4. **Copy the Key:** **Important:** Copy the API key immediately and store it securely. |
33 | 32 |
|
34 | 33 | --- |
35 | 34 |
|
36 | 35 | ## Supported Models |
37 | 36 |
|
38 | | -Roo Code supports the following Fireworks AI models: |
39 | | - |
40 | | -* `accounts/fireworks/models/kimi-k2-instruct` - Kimi K2 instruction-tuned model |
41 | | -* `accounts/fireworks/models/qwen3-235b-a22b-instruct-2507` - Qwen 3 235B instruction-tuned model |
42 | | -* `accounts/fireworks/models/qwen3-coder-480b-a35b-instruct` - Qwen 3 Coder 480B for code generation |
43 | | -* `accounts/fireworks/models/deepseek-r1-0528` - DeepSeek R1 reasoning model |
44 | | -* `accounts/fireworks/models/deepseek-v3` - DeepSeek V3 latest generation model |
| 37 | +Fireworks AI offers several high-performance models: |
| 38 | + |
| 39 | +### Kimi K2 |
| 40 | +* `accounts/fireworks/models/kimi-k2-instruct` (Default) |
| 41 | + - 1 trillion total parameters with 32B activated params |
| 42 | + - 128K context window |
| 43 | + - Optimized for agentic capabilities |
| 44 | + - $0.60/M input, $2.50/M output |
| 45 | + |
| 46 | +### Qwen3 Series |
| 47 | +* `accounts/fireworks/models/qwen3-235b-a22b-instruct-2507` |
| 48 | + - 256K context window |
| 49 | + - Competitive with closed-source models |
| 50 | + - $0.22/M input, $0.88/M output |
| 51 | + |
| 52 | +* `accounts/fireworks/models/qwen3-coder-480b-a35b-instruct` |
| 53 | + - 256K context window |
| 54 | + - Specialized for coding tasks |
| 55 | + - $0.45/M input, $1.80/M output |
| 56 | + |
| 57 | +### DeepSeek Series |
| 58 | +* `accounts/fireworks/models/deepseek-r1-0528` |
| 59 | + - 160K context window |
| 60 | + - Advanced reasoning with reduced hallucination |
| 61 | + - Function calling support |
| 62 | + - $3.00/M input, $8.00/M output |
| 63 | + |
| 64 | +* `accounts/fireworks/models/deepseek-v3` |
| 65 | + - 128K context window |
| 66 | + - 671B total parameters with 37B activated |
| 67 | + - $0.90/M input, $0.90/M output |
45 | 68 |
|
46 | 69 | --- |
47 | 70 |
|
48 | 71 | ## Configuration in Roo Code |
49 | 72 |
|
50 | 73 | 1. **Open Roo Code Settings:** Click the gear icon (<Codicon name="gear" />) in the Roo Code panel. |
51 | 74 | 2. **Select Provider:** Choose "Fireworks AI" from the "API Provider" dropdown. |
52 | | -3. **Enter API Key:** Paste your Fireworks AI API key into the "Fireworks AI API Key" field. |
53 | | -4. **Select Model:** Choose your desired model from the "Model" dropdown. |
| 75 | +3. **Enter API Key:** Paste your Fireworks API key into the "Fireworks API Key" field. |
| 76 | +4. **Model Selection:** The default model (Kimi K2) is automatically selected. You can change it from the model dropdown if needed. |
| 77 | + |
| 78 | +--- |
| 79 | + |
| 80 | +## Model Selection Guide |
| 81 | + |
| 82 | +Choose models based on your needs: |
| 83 | + |
| 84 | +| Model | Best For | Context | Price | |
| 85 | +|-------|----------|---------|-------| |
| 86 | +| **Kimi K2** | General tasks, balanced performance | 128K | Mid-range | |
| 87 | +| **Qwen3 235B** | Cost-effective general use | 256K | Budget-friendly | |
| 88 | +| **Qwen3 Coder** | Code generation and debugging | 256K | Mid-range | |
| 89 | +| **DeepSeek R1** | Complex reasoning, function calling | 160K | Premium | |
| 90 | +| **DeepSeek V3** | Strong general performance | 128K | Balanced | |
54 | 91 |
|
55 | 92 | --- |
56 | 93 |
|
57 | 94 | ## Tips and Notes |
58 | 95 |
|
59 | | -* **Performance:** Fireworks AI is optimized for speed and offers excellent performance for both chat and completion tasks. |
60 | | -* **Pricing:** Refer to the [Fireworks AI Pricing](https://fireworks.ai/pricing) page for current pricing information. |
61 | | -* **Rate Limits:** Fireworks AI has usage-based rate limits. Monitor your usage in the dashboard and consider upgrading your plan if needed. |
| 96 | +* **Cost-Effective:** Fireworks AI offers significantly lower pricing than proprietary models while maintaining competitive performance. |
| 97 | +* **Large Context Windows:** Most models support 128K-256K tokens, suitable for processing large documents and maintaining extended conversations. |
| 98 | +* **OpenAI Compatibility:** The provider uses an OpenAI-compatible API format with streaming support and usage tracking. |
| 99 | +* **Text-Only:** All models are text-only without image support or prompt caching capabilities. |
| 100 | +* **Default Temperature:** Uses 0.5 temperature by default for balanced creativity and consistency. |
| 101 | +* **API Keys:** Stored locally on your machine for security. |
| 102 | +* **Pricing:** See the [Fireworks AI pricing page](https://fireworks.ai/pricing) for current rates. Prices shown are per million tokens. |
0 commit comments