You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/advanced-usage/rate-limits-costs.md
+2Lines changed: 2 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -33,6 +33,8 @@ Most AI providers charge based on the number of tokens used. Pricing varies depe
33
33
34
34
Roo Code automatically calculates the estimated cost of each API request based on the configured model's pricing. This cost is displayed in the chat history, next to the token usage.
35
35
36
+
For reasoning-capable models (for example, Gemini 3 Pro Preview and other models that expose separate "thinking" or reasoning tokens), Roo Code now includes both normal tokens **and** reasoning / "thought" tokens in its estimates when the provider reports them. This can make the displayed token usage and cost slightly higher than in older versions, but it better matches how providers actually bill you.
37
+
36
38
**Note:**
37
39
38
40
* The cost calculation is an *estimate*. The actual cost may vary slightly depending on the provider's billing practices.
Copy file name to clipboardExpand all lines: docs/providers/gemini.md
+24-38Lines changed: 24 additions & 38 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -33,54 +33,38 @@ Roo Code supports Google's Gemini family of models through the Google AI Gemini
33
33
34
34
## Supported Models
35
35
36
-
Roo Code supports the following Gemini models:
36
+
Roo Code supports the main Gemini model families and automatically tracks Google's latest stable releases.
37
37
38
-
### Model Aliases (Recommended)
38
+
### Recommended Defaults
39
39
40
-
For stability and automatic updates, use these aliases that point to the latest stable versions:
40
+
-**Gemini 3 Pro Preview**
41
+
- 1M-token context window for very large workspaces and long-running conversations
42
+
- Reasoning-capable behavior for multi-step coding and refactoring tasks
43
+
- Tiered pricing support in Roo Code (≤200K vs >200K tokens) to better match Google's published pricing
44
+
-**Gemini Pro family**
45
+
- Stable Pro models for complex coding, debugging, and analysis
46
+
- Roo Code defaults to a stable Pro model where your provider supports it (today this is a Gemini 2.5 Pro variant; future releases may point at newer Pro models)
47
+
-**Gemini Flash family**
48
+
- Fast, lower-cost models for everyday tasks and quick iterations
41
49
42
-
*`gemini-flash-latest` - Always uses the newest stable Flash model
43
-
*`gemini-pro-latest` - Always uses the newest stable Pro model
50
+
### Aliases
44
51
52
+
For stability and automatic updates, prefer these aliases instead of hard-coding specific versioned model IDs:
45
53
46
-
### Standard Models
47
-
*`gemini-2.5-flash-preview-05-20`
48
-
*`gemini-2.5-flash-preview-04-17`
49
-
*`gemini-2.5-flash-lite-preview-06-17`
50
-
*`gemini-2.5-pro-exp-03-25`
51
-
*`gemini-2.0-flash-001`
52
-
*`gemini-2.0-flash-lite-preview-02-05`
53
-
*`gemini-2.0-pro-exp-02-05`
54
-
*`gemini-2.0-flash-exp`
55
-
*`gemini-1.5-flash-002`
56
-
*`gemini-1.5-flash-exp-0827`
57
-
*`gemini-1.5-flash-8b-exp-0827`
58
-
*`gemini-1.5-pro-002`
59
-
*`gemini-1.5-pro-exp-0827`
60
-
*`gemini-exp-1206`
54
+
-`gemini-flash-latest` – Points to the newest stable Flash model
55
+
-`gemini-pro-latest` – Points to the newest stable Pro model
61
56
62
-
### Preview Models
57
+
Using aliases helps Roo Code follow Google's recommended stable releases without you having to update model IDs manually.
63
58
64
-
Preview models include Google's latest experimental features but may change without notice:
59
+
### Thinking / Reasoning Models
65
60
66
-
* Models with `-preview-` in the name (e.g., `gemini-2.5-flash-preview-05-20`)
67
-
* Models with `-exp-` suffix (e.g., `gemini-2.0-flash-exp`)
68
-
* Models prefixed with `gemini-exp-` (e.g., `gemini-exp-1206`)
61
+
Some Gemini models are reasoning-capable and may expose separate "thinking" or reasoning tokens:
69
62
70
-
Preview models are ideal for testing cutting-edge capabilities but may have breaking changes. Use stable aliases for production work.
63
+
- Roo Code treats these as reasoning models and can use them for deeper, multi-step planning.
64
+
- To use reasoning models effectively, enable the **reasoning budget** feature in Roo Code settings.
65
+
- When the Gemini API reports reasoning / "thought" token usage, Roo Code includes those tokens in its cost estimates so reported costs stay closer to your provider's billing.
71
66
72
-
### Thinking Models
73
-
These models require reasoning budget to be enabled in Roo Code settings:
74
-
*`gemini-2.5-flash-preview-05-20:thinking`
75
-
*`gemini-2.5-flash-preview-04-17:thinking`
76
-
*`gemini-2.0-flash-thinking-exp-01-21`
77
-
*`gemini-2.0-flash-thinking-exp-1219`
78
-
79
-
:::info
80
-
**Thinking Models:** Models with `:thinking` suffix or "thinking" in their name are hybrid reasoning models that provide step-by-step reasoning capabilities. To use these models, you must enable the reasoning budget feature in Roo Code settings.
81
-
:::
82
-
83
-
Refer to the [Gemini documentation](https://ai.google.dev/models/gemini) for more details on each model.
67
+
Refer to the [Gemini documentation](https://ai.google.dev/models/gemini) for more details on each model family and its capabilities.
84
68
85
69
---
86
70
@@ -91,6 +75,8 @@ Refer to the [Gemini documentation](https://ai.google.dev/models/gemini) for mor
91
75
3.**Enter API Key:** Paste your Gemini API key into the "Gemini API Key" field.
92
76
4.**Select Model:** Choose your desired Gemini model from the "Model" dropdown.
93
77
78
+
By default, Roo Code selects a stable Pro model (currently a Gemini 2.5 Pro variant) with a temperature of **1.0** where your provider supports it. This keeps suggestions more expressive and natural while still staying on task. If you need highly deterministic output (for example, for code generation in CI), you can lower the temperature toward `0.0`.
Copy file name to clipboardExpand all lines: docs/providers/openai-compatible.md
+58-12Lines changed: 58 additions & 12 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -49,21 +49,67 @@ You'll find these settings in the Roo Code settings panel (click the <Codicon na
49
49
50
50
---
51
51
52
-
## Supported Models (for OpenAINative Endpoint)
52
+
## Native Tool Calling (OpenAI-Native Endpoint)
53
53
54
-
While this provider type allows connecting to various endpoints, if you are connecting directly to the official OpenAI API (or an endpoint mirroring it exactly), Roo Code recognizes the following model IDs based on the `openAiNativeModels` definition in its source code:
54
+
When you connect this provider directly to the official OpenAI API (or an endpoint that mirrors it exactly), Roo Code can use OpenAI's **native tool-calling** protocol instead of the XML-based tool format.
55
55
56
-
*`o3-mini`
57
-
*`o3-mini-high`
58
-
*`o3-mini-low`
59
-
*`o1`
60
-
*`o1-preview`
61
-
*`o1-mini`
62
-
*`gpt-4.5-preview`
63
-
*`gpt-4o`
64
-
*`gpt-4o-mini`
56
+
At a high level:
65
57
66
-
**Note:** If you are using a different OpenAI-compatible provider (like Together AI, Anyscale, etc.), the available model IDs will vary. Always refer to your specific provider's documentation for their supported model names.
58
+
-**Tool definitions** are sent to the model using OpenAI's native tools schema.
59
+
-**Tool calls** stream back as dedicated tool events, including the tool name, arguments, and metadata.
60
+
-**Tool arguments** are streamed incrementally, which reduces latency between the model deciding to use a tool and Roo Code executing it.
61
+
62
+
### When native tools are used
63
+
64
+
Roo Code uses native tool calling when **all** of the following are true:
65
+
66
+
1. The selected provider is configured for the OpenAI-native protocol (OpenAI or an OpenAI-compatible endpoint that fully supports native tools).
67
+
2. The active profile's tool protocol is set to allow native tools (or left at its default, which prefers native tools when supported).
68
+
3. The selected model supports native tool calling.
69
+
70
+
If any of these conditions aren't met, Roo Code falls back to its XML-based tool protocol instead.
71
+
72
+
### Example: simple native tool flow
73
+
74
+
Here's a simplified example of how a file-reading tool might be exposed when using an OpenAI-native endpoint:
75
+
76
+
```json
77
+
{
78
+
"tools": [
79
+
{
80
+
"type": "function",
81
+
"function": {
82
+
"name": "read_file",
83
+
"description": "Read a file from the workspace with line numbers.",
When the model decides to use `read_file`, Roo Code surfaces **streamed tool events** in the task timeline:
100
+
101
+
- A native *tool call* event with the tool name and arguments as they're being generated
102
+
- The corresponding *tool result* event showing the file contents and any truncation or line-range information
103
+
104
+
This gives you lower-latency feedback on which tools are being used and with what arguments.
105
+
106
+
### Settings and limitations
107
+
108
+
-**Tool protocol selector:** In advanced settings, you can choose which tool protocol Roo Code should prefer (XML vs native). If you disable native tools here, Roo Code will always use XML even if the provider supports native tools.
109
+
-**Model support:** Not all OpenAI-native or compatible models support tools. If a model doesn't support tools, Roo Code will not attempt to send tool definitions for it.
110
+
-**Provider quirks:** Some OpenAI-compatible providers only partially implement the native tools API. If Roo Code detects protocol errors, it may fall back to XML tools automatically.
111
+
112
+
For a deeper overview of how tools work in Roo Code in general, see the [Tool Use Overview](/advanced-usage/available-tools/tool-use-overview).
Copy file name to clipboardExpand all lines: docs/providers/vertex.md
+23-25Lines changed: 23 additions & 25 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -43,31 +43,29 @@ If no model is specified, Roo Code defaults to `claude-sonnet-4@20250514`.
43
43
44
44
### Google Gemini Models
45
45
46
-
#### Standard Models
47
-
*`gemini-2.5-flash` - Production version with prompt caching support
48
-
*`gemini-2.5-flash-preview-05-20` - Preview with 1M context window
49
-
*`gemini-2.5-flash-preview-04-17` - Preview without caching
50
-
*`gemini-2.5-flash-lite-preview-06-17` - Lite version with lower pricing
51
-
*`gemini-2.5-pro` - Production version with reasoning support
52
-
*`gemini-2.5-pro-preview-03-25` - Pro preview version
53
-
*`gemini-2.5-pro-preview-05-06` - Pro preview version
54
-
*`gemini-2.5-pro-preview-06-05` - Pro preview with reasoning support
55
-
*`gemini-2.5-pro-exp-03-25` - Experimental version (free)
56
-
*`gemini-2.0-flash-001` - 2.0 Flash model
57
-
*`gemini-2.0-flash-lite-001` - 2.0 Flash lite version
58
-
*`gemini-2.0-flash-thinking-exp-01-21` - Thinking/reasoning model
59
-
*`gemini-2.0-pro-exp-02-05` - 2.0 Pro experimental
60
-
*`gemini-1.5-flash-002` - 1.5 Flash model
61
-
*`gemini-1.5-pro-002` - 1.5 Pro model
62
-
63
-
#### Thinking/Reasoning Models
64
-
These models support enhanced reasoning capabilities with the `:thinking` suffix:
65
-
*`gemini-2.5-flash-preview-05-20:thinking`
66
-
*`gemini-2.5-flash-preview-04-17:thinking`
67
-
68
-
:::info
69
-
**Thinking Models:** Models with `:thinking` suffix enable step-by-step reasoning. The suffix is stripped before sending to the API but enables reasoning features in Roo Code. You'll need to enable the reasoning budget in settings to use these models effectively.
70
-
:::
46
+
Vertex AI exposes multiple Gemini model families. Roo Code focuses on the main families and tracks Google's stable releases instead of requiring you to hard-code versioned model IDs.
47
+
48
+
#### Recommended Gemini options
49
+
50
+
-**Gemini 3 Pro Preview**
51
+
- Up to a 1M-token context window for very large workspaces and long-running conversations
52
+
- Reasoning-capable behavior for complex coding and refactoring tasks
53
+
- Roo Code's cost estimation supports tiered pricing (short vs long requests) to better match Vertex AI billing for this model
54
+
-**Gemini Pro family**
55
+
- Stable Pro models for complex reasoning and analysis
56
+
- When you select a Gemini model without overriding it in a profile, Roo Code prefers a stable Pro variant by default where available
57
+
-**Gemini Flash family**
58
+
- Faster, lower-cost models ideal for quick iterations and non-critical tasks
59
+
60
+
#### Reasoning / thinking models
61
+
62
+
Some Gemini models provide dedicated reasoning or "thinking" tokens:
63
+
64
+
- Roo Code treats these as reasoning models and uses them for deeper multi-step planning when enabled.
65
+
- The reasoning budget must be enabled in Roo Code settings to take full advantage of these models.
66
+
- When Vertex AI reports separate reasoning or "thought" tokens, Roo Code includes them in token usage and cost estimates. Compared to older versions, you may see slightly higher but more accurate token counts.
67
+
68
+
Refer to the [Google Cloud Vertex AI models documentation](https://cloud.google.com/vertex-ai/generative-ai/docs/learn/models) for up-to-date Gemini model IDs and capabilities.
Copy file name to clipboardExpand all lines: docs/update-notes/v3.33.0.mdx
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,7 +9,7 @@ image: /img/v3.33.0/v3.33.0.png
9
9
10
10
# Roo Code 3.33.0 Release Notes (2025-11-18)
11
11
12
-
Gemini 3 Pro Preview is now available in Roo Code and already performing extremely well in real-world coding tasks, alongside 16 tool-protocol and UI tweaks and fixes—thanks to everyone in the Roo Code community who helped shape this release.
12
+
Gemini 3 Pro Preview is now available in Roo Code and already performing extremely well in real-world coding tasks, alongside 16 tweaks and fixes—thanks to everyone in the Roo Code community who helped shape this release.
0 commit comments