Commit 34bbb71

docs: update image generation documentation and add release notes for v3.26.3 (#326)
1 parent 175d879 commit 34bbb71

5 files changed

+92
-12
lines changed

docs/features/image-generation.md

Lines changed: 45 additions & 10 deletions
@@ -1,19 +1,24 @@
11
---
2-
description: Generate images from text prompts directly in Roo Code using OpenRouter API. Save AI-generated images to your workspace with preview support.
2+
description: Generate new images from text prompts or edit existing images in Roo Code using OpenRouter API. Transform, enhance, and save AI-processed images to your workspace with preview support.
33
keywords:
44
- image generation
5+
- image editing
56
- text to image
7+
- image transformation
68
- OpenRouter
79
- AI images
810
- experimental feature
911
- image creation
1012
- prompt to image
13+
- watercolor
14+
- upscaling
15+
- style transfer
1116
image: /img/social-share.jpg
1217
---
1318

1419
# Image Generation
1520

16-
Generate images from plain text prompts, save them to your project, and preview them in the chat. This experimental feature requires an OpenRouter API key.
21+
Generate new images from text prompts or edit existing images in your workspace. Save results to your project with preview in chat. This experimental feature requires an OpenRouter API key.
1722

1823
:::warning Experimental Feature
1924
Image Generation is an experimental feature that requires enabling in settings and configuring an OpenRouter API key.
@@ -23,25 +28,32 @@ Image Generation is an experimental feature that requires enabling in settings a
2328

2429
## Key Features
2530

26-
- Create images directly from chat using natural language
27-
- Saves to your workspace at a path you choose; file extension is auto-added if missing
28-
- Shows a preview of the generated image in the conversation
29-
- Choose an image generation model (via OpenRouter)
31+
- Create new images from text prompts using natural language
32+
- Edit and transform existing images in your workspace
33+
- Saves to your workspace at a path you choose; appropriate extension (.png or .jpg) is auto-added if missing
34+
- Shows a preview of the generated/edited image in the conversation
35+
- Currently uses Gemini 2.5 Flash Image Preview models via OpenRouter
3036
- Simple on/off toggle under Experimental settings
3137

3238
---
3339

34-
## Use Case
40+
## Use Cases
3541

42+
### Image Generation
3643
**Before:** You had to copy prompts to an external site, download the result, then move it into your workspace.
3744

3845
**With this feature:** Ask Roo to generate an image, approve, pick a save location, and continue editing with the image already in your project.
3946

47+
### Image Editing
48+
**Before:** Export image, upload to external editor, make changes, download, import back to project.
49+
50+
**With this feature:** Ask Roo to transform your existing image directly - it reads the file, applies your edits, and saves the result in your project.
51+
4052
---
4153

4254
## How It Works
4355

44-
When enabled, Roo sends your prompt to an image-capable model through OpenRouter. The image returned by OpenRouter is saved to the path you specify inside your current workspace. Roo shows a preview in the chat and the file appears in your file explorer.
56+
When enabled, Roo sends your prompt (and optionally an existing image) to an image-capable model through OpenRouter. The generated or edited image returned by OpenRouter is saved to the path you specify inside your current workspace. Roo shows a preview in the chat and the file appears in your file explorer.
4557

4658
---
4759

@@ -67,19 +79,39 @@ When enabled, Roo sends your prompt to an image-capable model through OpenRouter
6779

6880
### 3. Image Generation Model
6981
- **Purpose:** Selects which model to use for generation
70-
- **Default:** Gemini 2.5 Flash Image Preview (or the free preview variant)
82+
- **Default:** Gemini 2.5 Flash Image Preview
83+
- **Available Models:** Currently limited to Gemini 2.5 Flash Image Preview and its free variant
7184

7285
---
7386

7487
## Using Image Generation
7588

7689
1. In chat, ask Roo to generate an image and describe what you want (subject, style, lighting, composition).
7790
2. Confirm the action when prompted. Roo may ask you to choose a save path (for example: `images/sunset.png`).
78-
3. Roo generates the image and saves it. If you don't include an extension, one is added automatically (png or jpg).
91+
3. Roo generates the image and saves it. If you don't include an extension, the appropriate extension (.png or .jpg) is added based on the output format.
7992
4. See the image preview in the chat and locate the file in your workspace.
8093

8194
---
8295

96+
## Editing Existing Images
97+
98+
Roo can also transform and edit existing images in your workspace:
99+
100+
1. Ask Roo to edit an image, describing the transformation you want
101+
2. Specify both the input image path and where to save the result
102+
3. Roo will apply your requested edits to the existing image
103+
104+
**Supported Input Formats**: PNG, JPG, JPEG, GIF, WEBP
105+
106+
**Example Requests**:
107+
- "Transform `photos/portrait.jpg` into a watercolor painting and save as `art/watercolor-portrait.png`"
108+
- "Upscale and enhance `images/logo.png` to higher resolution"
109+
- "Apply a vintage filter to `screenshots/app.png`"
110+
111+
**Note**: Both the input image path and output path must be accessible (not blocked by `.rooignore`)
112+
113+
---
114+
83115
## Tips for Better Results
84116

85117
### Be Specific
@@ -95,8 +127,11 @@ Include these elements in your prompts:
95127
## Limitations
96128

97129
- Experimental feature; availability and model list are limited
130+
- Currently limited to Gemini 2.5 Flash Image Preview models
98131
- One image is produced per request
99132
- Output formats supported: PNG or JPG
133+
- Supported input formats for editing: PNG, JPG, JPEG, GIF, WEBP only
134+
- Image paths must be accessible (not blocked by `.rooignore` restrictions)
100135
- Usage may be subject to your OpenRouter plan limits and costs
101136

102137
---
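
The documented file rules (editing accepts PNG, JPG, JPEG, GIF, and WEBP inputs, and a save path with no extension gets `.png` or `.jpg` appended) can be sketched as two small checks. This is a minimal illustration only: the helper names `isSupportedInputImage` and `withImageExtension` are hypothetical and are not taken from the Roo Code source, and the real tool may apply further rules, such as `.rooignore` filtering.

```typescript
// Hypothetical helpers; names and behavior are illustrative assumptions,
// not Roo Code's actual implementation.
const SUPPORTED_INPUT_EXTENSIONS = ["png", "jpg", "jpeg", "gif", "webp"];

type OutputFormat = "png" | "jpg";

/** Returns the lowercase extension without the dot, or "" when there is none. */
function extensionOf(filePath: string): string {
  // Look only at the final path segment so dots in directory names are ignored.
  const fileName = filePath.split(/[\\/]/).pop() ?? "";
  const dot = fileName.lastIndexOf(".");
  return dot > 0 ? fileName.slice(dot + 1).toLowerCase() : "";
}

/** True when the path ends in one of the documented input formats. */
function isSupportedInputImage(filePath: string): boolean {
  return SUPPORTED_INPUT_EXTENSIONS.includes(extensionOf(filePath));
}

/** Appends the output format's extension only when the save path has none. */
function withImageExtension(savePath: string, format: OutputFormat): string {
  return extensionOf(savePath) === "" ? `${savePath}.${format}` : savePath;
}
```

For example, `withImageExtension("images/sunset", "png")` yields `images/sunset.png`, while a path that already carries an extension is returned unchanged.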

docs/update-notes/index.md

Lines changed: 1 addition & 0 deletions
@@ -19,6 +19,7 @@ image: /img/social-share.jpg
1919

2020
### Version 3.26
2121

22+
* [3.26.3](/update-notes/v3.26.3) (2025-08-29)
2223
* [3.26.2](/update-notes/v3.26.2) (2025-08-28)
2324
* [3.26.1](/update-notes/v3.26.1) (2025-08-27)
2425
* [3.26.0](/update-notes/v3.26.0) (2025-08-26)

docs/update-notes/v3.26.3.mdx

Lines changed: 33 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,33 @@
1+
---
2+
description: This release adds image editing capabilities to the image generation tool and improves developer experience.
3+
keywords:
4+
- roo code 3.26.3
5+
- image editing
6+
- image generation
7+
image: /img/social-share.jpg
8+
---
9+
10+
# Roo Code 3.26.3 Release Notes (2025-08-29)
11+
12+
This release adds image editing capabilities to the image generation tool and improves developer experience.
13+
14+
## Image Editing with Input Images
15+
16+
We've enhanced the image generation tool with the ability to edit and transform existing images ([#7525](https://github.com/RooCodeInc/Roo-Code/pull/7525)):
17+
18+
- **Transform Existing Images**: Apply artistic styles, convert photos to paintings, or create variations of existing artwork
19+
- **Style Transfer**: Convert images into watercolor, oil painting, sketch, or other artistic styles
20+
- **Image Enhancement**: Upscale and enhance existing images to higher resolution while maintaining composition
21+
- **Creative Editing**: Use text prompts to modify specific aspects of an image while preserving the rest
22+
23+
The tool now accepts an optional `image` parameter for transforming existing images in your workspace. Supported input formats include PNG, JPG, JPEG, GIF, and WEBP.
24+
25+
> **📚 Documentation**: See [Image Generation - Editing Existing Images](/features/image-generation#editing-existing-images) for detailed usage and transformation examples.
26+
27+
## Bug Fixes
28+
29+
* **Image Generation Settings**: Fixed issue where the saved API key would clear when switching modes ([#7536](https://github.com/RooCodeInc/Roo-Code/pull/7536))
30+
31+
## Misc Improvements
32+
33+
* **Test Debugging**: Console logs now visible in tests when using the --no-silent flag (thanks hassoncs!) ([#7467](https://github.com/RooCodeInc/Roo-Code/pull/7467))
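
To make the optional `image` parameter from these release notes concrete, the sketch below builds a request body with and without an input image. It is an assumption-laden illustration, not the tool's actual code: `buildImageRequest` and its types are hypothetical, the message layout assumes an OpenAI-style chat payload with an `image_url` data URL, and the `modalities` field is assumed rather than confirmed by this commit. No network call is made.

```typescript
// Sketch under stated assumptions: the payload shape (OpenAI-style messages,
// `image_url` data URLs, `modalities`) is assumed, and all names are hypothetical.
interface InputImage {
  mimeType: string; // e.g. "image/png"
  base64: string;   // file contents, already base64-encoded by the caller
}

type ContentPart =
  | { type: "text"; text: string }
  | { type: "image_url"; image_url: { url: string } };

interface ImageRequest {
  model: string;
  modalities: string[];
  messages: { role: "user"; content: ContentPart[] }[];
}

/** Builds a generation request, or an edit request when `image` is supplied. */
function buildImageRequest(prompt: string, model: string, image?: InputImage): ImageRequest {
  const content: ContentPart[] = [{ type: "text", text: prompt }];
  if (image) {
    // Editing: attach the existing image alongside the text prompt.
    content.push({
      type: "image_url",
      image_url: { url: `data:${image.mimeType};base64,${image.base64}` },
    });
  }
  return { model, modalities: ["image", "text"], messages: [{ role: "user", content }] };
}
```

Keeping the image optional means generation and editing share one code path: the only difference is whether a second content part is attached to the prompt.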

docs/update-notes/v3.26.mdx

Lines changed: 12 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -69,13 +69,20 @@ We've added Vercel AI Gateway as a complete provider integration (thanks joshual
6969
7070
### Image Generation (OpenRouter) — Free option: Gemini 2.5 Flash Image Preview
7171

72-
Generate images from natural‑language prompts directly inside Roo Code using OpenRouters image generation models. Configure your OpenRouter API key, pick a supported model, and preview results in the built‑in Image Viewer. See [Image Generation](/features/image-generation) and [OpenRouter Provider](/providers/openrouter) for setup and model selection.
72+
Generate images from natural‑language prompts directly inside Roo Code using OpenRouter's image generation models. Configure your OpenRouter API key, pick a supported model, and preview results in the built‑in Image Viewer. See [Image Generation](/features/image-generation) and [OpenRouter Provider](/providers/openrouter) for setup and model selection.
7373

7474
- **Free option available: Gemini 2.5 Flash Image Preview** — try image generation without paid credits for faster onboarding and quick experiments
7575
- Prompt‑to‑image workflow inside the editor with approvals flow (supports auto‑approval when write permissions are granted)
7676
- Image Viewer with zoom, copy, and save for quick reuse in docs and prototypes
77+
- **NEW in v3.26.3: Image Editing** — Transform and edit existing images in your workspace ([#7525](https://github.com/RooCodeInc/Roo-Code/pull/7525)):
78+
- Apply artistic styles like watercolor, oil painting, or sketch
79+
- Upscale and enhance images to higher resolution
80+
- Modify specific aspects while preserving the rest
81+
- Supports PNG, JPG, JPEG, GIF, and WEBP input formats
7782

78-
PRs: [#7474](https://github.com/RooCodeInc/Roo-Code/pull/7474), [#7492](https://github.com/RooCodeInc/Roo-Code/pull/7492), [#7493](https://github.com/RooCodeInc/Roo-Code/pull/7493)
83+
PRs: [#7474](https://github.com/RooCodeInc/Roo-Code/pull/7474), [#7492](https://github.com/RooCodeInc/Roo-Code/pull/7492), [#7493](https://github.com/RooCodeInc/Roo-Code/pull/7493), [#7525](https://github.com/RooCodeInc/Roo-Code/pull/7525)
84+
85+
> **📚 Documentation**: See [Image Generation - Editing Existing Images](/features/image-generation#editing-existing-images) for transformation examples.
7986
8087
### Provider Updates
8188

@@ -106,6 +113,7 @@ PRs: [#7474](https://github.com/RooCodeInc/Roo-Code/pull/7474), [#7492](https://
106113
* **Mistral Thinking Content**: Fixed validation errors when using Mistral models that send thinking content (thanks Biotrioo!) ([#7106](https://github.com/RooCodeInc/Roo-Code/pull/7106))
107114
* **Requesty Model Listing**: Fixed model listing for Requesty provider when using custom base URLs (thanks dtrugman!) ([#7378](https://github.com/RooCodeInc/Roo-Code/pull/7378))
108115
* **Todo List Setting**: Fixed newTaskRequireTodos setting to properly enforce todo list requirements ([#7363](https://github.com/RooCodeInc/Roo-Code/pull/7363))
116+
* **Image Generation Settings** (v3.26.3): Fixed issue where the saved API key would clear when switching modes ([#7536](https://github.com/RooCodeInc/Roo-Code/pull/7536))
109117
* ImageGenerationSettings no longer shows a dirty state on first open; the save button only enables after an actual change. (#[7495](https://github.com/RooCodeInc/Roo-Code/pull/7495))
110118
* GPT‑5 reliability improvements:
111119
- Manual condense preserves conversation continuity by correctly handling `previous_response_id` on the next request
@@ -128,9 +136,11 @@ PRs: [#7474](https://github.com/RooCodeInc/Roo-Code/pull/7474), [#7492](https://
128136
* **Issue Fixer Mode**: Added missing todos parameter in new_task tool usage ([#7391](https://github.com/RooCodeInc/Roo-Code/pull/7391))
129137
* **Privacy Policy Update**: Updated privacy policy to clarify proxy mode data handling (thanks jdilla1277!) ([#7255](https://github.com/RooCodeInc/Roo-Code/pull/7255))
130138
* **Dependencies**: Updated drizzle-kit to v0.31.4 ([#5453](https://github.com/RooCodeInc/Roo-Code/pull/5453))
139+
* **Test Debugging** (v3.26.3): Console logs now visible in tests when using the --no-silent flag (thanks hassoncs!) ([#7467](https://github.com/RooCodeInc/Roo-Code/pull/7467))
131140
* Release automation: version bumps, changelog updates, and auto-publishing on merge for a faster, more reliable release process. (#[7490](https://github.com/RooCodeInc/Roo-Code/pull/7490))
132141
* New TaskSpawned developer event so integrations can detect when a subtask is created and capture its ID for chaining or monitoring. (#[7465](https://github.com/RooCodeInc/Roo-Code/pull/7465))
133142
* Roo Code Cloud SDK bumped to 0.25.0. (#[7475](https://github.com/RooCodeInc/Roo-Code/pull/7475))
134143
* Release automation: version bumps, changelog updates, and auto-publishing on merge for a faster, more reliable release process. (#[7490](https://github.com/RooCodeInc/Roo-Code/pull/7490))
135144
* New TaskSpawned developer event so integrations can detect when a subtask is created and capture its ID for chaining or monitoring. (#[7465](https://github.com/RooCodeInc/Roo-Code/pull/7465))
145+
136146
* Roo Code Cloud SDK bumped to 0.25.0. (#[7475](https://github.com/RooCodeInc/Roo-Code/pull/7475))

sidebars.ts

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -217,6 +217,7 @@ const sidebars: SidebarsConfig = {
217217
label: '3.26',
218218
items: [
219219
{ type: 'doc', id: 'update-notes/v3.26', label: '3.26 Combined' },
220+
{ type: 'doc', id: 'update-notes/v3.26.3', label: '3.26.3' },
220221
{ type: 'doc', id: 'update-notes/v3.26.2', label: '3.26.2' },
221222
{ type: 'doc', id: 'update-notes/v3.26.1', label: '3.26.1' },
222223
{ type: 'doc', id: 'update-notes/v3.26.0', label: '3.26.0' },
