Skip to content

Commit 2f4b88e

Browse files
committed
docs: clarify Image Generation flow with Quick Start, simple flow explanation, and imperative examples (incl. GPT-5 mini)
1 parent de5a047 commit 2f4b88e

File tree

1 file changed

+20
-1
lines changed

1 file changed

+20
-1
lines changed

docs/features/image-generation.md

Lines changed: 20 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,6 +24,13 @@ Generate new images from text prompts or edit existing images in your workspace.
2424
Image Generation is an experimental feature that requires enabling in settings and configuring an OpenRouter API key.
2525
:::
2626

27+
## Quick Start
28+
29+
1. Enable Image Generation in Settings > Experimental and save your OpenRouter API key.
30+
2. Switch to any coding model you prefer (e.g., Claude 3.7 Sonnet, Gemini Pro, GPT-5 mini).
31+
3. In chat, type an explicit instruction to create an image (for example: "Generate an image of a low-poly kangaroo mascot on a dark background").
32+
4. Approve the action and choose a save path (e.g., `images/kangaroo.png`). Roo saves the file and shows a preview.
33+
2734
---
2835

2936
## Key Features
@@ -53,6 +60,13 @@ Image Generation is an experimental feature that requires enabling in settings a
5360

5461
## How It Works
5562

63+
### Simple Flow
64+
- Turn on Image Generation and set your OpenRouter API key.
65+
- Use any coding model; Roo exposes an Image tool to the model.
66+
- When you instruct "Generate an image …", Roo enhances the prompt and invokes that tool, which calls the Gemini 2.5 Flash Image Preview model via OpenRouter.
67+
- Roo writes the resulting image to your chosen path and shows a preview in the chat.
68+
69+
### Behind the Scenes
5670
When enabled, Roo sends your prompt (and optionally an existing image) to an image-capable model through OpenRouter. The generated or edited image returned by OpenRouter is saved to the path you specify inside your current workspace. Roo shows a preview in the chat and the file appears in your file explorer.
5771

5872
---
@@ -86,11 +100,16 @@ When enabled, Roo sends your prompt (and optionally an existing image) to an ima
86100

87101
## Using Image Generation
88102

89-
1. In chat, ask Roo to generate an image and describe what you want (subject, style, lighting, composition).
103+
1. In chat, instruct Roo to generate an image and specify clear characteristics (subject, style, lighting, composition, aspect ratio).
90104
2. Confirm the action when prompted. Roo may ask you to choose a save path (for example: `images/sunset.png`).
91105
3. Roo generates the image and saves it. If you don't include an extension, the appropriate extension (.png or .jpg) is added based on the output format.
92106
4. See the image preview in the chat and locate the file in your workspace.
93107

108+
### Example Prompts
109+
- Create an image of a friendly low-poly kangaroo mascot on a dark gradient background, 1024x1024 PNG.
110+
- Generate a watercolor illustration of a mountain lake at sunrise with light mist, soft lighting, pastel palette, 16:9 aspect ratio.
111+
- Example using GPT-5 mini: Generate an image of a retro terminal wallpaper with green phosphor glow and subtle scanlines, minimal noise, 3840x2160.
112+
94113
---
95114

96115
## Editing Existing Images

0 commit comments

Comments
 (0)