feat(image-models): add pruna and nano banana pro image models (#34)

TimPietruskyRunPod · web-flow · commit b8cb20435cb3 · 2025-12-03T14:29:01.000+01:00
* feat(image-models): add pruna and nano banana pro image models

- add pruna/p-image-t2i text-to-image model
- add pruna/p-image-edit image editing model
- add google/nano-banana-pro-edit image editing model
- update buildInputPayload to handle pruna and nano banana pro formats
- update README with new model capabilities and provider options
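
For review, the `pruna/p-image-t2i` branch of buildInputPayload can be sketched in isolation. This is a minimal illustration under assumptions, not the provider's code: `buildPrunaT2iPayload`, `RunpodOptions`, and `ASPECT_RATIO_BY_SIZE` are hypothetical names, and the size table is an abbreviated subset of the one in this commit. It shows the resolution order (explicit `aspect_ratio` option, then the size lookup, then `'1:1'`) and that spreading the caller's options last lets them override the defaults.

```typescript
// Hypothetical standalone sketch; names below are illustrative only.
type RunpodOptions = Record<string, unknown>;

// Abbreviated size-to-ratio table (subset of the map added in this commit).
const ASPECT_RATIO_BY_SIZE: Record<string, string> = {
  '1328*1328': '1:1',
  '1472*1140': '4:3',
  '1140*1472': '3:4',
  '512*768': '2:3',
  '768*512': '3:2',
};

// Builds a text-to-image payload in the shape this commit uses for Pruna.
function buildPrunaT2iPayload(
  prompt: string,
  runpodSize: string,
  seed?: number,
  runpodOptions?: RunpodOptions,
): Record<string, unknown> {
  // Explicit option wins, then the size lookup, then the '1:1' fallback.
  const aspectRatio =
    (runpodOptions?.aspect_ratio as string | undefined) ??
    ASPECT_RATIO_BY_SIZE[runpodSize] ??
    '1:1';

  return {
    prompt,
    seed: seed ?? 0,
    aspect_ratio: aspectRatio,
    enable_safety_checker: runpodOptions?.enable_safety_checker ?? true,
    // Spread last so caller-supplied options override the defaults above.
    ...runpodOptions,
  };
}

// Ratio derived from the size string.
const fromSize = buildPrunaT2iPayload('a red fox', '1472*1140');
// Ratio taken from the explicit provider option instead.
const fromOption = buildPrunaT2iPayload('a red fox', '1472*1140', 7, {
  aspect_ratio: '16:9',
});
// Unknown size with no option falls back to '1:1'.
const fallback = buildPrunaT2iPayload('a red fox', '640*480');
```

Because the options are spread last, any key the caller passes (including ones the sketch does not know about) reaches the endpoint unchanged.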

* chore: add changeset for pruna and nano banana pro models
diff --git a/.changeset/add-pruna-nano-banana-models.md b/.changeset/add-pruna-nano-banana-models.md
@@ -0,0 +1,10 @@
+---
+"@runpod/ai-sdk-provider": minor
+---
+
+Add support for Pruna and Nano Banana Pro image models:
+- `pruna/p-image-t2i` - Pruna text-to-image generation
+- `pruna/p-image-edit` - Pruna image editing
+- `google/nano-banana-pro-edit` - Nano Banana Pro image editing (Gemini-powered)
+
+These models support flexible aspect ratios and additional provider options like `aspect_ratio`, `resolution`, `enable_sync_mode`, and `enable_base64_output`.
diff --git a/README.md b/README.md
@@ -106,11 +106,11 @@ for await (const delta of textStream) {
 
 ### Model Capabilities
 
-| Model ID                        | Description                                                         | Streaming | Object Generation | Tool Usage | Reasoning Notes           |
-| ------------------------------- | ------------------------------------------------------------------- | --------- | ----------------- | ---------- | ------------------------- |
-| `qwen/qwen3-32b-awq`           | 32B parameter multilingual model with strong reasoning capabilities | ✅        | ❌                | ✅         | Standard reasoning events |
-| `openai/gpt-oss-120b`           | 120B parameter open-source GPT model                                | ✅        | ❌                | ✅         | Standard reasoning events |
-| `deepcogito/cogito-671b-v2.1-fp8` | 671B parameter Cogito model with FP8 quantization                | ✅        | ❌                | ✅         | Standard reasoning events |
+| Model ID                          | Description                                                         | Streaming | Object Generation | Tool Usage | Reasoning Notes           |
+| --------------------------------- | ------------------------------------------------------------------- | --------- | ----------------- | ---------- | ------------------------- |
+| `qwen/qwen3-32b-awq`              | 32B parameter multilingual model with strong reasoning capabilities | ✅        | ❌                | ✅         | Standard reasoning events |
+| `openai/gpt-oss-120b`             | 120B parameter open-source GPT model                                | ✅        | ❌                | ✅         | Standard reasoning events |
+| `deepcogito/cogito-671b-v2.1-fp8` | 671B parameter Cogito model with FP8 quantization                   | ✅        | ❌                | ✅         | Standard reasoning events |
 
 **Note:** This list is not complete. For a full list of all available models, see the [Runpod Public Endpoint Reference](https://docs.runpod.io/hub/public-endpoint-reference).
 
@@ -235,6 +235,9 @@ writeFileSync('landscape.jpg', image.uint8Array);
 | `qwen/qwen-image`                      | Text-to-image generation        | 1:1, 4:3, 3:4                         |
 | `qwen/qwen-image-edit`                 | Image editing (prompt-guided)   | 1:1, 4:3, 3:4                         |
 | `nano-banana-edit`                     | Image editing (multi-image)     | 1:1, 4:3, 3:4                         |
+| `google/nano-banana-pro-edit`          | Image editing (Gemini-powered)  | Uses resolution param (1k, 2k)        |
+| `pruna/p-image-t2i`                    | Pruna text-to-image             | 1:1, 16:9, 9:16, 4:3, 3:4, etc.       |
+| `pruna/p-image-edit`                   | Pruna image editing             | match_input_image, 1:1, 16:9, etc.    |
 
 **Note**: The provider uses strict validation for image parameters. Unsupported aspect ratios (like `16:9`, `9:16`, `3:2`, `2:3`) will throw an `InvalidArgumentError` with a clear message about supported alternatives.
 
@@ -307,6 +310,8 @@ const { image } = await generateImage({
 });
 ```
 
+Check out our [examples](https://github.com/runpod/examples/tree/main/ai-sdk/getting-started) for more code snippets on how to use all the different models.
+
 ### Advanced Configuration
 
 ```ts
@@ -349,17 +354,22 @@ const { image } = await generateImage({
 
 Runpod image models support flexible provider options through the `providerOptions.runpod` object:
 
-| Option                  | Type       | Default | Description                                                              |
-| ----------------------- | ---------- | ------- | ------------------------------------------------------------------------ |
-| `negative_prompt`       | `string`   | `""`    | Text describing what you don't want in the image                         |
-| `enable_safety_checker` | `boolean`  | `true`  | Enable content safety filtering                                          |
-| `image`                 | `string`   | -       | Single input image: URL or base64 data URI (Flux Kontext)                |
-| `images`                | `string[]` | -       | Multiple input images (e.g., for `nano-banana-edit` multi-image editing) |
-| `num_inference_steps`   | `number`   | Auto    | Number of denoising steps (Flux: 4 for schnell, 28 for others)           |
-| `guidance`              | `number`   | Auto    | Guidance scale for prompt adherence (Flux: 7 for schnell, 2 for others)  |
-| `output_format`         | `string`   | `"png"` | Output image format ("png" or "jpg")                                     |
-| `maxPollAttempts`       | `number`   | `60`    | Maximum polling attempts for async generation                            |
-| `pollIntervalMillis`    | `number`   | `5000`  | Polling interval in milliseconds (5 seconds)                             |
+| Option                   | Type       | Default | Description                                                              |
+| ------------------------ | ---------- | ------- | ------------------------------------------------------------------------ |
+| `negative_prompt`        | `string`   | `""`    | Text describing what you don't want in the image                         |
+| `enable_safety_checker`  | `boolean`  | `true`  | Enable content safety filtering                                          |
+| `disable_safety_checker` | `boolean`  | `false` | Disable safety checker (Pruna models)                                    |
+| `image`                  | `string`   | -       | Single input image: URL or base64 data URI (Flux Kontext)                |
+| `images`                 | `string[]` | -       | Multiple input images (e.g., for `nano-banana-edit` multi-image editing) |
+| `aspect_ratio`           | `string`   | `"1:1"` | Aspect ratio, e.g. "16:9" (Pruna edit default: "match_input_image")      |
+| `resolution`             | `string`   | `"1k"`  | Output resolution (Nano Banana Pro: "1k", "2k")                          |
+| `num_inference_steps`    | `number`   | Auto    | Number of denoising steps (Flux: 4 for schnell, 28 for others)           |
+| `guidance`               | `number`   | Auto    | Guidance scale for prompt adherence (Flux: 7 for schnell, 2 for others)  |
+| `output_format`          | `string`   | `"png"` | Output image format ("png", "jpg", or "jpeg")                            |
+| `enable_base64_output`   | `boolean`  | `false` | Return base64 instead of URL (Nano Banana Pro)                           |
+| `enable_sync_mode`       | `boolean`  | `false` | Enable synchronous mode (some models)                                    |
+| `maxPollAttempts`        | `number`   | `60`    | Maximum polling attempts for async generation                            |
+| `pollIntervalMillis`     | `number`   | `5000`  | Polling interval in milliseconds (5 seconds)                             |
 
 ## About Runpod
 
diff --git a/src/runpod-image-model.ts b/src/runpod-image-model.ts
@@ -328,6 +328,63 @@ export class RunpodImageModel implements ImageModelV2 {
       }
     }
 
+    // Check if this is a Pruna model
+    const isPrunaModel = this.modelId.includes('pruna') || this.modelId.includes('p-image');
+    if (isPrunaModel) {
+      const isPrunaEdit = this.modelId.includes('edit');
+
+      if (isPrunaEdit) {
+        // Pruna image edit takes an aspect_ratio string; input images pass through via ...runpodOptions
+        return {
+          prompt,
+          seed: seed ?? -1,
+          aspect_ratio: runpodOptions?.aspect_ratio ?? 'match_input_image',
+          disable_safety_checker: runpodOptions?.disable_safety_checker ?? false,
+          enable_sync_mode: runpodOptions?.enable_sync_mode ?? false,
+          ...runpodOptions,
+        };
+      } else {
+        // Pruna text-to-image uses aspect_ratio string format
+        const aspectRatioMap: Record<string, string> = {
+          '1328*1328': '1:1',
+          '1472*1140': '4:3',
+          '1140*1472': '3:4',
+          '512*512': '1:1',
+          '768*768': '1:1',
+          '1024*1024': '1:1',
+          '1536*1536': '1:1',
+          '2048*2048': '1:1',
+          '4096*4096': '1:1',
+          '512*768': '2:3',
+          '768*512': '3:2',
+          '1024*768': '4:3',
+          '768*1024': '3:4',
+        };
+        const aspectRatio = runpodOptions?.aspect_ratio ?? aspectRatioMap[runpodSize] ?? '1:1';
+
+        return {
+          prompt,
+          seed: seed ?? 0,
+          aspect_ratio: aspectRatio,
+          enable_safety_checker: runpodOptions?.enable_safety_checker ?? true,
+          ...runpodOptions,
+        };
+      }
+    }
+
+    // Check if this is a Nano Banana Pro model (google/nano-banana-pro-edit)
+    const isNanaBananaProModel = this.modelId.includes('nano-banana-pro');
+    if (isNanaBananaProModel) {
+      return {
+        prompt,
+        resolution: runpodOptions?.resolution ?? '1k',
+        output_format: runpodOptions?.output_format ?? 'jpeg',
+        enable_base64_output: runpodOptions?.enable_base64_output ?? false,
+        enable_sync_mode: runpodOptions?.enable_sync_mode ?? false,
+        ...runpodOptions,
+      };
+    }
+
     // Default format for Qwen and other models
     return {
       prompt,
diff --git a/src/runpod-provider.ts b/src/runpod-provider.ts
@@ -84,6 +84,11 @@ const IMAGE_MODEL_ID_TO_ENDPOINT_URL: Record<string, string> = {
     'https://api.runpod.ai/v2/black-forest-labs-flux-1-dev',
   // Nano Banana (edit only)
   'nano-banana-edit': 'https://api.runpod.ai/v2/nano-banana-edit',
+  // Nano Banana Pro (edit only)
+  'google/nano-banana-pro-edit': 'https://api.runpod.ai/v2/nano-banana-pro-edit',
+  // Pruna (t2i and edit)
+  'pruna/p-image-t2i': 'https://api.runpod.ai/v2/p-image-t2i',
+  'pruna/p-image-edit': 'https://api.runpod.ai/v2/p-image-edit',
 };
 
 // Mapping of Runpod model IDs to their OpenAI model names