diff --git a/docs/concepts/multimodal/image_generation.md b/docs/concepts/multimodal/image_generation.md
new file mode 100644
index 00000000..6b4c7327
--- /dev/null
+++ b/docs/concepts/multimodal/image_generation.md
@@ -0,0 +1,427 @@
+# Image Generation and Editing
+
+This module introduces support for multimodal data generation pipelines that produce **image outputs** using AI image generation and editing models. It enables text-to-image generation and image editing tasks, expanding traditional text-only pipelines to support visual content creation.
+
+## Key Features
+
+- Supports **text-to-image** generation from natural language descriptions.
+- Supports **image editing** with text instructions (single or multiple images).
+- Returns **base64-encoded image data URLs** compatible with standard image formats.
+- Compatible with HuggingFace datasets, streaming, and on-disk formats.
+
+## Supported Models
+
+**Currently, we only support the following OpenAI image models:**
+
+- `dall-e-2` - DALL-E 2 for image generation and editing
+  - Sizes: 256x256, 512x512, 1024x1024
+  - Batch generation: Up to 10 images
+  - Single image editing only
+
+- `dall-e-3` - DALL-E 3 for high-quality image generation
+  - Sizes: 1024x1024, 1792x1024, 1024x1792
+  - Quality: standard or hd
+  - Style: vivid or natural
+  - Single image per request
+
+- `gpt-image-1` - Latest GPT-Image model with enhanced capabilities
+  - Same features as DALL-E 3
+  - **Multi-image editing**: Up to 16 images per request
+  - Supports PNG, WEBP, JPG formats
+  - Maximum file size: 50MB per image
+
+## Input Requirements
+
+### Text-to-Image Generation
+
+For image generation from text:
+
+- **Text prompt**: Natural language description of the desired image
+- Maximum length:
+  - DALL-E-2: 1000 characters
+  - DALL-E-3: 4000 characters
+  - GPT-Image-1: 32000 characters
+
+### Image Editing
+
+For image editing with text instructions:
+
+- **Images**: One or more images can be passed as `image_url` entries (refer to [image_to_text.md](./image_to_text.md) to see how input images are handled)
+  - DALL-E-2: Single image only (PNG, square, < 4MB)
+  - GPT-Image-1: 1-16 images (PNG/WEBP/JPG, < 50MB each)
+- **Text prompt**: Instruction describing the desired edits
+
+## How Image Generation Works
+
+### Text-to-Image Flow
+
+1. Text prompt is extracted from the input record.
+2. The image model generates image(s) from the prompt.
+3. Image(s) are returned as **base64-encoded data URLs** (e.g., `data:image/png;base64,...`).
+4. Data URLs are converted to files and saved to disk.
+5. Output contains absolute paths to the saved image files.
+
+### Image Editing Flow
+
+1. The system detects images (data URLs) in the input.
+2. If images are present → routes to the **edit_image** API.
+3. Edited images are returned as data URLs and saved to disk.
+4. Output contains absolute paths to the saved edited image files.
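+
+### Decoding a Returned Data URL
+
+The pipeline performs steps 3-5 for you, but the returned data URLs are ordinary base64 payloads that can be decoded with the standard library alone. The sketch below is illustrative only (the `save_data_url` helper is hypothetical and not part of this module); it mirrors what the pipeline does when it converts a data URL into a file on disk:
+
+```python
+import base64
+from pathlib import Path
+
+
+def save_data_url(data_url: str, dest_stem: Path) -> Path:
+    """Hypothetical helper: decode a data:image/...;base64 URL and write it to disk."""
+    header, payload = data_url.split(",", 1)
+    # header looks like "data:image/png;base64" -> file extension "png"
+    extension = header.split("/")[1].split(";")[0]
+    dest = dest_stem.with_suffix(f".{extension}")
+    dest.write_bytes(base64.b64decode(payload))
+    return dest
+
+
+# A tiny 1x1 PNG (the same sample data URL used in this PR's tests)
+tiny_png = (
+    "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJ"
+    "AAAADUlEQVR42mNk+M9QDwADhgGAWjR9awAAAABJRU5ErkJggg=="
+)
+print(save_data_url(tiny_png, Path("1_product_image_0")))  # -> 1_product_image_0.png
+```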
+
+## Model Configuration
+
+The model configuration must specify `output_type: image` and include image-specific parameters:
+
+### Basic Text-to-Image Configuration
+
+```yaml
+dalle3_model:
+  model: dall-e-3
+  output_type: image
+  model_type: openai
+  parameters:
+    size: "1024x1024"
+    quality: "hd"
+    style: "vivid"
+```
+
+### Image Editing Configuration
+
+```yaml
+gpt_image_1:
+  model: gpt-image-1
+  output_type: image
+  model_type: azure_openai
+  api_version: 2025-03-01-preview
+  parameters:
+    quality: "medium"
+```
+
+## Example 1: Product Image Generation
+
+Generate product images from descriptions:
+
+```yaml
+data_config:
+  source:
+    type: "disk"
+    file_path: "data/products.json"
+
+graph_config:
+  nodes:
+    generate_product_images:
+      node_type: llm
+      output_keys: product_image
+      prompt:
+        - user: |
+            A professional product photo of {product_name}: {product_description}.
+            Studio lighting, white background, high quality, commercial photography.
+      model:
+        name: dalle3_model
+        parameters:
+          size: "1024x1024"
+          quality: "hd"
+          style: "natural"
+
+  edges:
+    - from: START
+      to: generate_product_images
+    - from: generate_product_images
+      to: END
+
+output_config:
+  output_map:
+    id:
+      from: "id"
+    product_name:
+      from: "product_name"
+    product_image:
+      from: "product_image"
+```
+
+### Input Data (`data/products.json`)
+
+```json
+[
+  {
+    "id": "1",
+    "product_name": "Wireless Headphones",
+    "product_description": "Premium over-ear headphones with noise cancellation, matte black finish"
+  },
+  {
+    "id": "2",
+    "product_name": "Smart Watch",
+    "product_description": "Modern fitness tracker with OLED display, silver aluminum case"
+  }
+]
+```
+
+### Output
+
+```json
+[
+  {
+    "id": "1",
+    "product_name": "Wireless Headphones",
+    "product_image": "/path/to/multimodal_output/image/1_product_image_0.png"
+  },
+  {
+    "id": "2",
+    "product_name": "Smart Watch",
+    "product_image": "/path/to/multimodal_output/image/2_product_image_0.png"
+  }
+]
+```
+
+## Example 2: Image Editing (Background Removal)
+
+Edit existing images with text instructions:
+
+```yaml
+data_config:
+  source:
+    type: "disk"
+    file_path: "data/photos.json"
+
+graph_config:
+  nodes:
+    edit_background:
+      node_type: llm
+      output_keys: edited_image
+      prompt:
+        - user:
+            - type: image_url
+              image_url: "{original_image}"
+            - type: text
+              text: "Remove the background and replace it with a solid white background. Keep the subject unchanged."
+      model:
+        name: gpt_image_1
+        parameters:
+          size: "1024x1024"
+
+  edges:
+    - from: START
+      to: edit_background
+    - from: edit_background
+      to: END
+
+output_config:
+  output_map:
+    id:
+      from: "id"
+    original_image:
+      from: "original_image"
+    edited_image:
+      from: "edited_image"
+```
+
+### Input Data (`data/photos.json`)
+
+```json
+[
+  {
+    "id": "1",
+    "original_image": "data:image/png;base64,iVBORw0KGgo..."
+  },
+  {
+    "id": "2",
+    "original_image": "data:image/png;base64,iVBORw0KGgo..."
+  }
+]
+```
+
+### Output
+
+```json
+[
+  {
+    "id": "1",
+    "original_image": "data:image/png;base64,iVBORw0KGgo...",
+    "edited_image": "/path/to/multimodal_output/image/1_edited_image_0.png"
+  },
+  {
+    "id": "2",
+    "original_image": "data:image/png;base64,iVBORw0KGgo...",
+    "edited_image": "/path/to/multimodal_output/image/2_edited_image_0.png"
+  }
+]
+```
+
+## Example 3: Multi-Image Editing (GPT-Image-1)
+
+Edit multiple images simultaneously to create a collage:
+
+```yaml
+data_config:
+  source:
+    type: "disk"
+    file_path: "data/photo_sets.json"
+
+graph_config:
+  nodes:
+    create_collage:
+      node_type: llm
+      output_keys: collage
+      prompt:
+        - user:
+            - type: image_url
+              image_url: "{image_1}"
+            - type: image_url
+              image_url: "{image_2}"
+            - type: image_url
+              image_url: "{image_3}"
+            - type: text
+              text: "Arrange these images into a beautiful 3-panel collage with white borders. Maintain image quality."
+      model:
+        name: gpt_image_1
+        parameters:
+          quality: "medium"
+
+  edges:
+    - from: START
+      to: create_collage
+    - from: create_collage
+      to: END
+
+output_config:
+  output_map:
+    id:
+      from: "id"
+    collage:
+      from: "collage"
+```
+
+### Input Data (`data/photo_sets.json`)
+
+```json
+[
+  {
+    "id": "1",
+    "image_1": "data:image/png;base64,iVBORw0KGgo...",
+    "image_2": "data:image/png;base64,iVBORw0KGgo...",
+    "image_3": "data:image/png;base64,iVBORw0KGgo..."
+  }
+]
+```
+
+### Output
+
+```json
+[
+  {
+    "id": "1",
+    "collage": "/path/to/multimodal_output/image/1_collage_0.png"
+  }
+]
+```
+
+## Example 4: Batch Image Generation
+
+Generate multiple variations in one request:
+
+```yaml
+data_config:
+  source:
+    type: "disk"
+    file_path: "data/concepts.json"
+
+graph_config:
+  nodes:
+    generate_variations:
+      node_type: llm
+      output_keys: images
+      prompt:
+        - user: "{concept_description}"
+      model:
+        name: dalle2_model  # assumes a DALL-E-2 entry (model: dall-e-2) defined like dalle3_model above
+        parameters:
+          size: "512x512"
+          n: 5  # Generate 5 variations
+
+  edges:
+    - from: START
+      to: generate_variations
+    - from: generate_variations
+      to: END
+
+output_config:
+  output_map:
+    id:
+      from: "id"
+    concept:
+      from: "concept_description"
+    images:
+      from: "images"
+```
+
+### Input Data (`data/concepts.json`)
+
+```json
+[
+  {
+    "id": "1",
+    "concept_description": "A futuristic city skyline at sunset with flying cars"
+  }
+]
+```
+
+### Output
+
+```json
+[
+  {
+    "id": "1",
+    "concept": "A futuristic city skyline at sunset with flying cars",
+    "images": [
+      "/path/to/multimodal_output/image/1_images_0.png",
+      "/path/to/multimodal_output/image/1_images_1.png",
+      "/path/to/multimodal_output/image/1_images_2.png",
+      "/path/to/multimodal_output/image/1_images_3.png",
+      "/path/to/multimodal_output/image/1_images_4.png"
+    ]
+  }
+]
+```
+
+## Auto-Detection Logic
+
+The system automatically detects the operation type:
+
+1. **Input has images** → Routes to **edit_image()** API (image editing)
+2. **Input has no images** → Routes to **create_image()** API (text-to-image)
+
+This means you only need to set `output_type: image` - the system handles the rest!
+
+## Output File Organization
+
+Generated images are saved to:
+
+```
+task_dir/
+└── multimodal_output/
+    └── image/
+        ├── {record_id}_{field_name}_0.png
+        ├── {record_id}_{field_name}_1.png
+        └── ...
+```
+
+- `record_id`: ID from the input record
+- `field_name`: Output field name from the prompt
+- Trailing index (`_0`, `_1`, ...): position of the image when a request returns multiple images
+
+## Notes
+
+- **Image generation is currently only supported for OpenAI models** (DALL-E-2, DALL-E-3, GPT-Image-1).
+- The `output_type` in model configuration must be set to `image` to enable image operations. +- Image files are automatically saved and paths are inserted into the output. +- Multi-image editing requires GPT-Image-1 model; other models support single image only. +- All image models have their own limitations and restrictions; refer to the [OpenAI Image API Documentation](https://platform.openai.com/docs/guides/images) before use. + +--- + +## See Also + +- [Text to Speech](./text_to_speech.md) - For audio generation +- [Audio to Text](./audio_to_text.md) - For speech recognition and transcription +- [Image to Text](./image_to_text.md) - For vision-based multimodal pipelines +- [OpenAI Image API Documentation](https://platform.openai.com/docs/guides/images) - Official OpenAI Image API reference diff --git a/sygra/config/models.yaml b/sygra/config/models.yaml index 7cd5fb27..9528cb19 100644 --- a/sygra/config/models.yaml +++ b/sygra/config/models.yaml @@ -82,10 +82,21 @@ qwen3_1.7b: # TTS openai model tts_openai: model: tts - output_type: audio # This triggers TTS functionality - model_type: azure_openai # Use azure_openai or openai model type + output_type: audio + model_type: azure_openai api_version: 2025-03-01-preview # URL and api_key should be defined at .env file as SYGRA_TTS_OPENAI_URL and SYGRA_TTS_OPENAI_TOKEN parameters: voice: "alloy" response_format: "wav" + +# Image generation model +gpt_image_1: + model: gpt-image-1 + output_type: image + model_type: azure_openai + api_version: 2025-04-01-preview + # URL and api_key should be defined at .env file as SYGRA_GPT_IMAGE_1_URL and SYGRA_GPT_IMAGE_1_TOKEN + parameters: + size: "1024x1024" + quality: "high" \ No newline at end of file diff --git a/sygra/core/models/client/openai_azure_client.py b/sygra/core/models/client/openai_azure_client.py index 23d1ddfb..48b050c2 100644 --- a/sygra/core/models/client/openai_azure_client.py +++ b/sygra/core/models/client/openai_azure_client.py @@ -198,3 +198,140 @@ async def create_speech( response_format=response_format, speed=speed, ) + + async def create_image( + self, + model: str, + prompt: str, + **kwargs: Any, + ) -> Any: + """ + Generate images from text prompts using Azure OpenAI's image generation API. + + Args: + model (str): The deployment name for the image model + prompt (str): The text description of the desired image(s) + **kwargs: Additional parameters supported by the API: + - size (str): Image size + - quality (str): "standard" or "hd" + - n (int): Number of images to generate + - response_format (str): "url" or "b64_json" + - style (str): "vivid" or "natural" + - stream (bool): Enable streaming responses + - Any other parameters supported by Azure OpenAI API + + Returns: + Any: The image generation response from the API + + Raises: + ValueError: If async_client is False (Image generation requires async client) + """ + if not self.async_client: + raise ValueError( + "Image generation API requires async client. Please initialize with async_client=True" + ) + + client = cast(Any, self.client) + + # Build the request parameters with all provided kwargs + params: Dict[str, Any] = { + "model": model, + "prompt": prompt, + **kwargs, # Pass all additional parameters + } + + return await client.images.generate(**params) + + async def edit_image( + self, + image: Union[Any, List[Any]], + prompt: str, + **kwargs: Any, + ) -> Any: + """ + Edit existing image(s) based on a text prompt using Azure OpenAI. + + Args: + image: The image(s) to edit. 
Can be: + - Single image: file path (str) or file-like object + - Multiple images (gpt-image-1 only): list of file paths or file-like objects + For gpt-image-1: png, webp, or jpg < 50MB each, up to 16 images + For dall-e-2: single square png < 4MB + prompt (str): A text description of the desired edits + **kwargs: Additional parameters supported by the API: + - model (str): Azure deployment name + - n (int): Number of images to generate + - size (str): Size of generated images + - response_format (str): "url" or "b64_json" + - stream (bool): Enable streaming responses + - Any other parameters supported by Azure OpenAI API + + Returns: + Any: The image edit response from the API + + Raises: + ValueError: If async_client is False + """ + if not self.async_client: + raise ValueError( + "Image edit API requires async client. Please initialize with async_client=True" + ) + + client = cast(Any, self.client) + + # Build the request parameters with all provided kwargs + params: Dict[str, Any] = { + "image": image, + "prompt": prompt, + **kwargs, # Pass all additional parameters + } + + return await client.images.edit(**params) + + async def create_image_variation( + self, + image: Any, + model: Optional[str] = None, + n: int = 1, + size: Optional[str] = None, + response_format: Optional[str] = None, + ) -> Any: + """ + Create a variation of a given image using Azure OpenAI. + + Args: + image: The image to use as the basis for variation(s). Must be a valid PNG file, + less than 4MB, and square. Can be a file path (str) or file-like object. + model (str, optional): The deployment name for the model + n (int, optional): Number of variations to generate (1-10). Defaults to 1 + size (str, optional): Size of generated images: "256x256", "512x512", or "1024x1024" + response_format (str, optional): "url" or "b64_json" + + Returns: + Any: The image variation response from the API + + Raises: + ValueError: If async_client is False + """ + if not self.async_client: + raise ValueError( + "Image variation API requires async client. Please initialize with async_client=True" + ) + + client = cast(Any, self.client) + + # Build the request parameters + params: Dict[str, Any] = { + "image": image, + "n": n, + } + + # Add optional parameters if provided + if model is not None: + params["model"] = model + if size is not None: + params["size"] = size + if response_format is not None: + params["response_format"] = response_format + + return await client.images.create_variation(**params) diff --git a/sygra/core/models/client/openai_client.py b/sygra/core/models/client/openai_client.py index 7a704069..c6c7a1a8 100644 --- a/sygra/core/models/client/openai_client.py +++ b/sygra/core/models/client/openai_client.py @@ -220,3 +220,140 @@ async def create_speech( response_format=response_format, speed=speed, ) + + async def create_image( + self, + model: str, + prompt: str, + **kwargs: Any, + ) -> Any: + """ + Generate images from text prompts using OpenAI's image generation API. 
+ + Args: + model (str): The image model to use (e.g., 'dall-e-2', 'dall-e-3', 'gpt-image-1') + prompt (str): The text description of the desired image(s) + **kwargs: Additional parameters supported by the API: + - size (str): Image size (e.g., "1024x1024", "1792x1024") + - quality (str): "standard" or "hd" + - n (int): Number of images to generate + - response_format (str): "url" or "b64_json" + - style (str): "vivid" or "natural" + - stream (bool): Enable streaming responses + - Any other parameters supported by OpenAI API + + Returns: + Any: The image generation response from the API + + Raises: + ValueError: If async_client is False (Image generation requires async client) + """ + if not self.async_client: + raise ValueError( + "Image generation API requires async client. Please initialize with async_client=True" + ) + + client = cast(Any, self.client) + + # Build the request parameters with all provided kwargs + params: Dict[str, Any] = { + "model": model, + "prompt": prompt, + **kwargs, # Pass all additional parameters + } + + return await client.images.generate(**params) + + async def edit_image( + self, + image: Union[Any, List[Any]], + prompt: str, + **kwargs: Any, + ) -> Any: + """ + Edit existing image(s) based on a text prompt. + + Args: + image: The image(s) to edit. Can be: + - Single image: file path (str) or file-like object + - Multiple images (gpt-image-1 only): list of file paths or file-like objects + For gpt-image-1: png, webp, or jpg < 50MB each, up to 16 images + For dall-e-2: single square png < 4MB + prompt (str): A text description of the desired edits + **kwargs: Additional parameters supported by the API: + - model (str): Model to use (e.g., 'dall-e-2', 'gpt-image-1') + - n (int): Number of images to generate + - size (str): Size of generated images + - response_format (str): "url" or "b64_json" + - stream (bool): Enable streaming responses + - Any other parameters supported by OpenAI API + + Returns: + Any: The image edit response from the API + + Raises: + ValueError: If async_client is False + """ + if not self.async_client: + raise ValueError( + "Image edit API requires async client. Please initialize with async_client=True" + ) + + client = cast(Any, self.client) + + # Build the request parameters with all provided kwargs + params: Dict[str, Any] = { + "image": image, + "prompt": prompt, + **kwargs, # Pass all additional parameters + } + + return await client.images.edit(**params) + + async def create_image_variation( + self, + image: Any, + model: Optional[str] = None, + n: int = 1, + size: Optional[str] = None, + response_format: Optional[str] = None, + ) -> Any: + """ + Create a variation of a given image. + + Args: + image: The image to use as the basis for variation(s). Must be a valid PNG file, + less than 4MB, and square. Can be a file path (str) or file-like object. + model (str, optional): The model to use (e.g., 'dall-e-2') + n (int, optional): Number of variations to generate (1-10). Defaults to 1 + size (str, optional): Size of generated images: "256x256", "512x512", or "1024x1024" + response_format (str, optional): "url" or "b64_json" + + Returns: + Any: The image variation response from the API + + Raises: + ValueError: If async_client is False + """ + if not self.async_client: + raise ValueError( + "Image variation API requires async client. 
Please initialize with async_client=True"
+            )
+
+        client = cast(Any, self.client)
+
+        # Build the request parameters
+        params: Dict[str, Any] = {
+            "image": image,
+            "n": n,
+        }
+
+        # Add optional parameters if provided
+        if model is not None:
+            params["model"] = model
+        if size is not None:
+            params["size"] = size
+        if response_format is not None:
+            params["response_format"] = response_format
+
+        return await client.images.create_variation(**params)
diff --git a/sygra/core/models/custom_models.py b/sygra/core/models/custom_models.py
index 515f8188..72aff6ae 100644
--- a/sygra/core/models/custom_models.py
+++ b/sygra/core/models/custom_models.py
@@ -1061,9 +1061,12 @@ async def _generate_native_structured_output(
     async def _generate_response(
         self, input: ChatPromptValue, model_params: ModelParams
     ) -> Tuple[str, int]:
-        # Check if this is a TTS request based on output_type
-        if self.model_config.get("output_type") == "audio":
+        # Check the output type and route to the appropriate method
+        output_type = self.model_config.get("output_type")
+        if output_type == "audio":
             return await self._generate_speech(input, model_params)
+        elif output_type == "image":
+            return await self._generate_image(input, model_params)
         else:
             return await self._generate_text(input, model_params)

@@ -1213,6 +1216,272 @@ async def _generate_speech(

         return resp_text, ret_code

+    async def _generate_image(
+        self, input: ChatPromptValue, model_params: ModelParams
+    ) -> Tuple[str, int]:
+        """
+        Generate or edit images using OpenAI/Azure OpenAI Image API.
+        Auto-detects whether to use generation or editing based on input content:
+        - If input contains images: uses edit_image() API (text+image-to-image)
+        - If input is text only: uses create_image() API (text-to-image)
+
+        Args:
+            input: ChatPromptValue containing text prompt and optionally images
+            model_params: Model parameters including URL and auth token
+
+        Returns:
+            Tuple of (response_text, status_code)
+            - On success: returns base64 encoded image(s) as JSON and 200
+            - On error: returns error message and error code
+        """
+        ret_code = 200
+        model_url = model_params.url
+
+        try:
+            # Extract text and images from messages
+            prompt_text = ""
+            image_data_urls = []
+
+            for message in input.messages:
+                if hasattr(message, "content"):
+                    if isinstance(message.content, str):
+                        content = message.content
+                        if content.startswith("data:image/"):
+                            image_data_urls.append(content)
+                        else:
+                            prompt_text += content + " "
+                    elif isinstance(message.content, list):
+                        for item in message.content:
+                            if isinstance(item, dict):
+                                if item.get("type") == "text":
+                                    prompt_text += item.get("text", "") + " "
+                                elif item.get("type") == "image_url":
+                                    url = item.get("image_url", "")
+                                    if isinstance(url, dict):
+                                        url = url.get("url", "")
+                                    if url.startswith("data:image/"):
+                                        image_data_urls.append(url)
+
+            prompt_text = prompt_text.strip()
+            if not prompt_text:
+                logger.error(f"[{self.name()}] No prompt provided for image generation")
+                return f"{constants.ERROR_PREFIX} No prompt provided for image generation", 400
+
+            if len(prompt_text) < 1000:
+                pass
+            elif self.model_config.get("model") == "dall-e-2" and len(prompt_text) > 1000:
+                logger.warning(
+                    f"[{self.name()}] Prompt exceeds 1000 character limit: {len(prompt_text)} characters"
+                )
+            elif self.model_config.get("model") == "dall-e-3" and len(prompt_text) > 4000:
+                logger.warning(
+                    f"[{self.name()}] Prompt exceeds 4000 character limit: {len(prompt_text)} characters"
+                )
+            elif self.model_config.get("model") == "gpt-image-1" and len(prompt_text) > 32000:
+                logger.warning(
+                    f"[{self.name()}] Prompt exceeds 32000 character limit: {len(prompt_text)} characters"
+                )
+
+            has_images = len(image_data_urls) > 0
+
+            if has_images:
+                # Image editing
+                logger.debug(
+                    f"[{self.name()}] Detected {len(image_data_urls)} image(s) in input, using image edit API"
+                )
+                return await self._edit_image_with_data_urls(
+                    image_data_urls, prompt_text, model_url, model_params
+                )
+            else:
+                # Text-to-image generation
+                logger.debug(
+                    f"[{self.name()}] No input images detected, using text-to-image generation API"
+                )
+                return await self._generate_image_from_text(prompt_text, model_url, model_params)
+
+        except ValueError as e:
+            logger.error(f"[{self.name()}] Invalid image data URL: {e}")
+            resp_text = f"{constants.ERROR_PREFIX} Invalid image data: {e}"
+            ret_code = 400
+        except openai.RateLimitError as e:
+            logger.warning(f"[{self.name()}] OpenAI Image API rate limit: {e}")
+            resp_text = f"{constants.ERROR_PREFIX} Rate limit exceeded: {e}"
+            ret_code = 429
+        except openai.BadRequestError as e:
+            logger.error(f"[{self.name()}] OpenAI Image API bad request: {e}")
+            resp_text = f"{constants.ERROR_PREFIX} Bad request: {e}"
+            ret_code = 400
+        except openai.APIError as e:
+            logger.error(f"[{self.name()}] OpenAI Image API error: {e}")
+            resp_text = f"{constants.ERROR_PREFIX} API error: {e}"
+            ret_code = getattr(e, "status_code", 500)
+        except Exception as x:
+            resp_text = f"{constants.ERROR_PREFIX} Image operation failed: {x}"
+            logger.error(f"[{self.name()}] {resp_text}")
+            rcode = self._get_status_from_body(x)
+            ret_code = rcode if rcode else 999
+
+        return resp_text, ret_code
+
+    async def _generate_image_from_text(
+        self, prompt_text: str, model_url: str, model_params: ModelParams
+    ) -> Tuple[str, int]:
+        """
+        Generate images from text prompts (text-to-image).
+
+        Args:
+            prompt_text: Text prompt for image generation
+            model_url: Model URL
+            model_params: Model parameters
+
+        Returns:
+            Tuple of (response_text, status_code)
+        """
+        self._set_client(model_url, model_params.auth_token)
+
+        params = self.generation_params
+
+        # Check if streaming is enabled
+        is_streaming = params.get("stream", False)
+
+        logger.debug(
+            f"[{self.name()}] Image generation parameters - {params}, streaming: {is_streaming}"
+        )
+
+        openai_client = cast(OpenAIClient, self._client)
+        image_response = await openai_client.create_image(
+            model=str(self.model_config.get("model")), prompt=prompt_text, **params
+        )
+
+        if is_streaming:
+            images_data = await self._process_streaming_image_response(image_response)
+        else:
+            images_data = await self._process_image_response(image_response)
+
+        if len(images_data) == 1:
+            return images_data[0], 200
+        else:
+            return json.dumps(images_data), 200
+
+    async def _process_streaming_image_response(self, stream_response):
+        """
+        Process streaming image generation response.
+        Delegates to image_utils for processing.
+        """
+        from sygra.utils.image_utils import process_streaming_image_response
+
+        return await process_streaming_image_response(stream_response, self.name())
+
+    async def _process_image_response(self, image_response):
+        """
+        Process regular (non-streaming) image response.
+        Delegates to image_utils for processing.
+        """
+        from sygra.utils.image_utils import process_image_response
+
+        return await process_image_response(image_response, self.name())
+
+    async def _url_to_data_url(self, url: str) -> str:
+        """
+        Fetch an image from URL and convert to base64 data URL.
+        Delegates to image_utils for processing.
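+
+        Args:
+            url (str): The image URL to fetch
+
+        Returns:
+            str: Base64-encoded data URL, or the original URL if the fetch fails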
+ """ + from sygra.utils.image_utils import url_to_data_url + + return await url_to_data_url(url, self.name()) + + async def _edit_image_with_data_urls( + self, image_data_urls: list, prompt_text: str, model_url: str, model_params: ModelParams + ) -> Tuple[str, int]: + """ + Edit images using data URLs. + - GPT-Image-1: Supports up to 16 images + - DALL-E-2: Supports only 1 image + + Args: + image_data_urls: List of image data URLs + prompt_text: Edit instruction + model_url: Model URL + model_params: Model parameters + + Returns: + Tuple of (response_text, status_code) + """ + import io + + from sygra.utils.image_utils import parse_image_data_url + + if not prompt_text: + logger.error(f"[{self.name()}] No prompt provided for image editing") + return f"{constants.ERROR_PREFIX} No prompt provided for image editing", 400 + + # Set up the OpenAI client + self._set_client(model_url, model_params.auth_token) + + model_name = str(self.model_config.get("model", "")).lower() + # only gpt-image-1 supports multiple images for editing + supports_multiple_images = "gpt-image-1" == model_name + + num_images = len(image_data_urls) + if not supports_multiple_images and num_images > 1: + logger.warning( + f"[{self.name()}] Model {model_name} only supports single image editing. " + f"Using first image only. Additional {num_images - 1} image(s) will be ignored." + ) + elif supports_multiple_images and num_images > 16: + logger.warning( + f"[{self.name()}] Model {model_name} supports max 16 images. " + f"Using first 16 images only. {num_images - 16} image(s) will be ignored." + ) + image_data_urls = image_data_urls[:16] + + params = self.generation_params + + # Check if streaming is enabled + is_streaming = params.get("stream", False) + + logger.debug( + f"[{self.name()}] Image edit parameters - images: {num_images}, params: {params}, streaming: {is_streaming}" + ) + + # Decode images + if supports_multiple_images and num_images > 1: + # Multiple images for GPT-Image-1 + image_files = [] + for idx, data_url in enumerate(image_data_urls): + _, _, img_bytes = parse_image_data_url(data_url) + img_file = io.BytesIO(img_bytes) + img_file.name = f"image_{idx}.png" + image_files.append(img_file) + + image_param = image_files + else: + # Single image for DALL-E-2 or single image input + _, _, image_bytes = parse_image_data_url(image_data_urls[0]) + image_file = io.BytesIO(image_bytes) + image_file.name = "image.png" + + image_param = [image_file] + + # Call the image edit API + openai_client = cast(OpenAIClient, self._client) + image_response = await openai_client.edit_image( + image=image_param, prompt=prompt_text, **params + ) + + # Handle streaming response + if is_streaming: + images_data = await self._process_streaming_image_response(image_response) + else: + # Process regular response - convert URLs to data URLs + images_data = await self._process_image_response(image_response) + + if len(images_data) == 1: + return images_data[0], 200 + else: + return json.dumps(images_data), 200 + class CustomOllama(BaseCustomModel): def __init__(self, model_config: dict[str, Any]) -> None: diff --git a/sygra/utils/image_utils.py b/sygra/utils/image_utils.py index 8c2af793..680e32f8 100644 --- a/sygra/utils/image_utils.py +++ b/sygra/utils/image_utils.py @@ -5,6 +5,7 @@ from pathlib import Path from typing import Any, Optional, Tuple +import httpx import requests # type: ignore[import-untyped] from PIL import Image @@ -276,3 +277,97 @@ def save_image_data_url( except Exception as e: logger.error(f"Failed to save image data: 
{e}") raise + + +async def url_to_data_url(url: str, model_name: str = "image_model") -> str: + """ + Fetch an image from URL and convert to base64 data URL. + + Args: + url (str): The image URL to fetch + model_name (str): Name of the model (for logging) + + Returns: + str: Base64-encoded data URL + """ + try: + async with httpx.AsyncClient() as client: + response = await client.get(url) + response.raise_for_status() + image_bytes = response.content + + # Convert to base64 + b64_encoded = base64.b64encode(image_bytes).decode("utf-8") + + # Determine format from content-type or default to png + content_type = response.headers.get("content-type", "image/png") + if "image/" in content_type: + image_format = content_type.split("/")[-1] + else: + image_format = "png" + + return f"data:image/{image_format};base64,{b64_encoded}" + except Exception as e: + logger.error(f"[{model_name}] Failed to fetch image from URL {url}: {e}") + # Return original URL as fallback + return url + + +async def process_image_response(image_response: Any, model_name: str = "image_model") -> list[str]: + """ + Process regular (non-streaming) image response. + Converts all URLs to data URLs for consistency. + + Args: + image_response: The response from OpenAI images API + model_name (str): Name of the model (for logging) + + Returns: + list[str]: List of base64-encoded image data URLs + """ + images_data = [] + for img_data in image_response.data: + # Try to get b64_json first + if hasattr(img_data, "b64_json") and img_data.b64_json: + b64_json = img_data.b64_json + # Create base64 encoded data URL + data_url = f"data:image/png;base64,{b64_json}" + images_data.append(data_url) + # Otherwise get URL and convert to data URL + elif hasattr(img_data, "url") and img_data.url: + data_url = await url_to_data_url(img_data.url, model_name) + images_data.append(data_url) + else: + logger.error(f"[{model_name}] Image data has neither b64_json nor url") + + return images_data + + +async def process_streaming_image_response( + stream_response: Any, model_name: str = "image_model" +) -> list[str]: + """ + Process streaming image generation response. + Collects all events and converts URLs to data URLs. 
+ + Args: + stream_response: The streaming response from OpenAI images API + model_name (str): Name of the model (for logging) + + Returns: + list[str]: List of base64-encoded image data URLs + """ + images_data = [] + async for event in stream_response: + if hasattr(event, "data"): + for img_data in event.data: + # Convert to data URL + if hasattr(img_data, "b64_json") and img_data.b64_json: + data_url = f"data:image/png;base64,{img_data.b64_json}" + images_data.append(data_url) + elif hasattr(img_data, "url") and img_data.url: + # Fetch URL and convert to data URL + data_url = await url_to_data_url(img_data.url, model_name) + images_data.append(data_url) + + return images_data diff --git a/sygra/utils/multimodal_processor.py b/sygra/utils/multimodal_processor.py index 3d50e5ce..6ddfe002 100644 --- a/sygra/utils/multimodal_processor.py +++ b/sygra/utils/multimodal_processor.py @@ -89,6 +89,20 @@ def process_value(value: Any, field_name: str, index: int = 0) -> Any: elif isinstance(value, list): return [process_value(item, field_name, idx) for idx, item in enumerate(value)] + elif isinstance(value, str) and value.startswith("["): + # Try to parse JSON arrays (for n>1 image generation) + try: + import json + + parsed = json.loads(value) + if isinstance(parsed, list): + # Recursively process the parsed list + return process_value(parsed, field_name, index) + except (json.JSONDecodeError, ValueError): + # Not valid JSON, return as-is + pass + return value + else: # Return value as-is if it's not a data URL, dict, or list return value @@ -106,6 +120,10 @@ def process_batch_multimodal_data( ) -> list[Dict[str, Any]]: """ Process a batch of records and save all multimodal data to files. + Uses lazy directory creation - directories are created on-demand when saving files. + + This eliminates the need for a pre-check scan, making processing ~50% faster + while still preventing empty directories from being created. 
Args: records: List of records to process @@ -117,9 +135,6 @@ def process_batch_multimodal_data( if not records: return records - # Create multimodal output directory - output_dir.mkdir(parents=True, exist_ok=True) - processed_records: List[Dict[str, Any]] = [] for record in records: # Use record ID if available, otherwise use index @@ -128,5 +143,9 @@ def process_batch_multimodal_data( processed_record = process_record_multimodal_data(record, output_dir, record_id) processed_records.append(processed_record) - logger.info(f"Processed {len(records)} records, saved multimodal files to {output_dir}") + if output_dir.exists(): + logger.info(f"Processed {len(records)} records, saved multimodal files to {output_dir}") + else: + logger.debug(f"Processed {len(records)} records, no multimodal data found") + return processed_records diff --git a/tests/core/models/client/test_openai_azure_client.py b/tests/core/models/client/test_openai_azure_client.py index 6e181831..01852a29 100644 --- a/tests/core/models/client/test_openai_azure_client.py +++ b/tests/core/models/client/test_openai_azure_client.py @@ -1,8 +1,9 @@ +import asyncio import json import sys import unittest from pathlib import Path -from unittest.mock import MagicMock, patch +from unittest.mock import AsyncMock, MagicMock, patch # Add the parent directory to sys.path to import the necessary modules sys.path.append(str(Path(__file__).parent.parent.parent.parent.parent)) @@ -476,6 +477,185 @@ def test_json_decode_error(self, mock_azure_openai): response_format=SampleStructuredOutput, ) + # ===== Image API Tests ===== + + @patch("sygra.core.models.client.openai_azure_client.AsyncAzureOpenAI") + def test_create_image_async(self, mock_async_azure_openai): + """Test create_image with async client""" + # Setup mock + mock_image_response = MagicMock() + mock_async_azure_openai.return_value = MagicMock() + mock_async_azure_openai.return_value.images.generate = AsyncMock( + return_value=mock_image_response + ) + + # Create client + client = OpenAIAzureClient(async_client=True, **self.valid_config) + + # Call create_image (await since it's async) + result = asyncio.run( + client.create_image( + model="dall-e-3", + prompt="A serene mountain landscape", + size="1024x1024", + quality="hd", + n=1, + ) + ) + + # Verify the call was made with correct parameters + mock_async_azure_openai.return_value.images.generate.assert_called_once_with( + model="dall-e-3", + prompt="A serene mountain landscape", + size="1024x1024", + quality="hd", + n=1, + ) + self.assertEqual(result, mock_image_response) + + @patch("sygra.core.models.client.openai_azure_client.AzureOpenAI") + def test_create_image_sync_raises_error(self, mock_azure_openai): + """Test that create_image raises ValueError with sync client""" + mock_azure_openai.return_value = MagicMock() + + # Create sync client + client = OpenAIAzureClient(async_client=False, **self.valid_config) + + # Verify ValueError is raised for sync client (need to await to get the error) + with self.assertRaises(ValueError) as context: + asyncio.run(client.create_image(model="dall-e-3", prompt="A serene mountain landscape")) + + self.assertIn("requires async client", str(context.exception)) + + @patch("sygra.core.models.client.openai_azure_client.AsyncAzureOpenAI") + def test_edit_image_async_single(self, mock_async_azure_openai): + """Test edit_image with async client and single image""" + # Setup mock + mock_image_response = MagicMock() + mock_async_azure_openai.return_value = MagicMock() + 
mock_async_azure_openai.return_value.images.edit = AsyncMock( + return_value=mock_image_response + ) + + # Create client + client = OpenAIAzureClient(async_client=True, **self.valid_config) + + # Mock image file + mock_image_file = MagicMock() + + # Call edit_image with single image (await since it's async) + result = asyncio.run( + client.edit_image( + image=mock_image_file, + prompt="Remove the background", + model="dall-e-2", + n=1, + size="1024x1024", + ) + ) + + # Verify the call was made with correct parameters + mock_async_azure_openai.return_value.images.edit.assert_called_once_with( + image=mock_image_file, + prompt="Remove the background", + model="dall-e-2", + n=1, + size="1024x1024", + ) + self.assertEqual(result, mock_image_response) + + @patch("sygra.core.models.client.openai_azure_client.AsyncAzureOpenAI") + def test_edit_image_async_multiple(self, mock_async_azure_openai): + """Test edit_image with async client and multiple images (GPT-Image-1)""" + # Setup mock + mock_image_response = MagicMock() + mock_async_azure_openai.return_value = MagicMock() + mock_async_azure_openai.return_value.images.edit = AsyncMock( + return_value=mock_image_response + ) + + # Create client + client = OpenAIAzureClient(async_client=True, **self.valid_config) + + # Mock image files (list for multi-image) + mock_image_files = [MagicMock(), MagicMock(), MagicMock()] + + # Call edit_image with multiple images (await since it's async) + result = asyncio.run( + client.edit_image( + image=mock_image_files, prompt="Combine into a collage", model="gpt-image-1", n=2 + ) + ) + + # Verify the call was made with correct parameters + mock_async_azure_openai.return_value.images.edit.assert_called_once_with( + image=mock_image_files, prompt="Combine into a collage", model="gpt-image-1", n=2 + ) + self.assertEqual(result, mock_image_response) + + @patch("sygra.core.models.client.openai_azure_client.AzureOpenAI") + def test_edit_image_sync_raises_error(self, mock_azure_openai): + """Test that edit_image raises ValueError with sync client""" + mock_azure_openai.return_value = MagicMock() + + # Create sync client + client = OpenAIAzureClient(async_client=False, **self.valid_config) + + # Mock image file + mock_image_file = MagicMock() + + # Verify ValueError is raised for sync client (need to await to get the error) + with self.assertRaises(ValueError) as context: + asyncio.run(client.edit_image(image=mock_image_file, prompt="Remove the background")) + + self.assertIn("requires async client", str(context.exception)) + + @patch("sygra.core.models.client.openai_azure_client.AsyncAzureOpenAI") + def test_create_image_variation_async(self, mock_async_azure_openai): + """Test create_image_variation with async client""" + # Setup mock + mock_image_response = MagicMock() + mock_async_azure_openai.return_value = MagicMock() + mock_async_azure_openai.return_value.images.create_variation = AsyncMock( + return_value=mock_image_response + ) + + # Create client + client = OpenAIAzureClient(async_client=True, **self.valid_config) + + # Mock image file + mock_image_file = MagicMock() + + # Call create_image_variation (await since it's async) + result = asyncio.run( + client.create_image_variation( + image=mock_image_file, model="dall-e-2", n=3, size="512x512" + ) + ) + + # Verify the call was made with correct parameters + mock_async_azure_openai.return_value.images.create_variation.assert_called_once_with( + image=mock_image_file, model="dall-e-2", n=3, size="512x512" + ) + self.assertEqual(result, mock_image_response) + + 
@patch("sygra.core.models.client.openai_azure_client.AzureOpenAI") + def test_create_image_variation_sync_raises_error(self, mock_azure_openai): + """Test that create_image_variation raises ValueError with sync client""" + mock_azure_openai.return_value = MagicMock() + + # Create sync client + client = OpenAIAzureClient(async_client=False, **self.valid_config) + + # Mock image file + mock_image_file = MagicMock() + + # Verify ValueError is raised for sync client (need to await to get the error) + with self.assertRaises(ValueError) as context: + asyncio.run(client.create_image_variation(image=mock_image_file)) + + self.assertIn("requires async client", str(context.exception)) + if __name__ == "__main__": unittest.main() diff --git a/tests/core/models/client/test_openai_client.py b/tests/core/models/client/test_openai_client.py index 509d5795..f18c790c 100644 --- a/tests/core/models/client/test_openai_client.py +++ b/tests/core/models/client/test_openai_client.py @@ -2,7 +2,7 @@ import sys import unittest from pathlib import Path -from unittest.mock import MagicMock, patch +from unittest.mock import AsyncMock, MagicMock, patch import httpx @@ -487,6 +487,179 @@ def test_send_request_with_vllm_extensions_async(self, mock_async_openai): extra_body={"guided_json": guided_json}, ) + # ===== Image API Tests ===== + + @patch("sygra.core.models.client.openai_client.AsyncOpenAI") + def test_create_image_async(self, mock_async_openai): + """Test create_image with async client""" + # Setup mock + mock_image_response = MagicMock() + mock_async_openai.return_value = MagicMock() + mock_async_openai.return_value.images.generate = AsyncMock(return_value=mock_image_response) + + # Create client + client = OpenAIClient(async_client=True, **self.async_config) + + # Call create_image (await since it's async) + result = asyncio.run( + client.create_image( + model="dall-e-3", + prompt="A serene mountain landscape", + size="1024x1024", + quality="hd", + n=1, + ) + ) + + # Verify the call was made with correct parameters + mock_async_openai.return_value.images.generate.assert_called_once_with( + model="dall-e-3", + prompt="A serene mountain landscape", + size="1024x1024", + quality="hd", + n=1, + ) + self.assertEqual(result, mock_image_response) + + @patch("sygra.core.models.client.openai_client.OpenAI") + def test_create_image_sync_raises_error(self, mock_openai): + """Test that create_image raises ValueError with sync client""" + mock_openai.return_value = MagicMock() + + # Create sync client + client = OpenAIClient(async_client=False, **self.sync_config) + + # Verify ValueError is raised for sync client (need to await to get the error) + with self.assertRaises(ValueError) as context: + asyncio.run(client.create_image(model="dall-e-3", prompt="A serene mountain landscape")) + + self.assertIn("requires async client", str(context.exception)) + + @patch("sygra.core.models.client.openai_client.AsyncOpenAI") + def test_edit_image_async_single(self, mock_async_openai): + """Test edit_image with async client and single image""" + # Setup mock + mock_image_response = MagicMock() + mock_async_openai.return_value = MagicMock() + mock_async_openai.return_value.images.edit = AsyncMock(return_value=mock_image_response) + + # Create client + client = OpenAIClient(async_client=True, **self.async_config) + + # Mock image file + mock_image_file = MagicMock() + + # Call edit_image with single image (await since it's async) + result = asyncio.run( + client.edit_image( + image=mock_image_file, + prompt="Remove the background", + 
model="dall-e-2", + n=1, + size="1024x1024", + ) + ) + + # Verify the call was made with correct parameters + mock_async_openai.return_value.images.edit.assert_called_once_with( + image=mock_image_file, + prompt="Remove the background", + model="dall-e-2", + n=1, + size="1024x1024", + ) + self.assertEqual(result, mock_image_response) + + @patch("sygra.core.models.client.openai_client.AsyncOpenAI") + def test_edit_image_async_multiple(self, mock_async_openai): + """Test edit_image with async client and multiple images (GPT-Image-1)""" + # Setup mock + mock_image_response = MagicMock() + mock_async_openai.return_value = MagicMock() + mock_async_openai.return_value.images.edit = AsyncMock(return_value=mock_image_response) + + # Create client + client = OpenAIClient(async_client=True, **self.async_config) + + # Mock image files (list for multi-image) + mock_image_files = [MagicMock(), MagicMock(), MagicMock()] + + # Call edit_image with multiple images (await since it's async) + result = asyncio.run( + client.edit_image( + image=mock_image_files, prompt="Combine into a collage", model="gpt-image-1", n=2 + ) + ) + + # Verify the call was made with correct parameters + mock_async_openai.return_value.images.edit.assert_called_once_with( + image=mock_image_files, prompt="Combine into a collage", model="gpt-image-1", n=2 + ) + self.assertEqual(result, mock_image_response) + + @patch("sygra.core.models.client.openai_client.OpenAI") + def test_edit_image_sync_raises_error(self, mock_openai): + """Test that edit_image raises ValueError with sync client""" + mock_openai.return_value = MagicMock() + + # Create sync client + client = OpenAIClient(async_client=False, **self.sync_config) + + # Mock image file + mock_image_file = MagicMock() + + # Verify ValueError is raised for sync client (need to await to get the error) + with self.assertRaises(ValueError) as context: + asyncio.run(client.edit_image(image=mock_image_file, prompt="Remove the background")) + + self.assertIn("requires async client", str(context.exception)) + + @patch("sygra.core.models.client.openai_client.AsyncOpenAI") + def test_create_image_variation_async(self, mock_async_openai): + """Test create_image_variation with async client""" + # Setup mock + mock_image_response = MagicMock() + mock_async_openai.return_value = MagicMock() + mock_async_openai.return_value.images.create_variation = AsyncMock( + return_value=mock_image_response + ) + + # Create client + client = OpenAIClient(async_client=True, **self.async_config) + + # Mock image file + mock_image_file = MagicMock() + + # Call create_image_variation (await since it's async) + result = asyncio.run( + client.create_image_variation( + image=mock_image_file, model="dall-e-2", n=3, size="512x512" + ) + ) + + # Verify the call was made with correct parameters + mock_async_openai.return_value.images.create_variation.assert_called_once_with( + image=mock_image_file, model="dall-e-2", n=3, size="512x512" + ) + self.assertEqual(result, mock_image_response) + + @patch("sygra.core.models.client.openai_client.OpenAI") + def test_create_image_variation_sync_raises_error(self, mock_openai): + """Test that create_image_variation raises ValueError with sync client""" + mock_openai.return_value = MagicMock() + + # Create sync client + client = OpenAIClient(async_client=False, **self.sync_config) + + # Mock image file + mock_image_file = MagicMock() + + # Verify ValueError is raised for sync client (need to await to get the error) + with self.assertRaises(ValueError) as context: + 
asyncio.run(client.create_image_variation(image=mock_image_file)) + + self.assertIn("requires async client", str(context.exception)) + if __name__ == "__main__": unittest.main() diff --git a/tests/core/models/test_custom_openai.py b/tests/core/models/test_custom_openai.py index 17ba8fd6..03900e4e 100644 --- a/tests/core/models/test_custom_openai.py +++ b/tests/core/models/test_custom_openai.py @@ -39,13 +39,14 @@ def setUp(self): "model": "tts-1", "model_type": "openai", "output_type": "audio", - "parameters": {}, "url": "https://api.openai.com/v1", "auth_token": "Bearer sk-test_key_123", "api_version": "2023-05-15", - "voice": "alloy", - "response_format": "mp3", - "speed": 1.0, + "parameters": { + "voice": "alloy", + "response_format": "mp3", + "speed": 1.0, + }, } # Configuration with completions API @@ -66,6 +67,21 @@ def setUp(self): self.tts_messages = [HumanMessage(content="Hello, this is a test of text to speech.")] self.tts_input = ChatPromptValue(messages=self.tts_messages) + # Configuration for Image Generation + self.image_config = { + "name": "dalle3_model", + "model": "dall-e-3", + "output_type": "image", + "url": "https://api.openai.com/v1", + "auth_token": "Bearer sk-test_key_123", + "api_version": "2023-05-15", + "parameters": {"size": "1024x1024", "quality": "standard", "style": "vivid"}, + } + + # Mock messages for Image Generation + self.image_messages = [HumanMessage(content="A serene mountain landscape at sunset")] + self.image_input = ChatPromptValue(messages=self.image_messages) + def test_init(self): """Test initialization of CustomOpenAI""" custom_openai = CustomOpenAI(self.text_config) @@ -317,7 +333,7 @@ async def _run_generate_speech_speed_clamping(self, mock_set_client): mock_client.create_speech = AsyncMock(return_value=mock_response) # Test speed too low - config_low = {**self.tts_config, "speed": 0.1} + config_low = {**self.tts_config, "parameters": {"speed": 0.1}} custom_openai_low = CustomOpenAI(config_low) custom_openai_low._client = mock_client @@ -329,7 +345,7 @@ async def _run_generate_speech_speed_clamping(self, mock_set_client): self.assertEqual(call_args.kwargs["speed"], 0.25) # Test speed too high - config_high = {**self.tts_config, "speed": 5.0} + config_high = {**self.tts_config, "parameters": {"speed": 5.0}} custom_openai_high = CustomOpenAI(config_high) custom_openai_high._client = mock_client @@ -428,10 +444,670 @@ async def _run_generate_response_routes_to_speech(self, mock_set_client): mock_client.create_speech.assert_awaited_once() self.assertEqual(resp_status, 200) + # ===================== Image Generation Tests ===================== + + async def _run_generate_image_success_single(self, mock_set_client): + """Test _generate_image successfully generates a single image""" + import base64 + + # Setup mock client + mock_client = MagicMock() + mock_image_data = b"fake_image_data_png" + mock_b64 = base64.b64encode(mock_image_data).decode("utf-8") + + # Mock the response structure + mock_img = MagicMock() + mock_img.b64_json = mock_b64 + mock_response = MagicMock() + mock_response.data = [mock_img] + + mock_client.create_image = AsyncMock(return_value=mock_response) + + # Setup custom model + custom_openai = CustomOpenAI(self.image_config) + custom_openai._client = mock_client + + # Call _generate_image + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + resp_text, resp_status = await custom_openai._generate_image(self.image_input, model_params) + + # Verify results + self.assertIn("data:image/png;base64,", 
resp_text) + self.assertIn(mock_b64, resp_text) + self.assertEqual(resp_status, 200) + + # Verify method calls + mock_set_client.assert_called_once() + mock_client.create_image.assert_awaited_once() + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_generate_image_success_single(self, mock_set_client): + asyncio.run(self._run_generate_image_success_single(mock_set_client)) + + async def _run_generate_image_success_multiple(self, mock_set_client): + """Test _generate_image successfully generates multiple images (DALL-E-2)""" + import base64 + import json + + # Setup mock client + mock_client = MagicMock() + mock_image_data1 = b"fake_image_data_1" + mock_image_data2 = b"fake_image_data_2" + mock_b64_1 = base64.b64encode(mock_image_data1).decode("utf-8") + mock_b64_2 = base64.b64encode(mock_image_data2).decode("utf-8") + + # Mock the response structure with multiple images + mock_img1 = MagicMock() + mock_img1.b64_json = mock_b64_1 + mock_img2 = MagicMock() + mock_img2.b64_json = mock_b64_2 + mock_response = MagicMock() + mock_response.data = [mock_img1, mock_img2] + + mock_client.create_image = AsyncMock(return_value=mock_response) + + # Setup custom model with n=2 + config = {**self.image_config, "n": 2, "model": "dall-e-2"} + custom_openai = CustomOpenAI(config) + custom_openai._client = mock_client + + # Call _generate_image + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + resp_text, resp_status = await custom_openai._generate_image(self.image_input, model_params) + + # Verify results + result = json.loads(resp_text) + self.assertIsInstance(result, list) + self.assertEqual(len(result), 2) + self.assertIn("data:image/png;base64,", result[0]) + self.assertIn("data:image/png;base64,", result[1]) + self.assertEqual(resp_status, 200) + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_generate_image_success_multiple(self, mock_set_client): + asyncio.run(self._run_generate_image_success_multiple(mock_set_client)) + + async def _run_generate_image_with_different_sizes(self, mock_set_client): + """Test _generate_image with different size parameters""" + import base64 + + sizes = ["256x256", "512x512", "1024x1024", "1792x1024", "1024x1792"] + + for size in sizes: + # Setup mock client + mock_client = MagicMock() + mock_image_data = b"fake_image_data" + mock_b64 = base64.b64encode(mock_image_data).decode("utf-8") + + mock_img = MagicMock() + mock_img.b64_json = mock_b64 + mock_response = MagicMock() + mock_response.data = [mock_img] + + mock_client.create_image = AsyncMock(return_value=mock_response) + + # Setup custom model with specific size + config = {**self.image_config, "parameters": {"size": size}} + custom_openai = CustomOpenAI(config) + custom_openai._client = mock_client + + # Call _generate_image + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + resp_text, resp_status = await custom_openai._generate_image( + self.image_input, model_params + ) + + # Verify results + self.assertIn("data:image/png;base64,", resp_text) + self.assertEqual(resp_status, 200) + + # Verify create_image was called and size was passed via kwargs + mock_client.create_image.assert_called_once() + # Size is in generation_params which gets passed via **params + call_kwargs = mock_client.create_image.call_args.kwargs + self.assertIn("size", call_kwargs) + self.assertEqual(call_kwargs["size"], size) + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def 
test_generate_image_with_different_sizes(self, mock_set_client): + asyncio.run(self._run_generate_image_with_different_sizes(mock_set_client)) + + async def _run_generate_image_with_quality_hd(self, mock_set_client): + """Test _generate_image with HD quality""" + import base64 + + # Setup mock client + mock_client = MagicMock() + mock_image_data = b"fake_hd_image_data" + mock_b64 = base64.b64encode(mock_image_data).decode("utf-8") + + mock_img = MagicMock() + mock_img.b64_json = mock_b64 + mock_response = MagicMock() + mock_response.data = [mock_img] + + mock_client.create_image = AsyncMock(return_value=mock_response) + + # Setup custom model with HD quality + config = {**self.image_config, "parameters": {"quality": "hd"}} + custom_openai = CustomOpenAI(config) + custom_openai._client = mock_client + + # Call _generate_image + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + resp_text, resp_status = await custom_openai._generate_image(self.image_input, model_params) + + # Verify results + self.assertEqual(resp_status, 200) + + # Verify create_image was called with HD quality + mock_client.create_image.assert_called_once() + call_kwargs = mock_client.create_image.call_args.kwargs + self.assertIn("quality", call_kwargs) + self.assertEqual(call_kwargs["quality"], "hd") + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_generate_image_with_quality_hd(self, mock_set_client): + asyncio.run(self._run_generate_image_with_quality_hd(mock_set_client)) + + async def _run_generate_image_with_different_styles(self, mock_set_client): + """Test _generate_image with different style parameters""" + import base64 + + styles = ["vivid", "natural"] + + for style in styles: + # Setup mock client + mock_client = MagicMock() + mock_image_data = b"fake_image_data" + mock_b64 = base64.b64encode(mock_image_data).decode("utf-8") + + mock_img = MagicMock() + mock_img.b64_json = mock_b64 + mock_response = MagicMock() + mock_response.data = [mock_img] + + mock_client.create_image = AsyncMock(return_value=mock_response) + + # Setup custom model with specific style + config = {**self.image_config, "parameters": {"style": style}} + custom_openai = CustomOpenAI(config) + custom_openai._client = mock_client + + # Call _generate_image + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + resp_text, resp_status = await custom_openai._generate_image( + self.image_input, model_params + ) + + # Verify results + self.assertEqual(resp_status, 200) + + # Verify create_image was called with the correct style + mock_client.create_image.assert_called_once() + call_kwargs = mock_client.create_image.call_args.kwargs + self.assertIn("style", call_kwargs) + self.assertEqual(call_kwargs["style"], style) + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_generate_image_with_different_styles(self, mock_set_client): + asyncio.run(self._run_generate_image_with_different_styles(mock_set_client)) + + async def _run_generate_image_empty_prompt(self, mock_set_client): + """Test _generate_image with empty prompt returns error""" + # Setup mock client + mock_client = MagicMock() + + # Setup custom model + custom_openai = CustomOpenAI(self.image_config) + custom_openai._client = mock_client + + # Create empty prompt + empty_messages = [HumanMessage(content="")] + empty_input = ChatPromptValue(messages=empty_messages) + + # Call _generate_image + model_params = ModelParams(url="https://api.openai.com/v1", 
auth_token="sk-test") + resp_text, resp_status = await custom_openai._generate_image(empty_input, model_params) + + # Verify error response + self.assertIn("No prompt provided", resp_text) + self.assertEqual(resp_status, 400) + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_generate_image_empty_prompt(self, mock_set_client): + asyncio.run(self._run_generate_image_empty_prompt(mock_set_client)) + + async def _run_generate_image_rate_limit_error(self, mock_set_client): + """Test _generate_image handles rate limit errors""" + # Setup mock client + mock_client = MagicMock() + rate_limit_error = openai.RateLimitError( + "Rate limit exceeded", + response=MagicMock(status_code=429), + body=None, + ) + mock_client.create_image = AsyncMock(side_effect=rate_limit_error) + + # Setup custom model + custom_openai = CustomOpenAI(self.image_config) + custom_openai._client = mock_client + + # Call _generate_image + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + resp_text, resp_status = await custom_openai._generate_image(self.image_input, model_params) + + # Verify error handling + self.assertIn("Rate limit exceeded", resp_text) + self.assertEqual(resp_status, 429) + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_generate_image_rate_limit_error(self, mock_set_client): + asyncio.run(self._run_generate_image_rate_limit_error(mock_set_client)) + + async def _run_generate_image_bad_request_error(self, mock_set_client): + """Test _generate_image handles bad request errors""" + # Setup mock client + mock_client = MagicMock() + bad_request_error = openai.BadRequestError( + "Invalid size parameter", + response=MagicMock(status_code=400), + body=None, + ) + mock_client.create_image = AsyncMock(side_effect=bad_request_error) + + # Setup custom model + custom_openai = CustomOpenAI(self.image_config) + custom_openai._client = mock_client + + # Call _generate_image + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + resp_text, resp_status = await custom_openai._generate_image(self.image_input, model_params) + + # Verify error handling + self.assertIn("Bad request", resp_text) + self.assertEqual(resp_status, 400) + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_generate_image_bad_request_error(self, mock_set_client): + asyncio.run(self._run_generate_image_bad_request_error(mock_set_client)) + + async def _run_generate_image_api_error(self, mock_set_client): + """Test _generate_image handles API errors""" + # Setup mock client + mock_client = MagicMock() + mock_request = MagicMock() + mock_request.status_code = 500 + api_error = openai.APIError( + "Internal server error", + request=mock_request, + body={"error": {"message": "Internal server error", "type": "api_error"}}, + ) + api_error.status_code = 500 + mock_client.create_image = AsyncMock(side_effect=api_error) + + # Setup custom model + custom_openai = CustomOpenAI(self.image_config) + custom_openai._client = mock_client + + # Call _generate_image + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + resp_text, resp_status = await custom_openai._generate_image(self.image_input, model_params) + + # Verify error handling + self.assertIn("API error", resp_text) + self.assertEqual(resp_status, 500) + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_generate_image_api_error(self, mock_set_client): + 
asyncio.run(self._run_generate_image_api_error(mock_set_client)) + + async def _run_generate_response_routes_to_image(self, mock_set_client): + """Test _generate_response correctly routes to _generate_image when output_type is 'image'""" + import base64 + + # Setup mock client + mock_client = MagicMock() + mock_image_data = b"fake_image_data" + mock_b64 = base64.b64encode(mock_image_data).decode("utf-8") + + mock_img = MagicMock() + mock_img.b64_json = mock_b64 + mock_response = MagicMock() + mock_response.data = [mock_img] + + mock_client.create_image = AsyncMock(return_value=mock_response) + + # Setup custom model with image output type + custom_openai = CustomOpenAI(self.image_config) + custom_openai._client = mock_client + + # Call _generate_response (should route to _generate_image) + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + resp_text, resp_status = await custom_openai._generate_response( + self.image_input, model_params + ) + + # Verify it called create_image (image generation path) + mock_client.create_image.assert_awaited_once() + self.assertEqual(resp_status, 200) + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_generate_response_routes_to_image(self, mock_set_client): + asyncio.run(self._run_generate_response_routes_to_image(mock_set_client)) + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") def test_generate_response_routes_to_speech(self, mock_set_client): asyncio.run(self._run_generate_response_routes_to_speech(mock_set_client)) + # ===== Image Editing Tests ===== + + async def _run_edit_image_single_dalle2(self, mock_set_client): + """Test single image editing with DALL-E-2""" + import base64 + + # Sample image data URL + sample_image = "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mNk+M9QDwADhgGAWjR9awAAAABJRU5ErkJggg==" + + # Create input with image and text + from langchain.schema import HumanMessage + + messages_with_image = [ + HumanMessage( + content=[ + {"type": "image_url", "image_url": sample_image}, + {"type": "text", "text": "Remove the background"}, + ] + ) + ] + image_edit_input = ChatPromptValue(messages=messages_with_image) + + # Setup mock client + mock_client = MagicMock() + mock_image_data = b"fake_edited_image" + mock_b64 = base64.b64encode(mock_image_data).decode("utf-8") + + mock_img = MagicMock() + mock_img.b64_json = mock_b64 + mock_response = MagicMock() + mock_response.data = [mock_img] + + mock_client.edit_image = AsyncMock(return_value=mock_response) + + # Setup custom model for DALL-E-2 + config = {**self.image_config, "model": "dall-e-2"} + custom_openai = CustomOpenAI(config) + custom_openai._client = mock_client + + # Call _generate_image (should auto-detect and route to editing) + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + resp_text, resp_status = await custom_openai._generate_image(image_edit_input, model_params) + + # Verify results + self.assertIn("data:image/png;base64,", resp_text) + self.assertEqual(resp_status, 200) + + # Verify edit_image was called (not create_image) + mock_client.edit_image.assert_called_once() + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_edit_image_single_dalle2(self, mock_set_client): + asyncio.run(self._run_edit_image_single_dalle2(mock_set_client)) + + async def _run_edit_image_multiple_gpt_image_1(self, mock_set_client): + """Test multi-image editing with GPT-Image-1 (2-16 images)""" + import base64 + 
import json + + # Sample image data URLs (3 images) + sample_image_1 = "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mNk+M9QDwADhgGAWjR9awAAAABJRU5ErkJggg==" + sample_image_2 = "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mP8/5+hHgAHggJ/PchI7wAAAABJRU5ErkJggg==" + sample_image_3 = "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mNkYPhfDwAChAHgFPU/tQAAAABJRU5ErkJggg==" + + # Create input with multiple images and text + from langchain.schema import HumanMessage + + messages_with_images = [ + HumanMessage( + content=[ + {"type": "image_url", "image_url": sample_image_1}, + {"type": "image_url", "image_url": sample_image_2}, + {"type": "image_url", "image_url": sample_image_3}, + {"type": "text", "text": "Combine into a collage"}, + ] + ) + ] + image_edit_input = ChatPromptValue(messages=messages_with_images) + + # Setup mock client + mock_client = MagicMock() + mock_image_data1 = b"fake_edited_image_1" + mock_image_data2 = b"fake_edited_image_2" + mock_b64_1 = base64.b64encode(mock_image_data1).decode("utf-8") + mock_b64_2 = base64.b64encode(mock_image_data2).decode("utf-8") + + mock_img1 = MagicMock() + mock_img1.b64_json = mock_b64_1 + mock_img2 = MagicMock() + mock_img2.b64_json = mock_b64_2 + mock_response = MagicMock() + mock_response.data = [mock_img1, mock_img2] + + mock_client.edit_image = AsyncMock(return_value=mock_response) + + # Setup custom model for GPT-Image-1 + config = {**self.image_config, "model": "gpt-image-1", "n": 2} + custom_openai = CustomOpenAI(config) + custom_openai._client = mock_client + + # Call _generate_image + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + resp_text, resp_status = await custom_openai._generate_image(image_edit_input, model_params) + + # Verify results - should return multiple images + result = json.loads(resp_text) + self.assertIsInstance(result, list) + self.assertEqual(len(result), 2) + self.assertEqual(resp_status, 200) + + # Verify edit_image was called with list of images + mock_client.edit_image.assert_called_once() + call_args = mock_client.edit_image.call_args + self.assertIsInstance(call_args.kwargs["image"], list) + self.assertEqual(len(call_args.kwargs["image"]), 3) # 3 input images + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_edit_image_multiple_gpt_image_1(self, mock_set_client): + asyncio.run(self._run_edit_image_multiple_gpt_image_1(mock_set_client)) + + async def _run_edit_image_more_than_16_images(self, mock_set_client): + """Test that >16 images are trimmed to 16 for GPT-Image-1""" + import base64 + + # Create 20 image data URLs + sample_image = "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mNk+M9QDwADhgGAWjR9awAAAABJRU5ErkJggg==" + + # Create input with 20 images + from langchain.schema import HumanMessage + + image_items = [{"type": "image_url", "image_url": sample_image} for _ in range(20)] + image_items.append({"type": "text", "text": "Create a grid"}) + messages_with_many_images = [HumanMessage(content=image_items)] + image_edit_input = ChatPromptValue(messages=messages_with_many_images) + + # Setup mock client + mock_client = MagicMock() + mock_image_data = b"fake_edited_image" + mock_b64 = base64.b64encode(mock_image_data).decode("utf-8") + + mock_img = MagicMock() + mock_img.b64_json = mock_b64 + mock_response = MagicMock() + mock_response.data = [mock_img] + + mock_client.edit_image = 
AsyncMock(return_value=mock_response) + + # Setup custom model for GPT-Image-1 + config = {**self.image_config, "model": "gpt-image-1"} + custom_openai = CustomOpenAI(config) + custom_openai._client = mock_client + + # Call _generate_image + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + + with patch("sygra.core.models.custom_models.logger") as mock_logger: + resp_text, resp_status = await custom_openai._generate_image( + image_edit_input, model_params + ) + + # Verify warning was logged + mock_logger.warning.assert_called_once() + warning_msg = mock_logger.warning.call_args[0][0] + self.assertIn("supports max 16 images", warning_msg) + self.assertIn("4 image(s) will be ignored", warning_msg) + + # Verify only 16 images were passed to API + call_args = mock_client.edit_image.call_args + self.assertEqual(len(call_args.kwargs["image"]), 16) + self.assertEqual(resp_status, 200) + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_edit_image_more_than_16_images(self, mock_set_client): + asyncio.run(self._run_edit_image_more_than_16_images(mock_set_client)) + + async def _run_edit_image_multiple_with_dalle2_warns(self, mock_set_client): + """Test that DALL-E-2 warns and uses only first image when given multiple""" + import base64 + + # Create input with 3 images + sample_image_1 = "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mNk+M9QDwADhgGAWjR9awAAAABJRU5ErkJggg==" + sample_image_2 = "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mP8/5+hHgAHggJ/PchI7wAAAABJRU5ErkJggg==" + sample_image_3 = "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mNkYPhfDwAChAHgFPU/tQAAAABJRU5ErkJggg==" + + from langchain.schema import HumanMessage + + messages_with_images = [ + HumanMessage( + content=[ + {"type": "image_url", "image_url": sample_image_1}, + {"type": "image_url", "image_url": sample_image_2}, + {"type": "image_url", "image_url": sample_image_3}, + {"type": "text", "text": "Edit this"}, + ] + ) + ] + image_edit_input = ChatPromptValue(messages=messages_with_images) + + # Setup mock client + mock_client = MagicMock() + mock_image_data = b"fake_edited_image" + mock_b64 = base64.b64encode(mock_image_data).decode("utf-8") + + mock_img = MagicMock() + mock_img.b64_json = mock_b64 + mock_response = MagicMock() + mock_response.data = [mock_img] + + mock_client.edit_image = AsyncMock(return_value=mock_response) + + # Setup custom model for DALL-E-2 (single image only) + config = {**self.image_config, "model": "dall-e-2"} + custom_openai = CustomOpenAI(config) + custom_openai._client = mock_client + + # Call _generate_image + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + + with patch("sygra.core.models.custom_models.logger") as mock_logger: + resp_text, resp_status = await custom_openai._generate_image( + image_edit_input, model_params + ) + + # Verify warning was logged + mock_logger.warning.assert_called_once() + warning_msg = mock_logger.warning.call_args[0][0] + self.assertIn("only supports single image editing", warning_msg) + self.assertIn("2 image(s) will be ignored", warning_msg) + + self.assertEqual(resp_status, 200) + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_edit_image_multiple_with_dalle2_warns(self, mock_set_client): + asyncio.run(self._run_edit_image_multiple_with_dalle2_warns(mock_set_client)) + + async def _run_edit_image_empty_prompt(self, 
mock_set_client): + """Test error when no edit instruction provided""" + # Sample image but no text prompt + sample_image = "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mNk+M9QDwADhgGAWjR9awAAAABJRU5ErkJggg==" + + from langchain.schema import HumanMessage + + # Only image, no text + messages_image_only = [ + HumanMessage(content=[{"type": "image_url", "image_url": sample_image}]) + ] + image_edit_input = ChatPromptValue(messages=messages_image_only) + + # Setup custom model + config = {**self.image_config, "model": "dall-e-2"} + custom_openai = CustomOpenAI(config) + + # Call _generate_image + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + resp_text, resp_status = await custom_openai._generate_image(image_edit_input, model_params) + + # Verify error response + self.assertIn("###SERVER_ERROR###", resp_text) + self.assertIn("No prompt provided", resp_text) + self.assertEqual(resp_status, 400) + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_edit_image_empty_prompt(self, mock_set_client): + asyncio.run(self._run_edit_image_empty_prompt(mock_set_client)) + + async def _run_edit_image_no_images_routes_to_generation(self, mock_set_client): + """Test that input without images routes to generation (not editing)""" + import base64 + + # Text prompt only, no images + from langchain.schema import HumanMessage + + messages_text_only = [HumanMessage(content="Generate a sunset")] + text_only_input = ChatPromptValue(messages=messages_text_only) + + # Setup mock client + mock_client = MagicMock() + mock_image_data = b"fake_generated_image" + mock_b64 = base64.b64encode(mock_image_data).decode("utf-8") + + mock_img = MagicMock() + mock_img.b64_json = mock_b64 + mock_response = MagicMock() + mock_response.data = [mock_img] + + # Mock create_image (generation) + mock_client.create_image = AsyncMock(return_value=mock_response) + # Mock edit_image (should NOT be called) + mock_client.edit_image = AsyncMock(return_value=mock_response) + + # Setup custom model + config = {**self.image_config, "model": "dall-e-3"} + custom_openai = CustomOpenAI(config) + custom_openai._client = mock_client + + # Call _generate_image + model_params = ModelParams(url="https://api.openai.com/v1", auth_token="sk-test") + resp_text, resp_status = await custom_openai._generate_image(text_only_input, model_params) + + # Verify create_image was called (generation), NOT edit_image + mock_client.create_image.assert_called_once() + mock_client.edit_image.assert_not_called() + self.assertEqual(resp_status, 200) + + @patch("sygra.core.models.custom_models.BaseCustomModel._set_client") + def test_edit_image_no_images_routes_to_generation(self, mock_set_client): + asyncio.run(self._run_edit_image_no_images_routes_to_generation(mock_set_client)) + if __name__ == "__main__": unittest.main() diff --git a/tests/utils/test_multimodal_processor.py b/tests/utils/test_multimodal_processor.py new file mode 100644 index 00000000..68b440c9 --- /dev/null +++ b/tests/utils/test_multimodal_processor.py @@ -0,0 +1,183 @@ +"""Tests for multimodal_processor utility functions.""" + +import shutil +import tempfile +from pathlib import Path + +from sygra.utils.multimodal_processor import ( + is_multimodal_data_url, + process_batch_multimodal_data, +) + + +class TestIsMultimodalDataURL: + """Tests for is_multimodal_data_url function.""" + + def test_image_data_url(self): + """Test that image data URLs are recognized.""" + data_url = 
"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mNk+M9QDwADhgGAWjR9awAAAABJRU5ErkJggg==" + assert is_multimodal_data_url(data_url) is True + + def test_audio_data_url(self): + """Test that audio data URLs are recognized.""" + data_url = ( + "data:audio/mp3;base64,SUQzBAAAAAAAI1RTU0UAAAAPAAADTGF2ZjU4Ljc2LjEwMAAAAAAAAAAAAAAA" + ) + assert is_multimodal_data_url(data_url) is True + + def test_not_data_url(self): + """Test that regular strings are not recognized.""" + assert is_multimodal_data_url("hello world") is False + assert is_multimodal_data_url("http://example.com") is False + assert is_multimodal_data_url(123) is False + assert is_multimodal_data_url(None) is False + + +class TestProcessBatchMultimodalData: + """Tests for process_batch_multimodal_data function.""" + + def setup_method(self): + """Create a temporary directory for each test.""" + self.temp_dir = Path(tempfile.mkdtemp()) + + def teardown_method(self): + """Clean up temporary directory after each test.""" + if self.temp_dir.exists(): + shutil.rmtree(self.temp_dir) + + def test_no_multimodal_data_no_directory_created(self): + """Test that directory is NOT created when there's no multimodal data.""" + records = [ + {"id": "1", "text": "Hello world"}, + {"id": "2", "text": "Another record"}, + ] + + output_dir = self.temp_dir / "multimodal_output" + + # Process records + result = process_batch_multimodal_data(records, output_dir) + + # Directory should NOT be created + assert not output_dir.exists() + + # Records should be unchanged + assert result == records + + def test_with_multimodal_data_directory_created(self): + """Test that directory IS created when there's multimodal data.""" + # Sample 1x1 PNG as data URL + data_url = "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mNk+M9QDwADhgGAWjR9awAAAABJRU5ErkJggg==" + + records = [ + {"id": "1", "image": data_url}, + ] + + output_dir = self.temp_dir / "multimodal_output" + + # Process records + process_batch_multimodal_data(records, output_dir) + + # Directory SHOULD be created + assert output_dir.exists() + assert output_dir.is_dir() + + # Image subdirectory should exist + assert (output_dir / "image").exists() + + def test_mixed_records_directory_created(self): + """Test that directory is created when at least one record has multimodal data.""" + data_url = "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mNk+M9QDwADhgGAWjR9awAAAABJRU5ErkJggg==" + + records = [ + {"id": "1", "text": "No image"}, + {"id": "2", "image": data_url}, # Has image + {"id": "3", "text": "Also no image"}, + ] + + output_dir = self.temp_dir / "multimodal_output" + + # Process records + process_batch_multimodal_data(records, output_dir) + + # Directory SHOULD be created because at least one record has multimodal data + assert output_dir.exists() + + def test_nested_multimodal_data_detected(self): + """Test that nested multimodal data is detected.""" + data_url = "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mNk+M9QDwADhgGAWjR9awAAAABJRU5ErkJggg==" + + records = [ + {"id": "1", "nested": {"deep": {"image": data_url}}}, + ] + + output_dir = self.temp_dir / "multimodal_output" + + # Process records + process_batch_multimodal_data(records, output_dir) + + # Directory SHOULD be created because nested data contains image + assert output_dir.exists() + + def test_multimodal_data_in_list_detected(self): + """Test that multimodal data in lists is detected.""" + data_url = 
"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mNk+M9QDwADhgGAWjR9awAAAABJRU5ErkJggg==" + + records = [ + {"id": "1", "images": [data_url, "text", "more text"]}, + ] + + output_dir = self.temp_dir / "multimodal_output" + + # Process records + process_batch_multimodal_data(records, output_dir) + + # Directory SHOULD be created because list contains image + assert output_dir.exists() + + def test_empty_records_no_directory(self): + """Test that empty records list doesn't create directory.""" + records = [] + + output_dir = self.temp_dir / "multimodal_output" + + # Process records + result = process_batch_multimodal_data(records, output_dir) + + # Directory should NOT be created + assert not output_dir.exists() + + # Result should be empty list + assert result == [] + + def test_json_array_of_data_urls(self): + """Test that JSON arrays of data URLs (n>1 images) are parsed and processed.""" + import json + + # Sample 1x1 PNG as data URL + data_url1 = "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mNk+M9QDwADhgGAWjR9awAAAABJRU5ErkJggg==" + data_url2 = "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mP8/5+hHgAHggJ/PchI7wAAAABJRU5ErkJggg==" + + # Create a JSON string of the array (simulating n>1 image generation) + json_array = json.dumps([data_url1, data_url2]) + + records = [ + {"id": "1", "images": json_array}, + ] + + output_dir = self.temp_dir / "multimodal_output" + + # Process records + result = process_batch_multimodal_data(records, output_dir) + + # Directory SHOULD be created because JSON array contains images + assert output_dir.exists() + assert (output_dir / "image").exists() + + # The JSON array should be parsed and replaced with a list of file paths + assert isinstance(result[0]["images"], list) + assert len(result[0]["images"]) == 2 + # Both should be file paths now + for path in result[0]["images"]: + assert isinstance(path, str) + assert not path.startswith("data:") + assert "image" in path