deepsense-ai
diff --git a/docs/how-to/llms/use_llms.md b/docs/how-to/llms/use_llms.md
Lines changed: 1 addition & 1 deletion
diff --git a/docs/how-to/prompts/use_attachments_in_prompts.md b/docs/how-to/prompts/use_attachments_in_prompts.md
Lines changed: 121 additions & 0 deletions
diff --git a/docs/how-to/prompts/use_images_in_prompts.md b/docs/how-to/prompts/use_images_in_prompts.md
Lines changed: 0 additions & 124 deletions
diff --git a/examples/README.md b/examples/README.md
Lines changed: 2 additions & 1 deletion
diff --git a/examples/core/prompt/multimodal_with_few_shots.py b/examples/core/prompt/multimodal_with_few_shots.py
Lines changed: 9 additions & 11 deletions
diff --git a/examples/core/prompt/multimodal.py b/examples/core/prompt/multimodal_with_image.py
rename from examples/core/prompt/multimodal.py
rename to examples/core/prompt/multimodal_with_image.py
Lines changed: 6 additions & 10 deletions
@@ -58,7 +58,7 @@ Ragbits provides a flexible way to interact with LLMs by allowing you to use [`P
 
 ## Using prompts with LLMs
 
-Prompts in Ragbits are powerful tools for structuring inputs and outputs when interacting with LLMs. They allow you to define **system prompts**, **user prompts**, and even **structured output formats** using Pydantic models. For more details on using prompts, check out the [Prompting Guide](https://ragbits.deepsense.ai/how-to/use_prompting/). For more advanced use cases, such as using images in prompts, check out the guide: [How-To: Use images in prompts wirh Ragbits](../prompts/use_images_in_prompts.md).
+Prompts in Ragbits are powerful tools for structuring inputs and outputs when interacting with LLMs. They allow you to define **system prompts**, **user prompts**, and even **structured output formats** using Pydantic models. For more details on using prompts, check out the [Prompting Guide](https://ragbits.deepsense.ai/how-to/use_prompting/). For more advanced use cases, such as using attachments in prompts, check out the guide: [How-To: Use attachments in prompts with Ragbits](../prompts/use_attachments_in_prompts.md).
 
 ```python
 from ragbits.core.prompt import Prompt
 
@@ -0,0 +1,121 @@
+# How-To: Use attachments in prompts with Ragbits
+
+This guide will walk you through defining and using prompts in Ragbits that accept attachments as input.
+It covers handling single and multiple attachment inputs, incorporating conditionals in prompt templates based on the presence of attachments, and using such prompts with an LLM.
+
+Attachment types currently supported include standard image formats (such as JPEG, PNG) and PDF documents.
+
+## How to define a prompt with an attachment input
+
+The attachment is represented by the `prompt.Attachment` class.
+It can be initialized in multiple ways depending on the source of the file data.
+The class supports both binary data and URLs, with optional MIME type specification.
+
+```python
+from ragbits.core.prompt import Attachment
+
+file_bytes = Attachment(data=b"file_bytes")
+image_url = Attachment(url="http://image.jpg")
+pdf_url = Attachment(url="http://document.pdf")
+file_with_url_and_mime = Attachment(url="http://address.pl/file_with_no_extension", mime_type="jpeg")
+```
+
+To define a prompt that takes an attachment as input, create a Pydantic model representing the input structure.
+The model should include a field that holds an instance of the `prompt.Attachment` class; the field's name does not matter.
+
+To pass multiple attachments, just define multiple fields of type `Attachment` or a single field that is a list of `Attachment` instances.
+
+
+```python
+import asyncio
+from pydantic import BaseModel
+from ragbits.core.prompt import Attachment, Prompt
+from ragbits.core.llms.litellm import LiteLLM
+
+class EmployeeOnboardingInput(BaseModel):
+    """
+    Input model for employee onboarding files.
+    """
+    headshot: Attachment
+    contract: Attachment
+    documents: list[Attachment]
+
+
+class EmployeeOnboardingPrompt(Prompt[EmployeeOnboardingInput]):
+    """
+    A prompt to process employee onboarding files.
+    """
+
+    user_prompt = "Review the employee onboarding files and provide feedback."
+
+
+async def main():
+    llm = LiteLLM("gpt-4o")
+
+    headshot = Attachment(data=b"<your_photo_here>")
+    contract = Attachment(data=b"<your_contract_here>")
+    documents = [
+        Attachment(data=b"<your_document_1_here>"),
+        Attachment(data=b"<your_document_2_here>"),
+    ]
+    prompt = EmployeeOnboardingPrompt(
+        EmployeeOnboardingInput(headshot=headshot, contract=contract, documents=documents)
+    )
+    response = await llm.generate(prompt)
+    print(response)
+
+
+asyncio.run(main())
+```
+
+## Using conditionals in templates
+
+Sometimes, you may want to modify the prompt based on whether an attachment is provided. Jinja conditionals can help achieve this.
+
+```python
+import asyncio
+from pydantic import BaseModel
+from ragbits.core.prompt import Attachment, Prompt
+from ragbits.core.llms.litellm import LiteLLM
+
+class QuestionWithOptionalPhotoInput(BaseModel):
+    """
+    Input model that optionally includes a photo.
+    """
+    question: str
+    reference_photo: Attachment | None = None
+
+
+class QuestionWithPhotoPrompt(Prompt[QuestionWithOptionalPhotoInput]):
+    """
+    A prompt that considers whether a photo is provided.
+    """
+
+    system_prompt = """
+    You are a knowledgeable assistant providing detailed answers.
+    If a photo is provided, use it as a reference for your response.
+    """
+
+    user_prompt = """
+    User asked: {{ question }}
+    {% if reference_photo %}
+    Here is a reference photo: {{ reference_photo }}
+    {% else %}
+    No photo was provided.
+    {% endif %}
+    """
+
+
+async def main():
+    llm = LiteLLM("gpt-4o")
+    input_with_photo = QuestionWithOptionalPhotoInput(
+        question="What animal do you see in this photo?", reference_photo=Attachment(data=b"<your_photo_here>")
+    )
+    input_without_photo = QuestionWithOptionalPhotoInput(question="What is the capital of France?")
+
+    print(await llm.generate(QuestionWithPhotoPrompt(input_with_photo)))
+    print(await llm.generate(QuestionWithPhotoPrompt(input_without_photo)))
+
+
+asyncio.run(main())
+```
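Note on the guide above: it says `Attachment` accepts raw bytes or a URL "with optional MIME type specification", but does not show how a caller might work out the MIME type when it is not obvious (for example, the `file_with_no_extension` URL). The sketch below is one way to do that with the standard library only. It is not taken from Ragbits and makes no claim about how Ragbits detects types internally; `guess_mime_from_bytes`, `guess_mime_from_url`, and `to_data_url` are hypothetical helper names, and the signature table covers only the formats the guide lists as supported (JPEG, PNG, PDF).

```python
import base64
import mimetypes
from typing import Optional

# Leading "magic" bytes for the formats the guide lists as supported.
# This table is illustrative, not exhaustive.
_SIGNATURES = {
    b"\x89PNG\r\n\x1a\n": "image/png",
    b"\xff\xd8\xff": "image/jpeg",
    b"%PDF-": "application/pdf",
}


def guess_mime_from_bytes(data: bytes) -> Optional[str]:
    """Hypothetical helper: infer a MIME type from the first bytes of a file."""
    for signature, mime in _SIGNATURES.items():
        if data.startswith(signature):
            return mime
    return None  # Unknown format: the caller has to supply the type.


def guess_mime_from_url(url: str) -> Optional[str]:
    """Hypothetical helper: infer a MIME type from the URL's file extension."""
    mime, _encoding = mimetypes.guess_type(url)
    return mime  # None when the path has no recognizable extension.


def to_data_url(data: bytes, mime: str) -> str:
    """Hypothetical helper: inline binary content as an RFC 2397 data URL."""
    encoded = base64.b64encode(data).decode("ascii")
    return f"data:{mime};base64,{encoded}"
```

A caller could run one of the guessing helpers first and pass an explicit `mime_type` only when the result is `None`. Whether the guessed string is in the form `Attachment` expects is an assumption to verify against the Ragbits API reference, since the guide's own example passes `mime_type="jpeg"`.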
@@ -16,7 +16,8 @@ All necessary details are provided in the comments at the top of each script.
 |:-------------------------------------------------------------------------------------------------|:------------------------------------------------------------:|:--------------------------------------------------------------------------------------------------------------------------------------------------------|
 | [Text Prompt](/examples/core/prompt/text.py)                                                     |            [ragbits-core](/packages/ragbits-core)            | Example of how to use the `Prompt` class to generate themed text using an LLM with a simple text prompt.                                                |
 | [Text Prompt with Few Shots](/examples/core/prompt/text_with_few_shots.py)                       |            [ragbits-core](/packages/ragbits-core)            | Example of how to use the `Prompt` class to generate themed text using an LLM with a text prompt and few-shot examples.                                 |
-| [Multimodal Prompt](/examples/core/prompt/multimodal.py)                                         |            [ragbits-core](/packages/ragbits-core)            | Example of how to use the `Prompt` class to generate themed text using an LLM with both text and image inputs.                                          |
+| [Multimodal Prompt with Image Input](/examples/core/prompt/multimodal_with_image.py)             |            [ragbits-core](/packages/ragbits-core)            | Example of how to use the `Prompt` class to generate themed text using an LLM with both text and image inputs.                                          |
+| [Multimodal Prompt with PDF Input](/examples/core/prompt/multimodal_with_pdf.py)                 |            [ragbits-core](/packages/ragbits-core)            | Example of how to use the `Prompt` class to answer a question using an LLM with both text and PDF inputs.                                               |
 | [Multimodal Prompt with Few Shots](/examples/core/prompt/multimodal_with_few_shots.py)           |            [ragbits-core](/packages/ragbits-core)            | Example of how to use the `Prompt` class to generate themed text using an LLM with multimodal inputs and few-shot examples.                             |
 | [Tool Use with LLM](/examples/core/llms/tool_use.py)                                             |            [ragbits-core](/packages/ragbits-core)            | Example of how to provide tools and return tool calls from LLM.                                                                                         |
 | [OpenTelemetry Audit](/examples/core/audit/otel.py)                                              |            [ragbits-core](/packages/ragbits-core)            | Example of how to collect traces and metrics using Ragbits audit module with OpenTelemetry.                                                             |
 
@@ -24,7 +24,7 @@
 from pydantic import BaseModel
 
 from ragbits.core.llms import LiteLLM
-from ragbits.core.prompt import Prompt
+from ragbits.core.prompt import Attachment, Prompt
 
 
 class ImagePromptInput(BaseModel):
@@ -33,7 +33,7 @@ class ImagePromptInput(BaseModel):
     """
 
     theme: str
-    image_url: str
+    image: Attachment
 
 
 class ImagePromptOutput(BaseModel):
@@ -57,20 +57,18 @@ class ImagePrompt(Prompt[ImagePromptInput, ImagePromptOutput]):
     Theme: {{ theme }}
     """
 
-    image_input_fields = ["image_url"]
-
     few_shots = [
         (
             ImagePromptInput(
                 theme="pirates",
-                image_url="https://upload.wikimedia.org/wikipedia/commons/5/55/Acd_a_frame.jpg",
+                image=Attachment(url="https://upload.wikimedia.org/wikipedia/commons/5/55/Acd_a_frame.jpg"),
             ),
             ImagePromptOutput(description="Arrr, that would be a dog!"),
         ),
         (
             ImagePromptInput(
                 theme="fairy tale",
-                image_url="https://upload.wikimedia.org/wikipedia/commons/6/62/Red_Wolf.jpg",
+                image=Attachment(url="https://upload.wikimedia.org/wikipedia/commons/6/62/Red_Wolf.jpg"),
             ),
             ImagePromptOutput(
                 description="Once upon a time, in an enchanted forest, a noble wolf roamed under the moonlit sky."
@@ -79,7 +77,9 @@ class ImagePrompt(Prompt[ImagePromptInput, ImagePromptOutput]):
         (
             ImagePromptInput(
                 theme="sci-fi",
-                image_url="https://upload.wikimedia.org/wikipedia/commons/thumb/9/91/Bruce_McCandless_II_during_EVA_in_1984.jpg/2560px-Bruce_McCandless_II_during_EVA_in_1984.jpg",
+                image=Attachment(
+                    url="https://upload.wikimedia.org/wikipedia/commons/thumb/9/91/Bruce_McCandless_II_during_EVA_in_1984.jpg/2560px-Bruce_McCandless_II_during_EVA_in_1984.jpg"
+                ),
             ),
             ImagePromptOutput(
                 description="A lone astronaut drifts through the void, bathed in the eerie glow of distant galaxies."
@@ -93,10 +93,8 @@ async def main() -> None:
     Run the example.
     """
     llm = LiteLLM(model_name="gpt-4o-2024-08-06", use_structured_output=True)
-    prompt_input = ImagePromptInput(
-        image_url="https://upload.wikimedia.org/wikipedia/en/8/85/Cute_Dom_cat.JPG",
-        theme="dramatic",
-    )
+    image = Attachment(url="https://upload.wikimedia.org/wikipedia/en/8/85/Cute_Dom_cat.JPG")
+    prompt_input = ImagePromptInput(image=image, theme="dramatic")
     prompt = ImagePrompt(prompt_input)
     response = await llm.generate(prompt)
     print(response.description)
 
@@ -1,5 +1,5 @@
 """
-Ragbits Core Example: Multimodal Prompt
+Ragbits Core Example: Multimodal Prompt with Image Input
 
 This example demonstrates how to use the `Prompt` class to generate themed text using an LLM
 with both text and image inputs. We define an `ImagePrompt` that generates a themed description
@@ -8,7 +8,7 @@
 To run the script, execute the following command:
 
     ```bash
-    uv run examples/core/prompt/multimodal.py
+    uv run examples/core/prompt/multimodal_with_image.py
     ```
 """
 
@@ -24,7 +24,7 @@
 from pydantic import BaseModel
 
 from ragbits.core.llms import LiteLLM
-from ragbits.core.prompt import Prompt
+from ragbits.core.prompt import Attachment, Prompt
 
 
 class ImagePromptInput(BaseModel):
@@ -33,7 +33,7 @@ class ImagePromptInput(BaseModel):
     """
 
     theme: str
-    image_url: str
+    image: Attachment
 
 
 class ImagePromptOutput(BaseModel):
@@ -57,18 +57,14 @@ class ImagePrompt(Prompt[ImagePromptInput, ImagePromptOutput]):
     Theme: {{ theme }}
     """
 
-    image_input_fields = ["image_url"]
-
 
 async def main() -> None:
     """
     Run the example.
     """
     llm = LiteLLM(model_name="gpt-4o-2024-08-06", use_structured_output=True)
-    prompt_input = ImagePromptInput(
-        image_url="https://upload.wikimedia.org/wikipedia/en/8/85/Cute_Dom_cat.JPG",
-        theme="dramatic",
-    )
+    image = Attachment(url="https://upload.wikimedia.org/wikipedia/en/8/85/Cute_Dom_cat.JPG")
+    prompt_input = ImagePromptInput(image=image, theme="dramatic")
     prompt = ImagePrompt(prompt_input)
     response = await llm.generate(prompt)
     print(response.description)