Commit 9ac6a92

feat: Add Hallucination Risk Calculator to OpenAIChatGenerator (#359)
* Initial commit for adding hallucination calculation
* Fixes
* Some cleaning
* Use our OpenAIChatGenerator instead of their OpenAI backend
* Refactoring, remove unneeded parts
* More refactoring
* Slight updates
* Add MIT License
* Update license headers
* Create config object
* Formatting
* Ignore license headers of hallucination risk calculator
* Refactoring
* License header and reformatting
* Small update
* Formatting
* Linting
* Fix typing issues
* PR comments
* Remove example script to move to cookbook repo
* Update readme
* Add pydocs
* Add docstrings and integration test
* Fix docstring
1 parent 14e9ba8 commit 9ac6a92

File tree

14 files changed: +1011 −7 lines


LICENSE-MIT.txt

Lines changed: 20 additions & 0 deletions

MIT License

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

README.md

Lines changed: 10 additions & 7 deletions

@@ -41,13 +41,14 @@ that includes it. Once it reaches the end of its lifespan, the experiment will b

### Active experiments

| Name | Type | Expected End Date | Dependencies | Cookbook | Discussion |
|---------------------------------------|--------------------------------|-------------------|--------------|----------|---------------|
| [`InMemoryChatMessageStore`][1] | Memory Store | December 2024 | None | <a href="https://colab.research.google.com/github/deepset-ai/haystack-cookbook/blob/main/notebooks/conversational_rag_using_memory.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a> | [Discuss][4] |
| [`ChatMessageRetriever`][2] | Memory Component | December 2024 | None | <a href="https://colab.research.google.com/github/deepset-ai/haystack-cookbook/blob/main/notebooks/conversational_rag_using_memory.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a> | [Discuss][4] |
| [`ChatMessageWriter`][3] | Memory Component | December 2024 | None | <a href="https://colab.research.google.com/github/deepset-ai/haystack-cookbook/blob/main/notebooks/conversational_rag_using_memory.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a> | [Discuss][4] |
| [`QueryExpander`][5] | Query Expansion Component | October 2025 | None | None | [Discuss][6] |
| [`EmbeddingBasedDocumentSplitter`][8] | EmbeddingBasedDocumentSplitter | August 2025 | None | None | [Discuss][7] |
| [`OpenAIChatGenerator`][9] | Chat Generator Component | November 2025 | None | None | [Discuss][10] |

[1]: https://github.com/deepset-ai/haystack-experimental/blob/main/haystack_experimental/chat_message_stores/in_memory.py
[2]: https://github.com/deepset-ai/haystack-experimental/blob/main/haystack_experimental/components/retrievers/chat_message_retriever.py

@@ -57,6 +58,8 @@ that includes it. Once it reaches the end of its lifespan, the experiment will b

[6]: https://github.com/deepset-ai/haystack-experimental/discussions/346
[7]: https://github.com/deepset-ai/haystack-experimental/discussions/356
[8]: https://github.com/deepset-ai/haystack-experimental/blob/main/haystack_experimental/components/preprocessors/embedding_based_document_splitter.py
[9]: https://github.com/deepset-ai/haystack-experimental/blob/main/haystack_experimental/components/generators/chat/openai.py
[10]: https://github.com/deepset-ai/haystack-experimental/discussions/XXX

### Adopted experiments

| Name | Type | Final release |
Lines changed: 30 additions & 0 deletions

loaders:
  - type: haystack_pydoc_tools.loaders.CustomPythonLoader
    search_path: [../../../]
    modules:
      [
        "haystack_experimental.components.generators.chat.openai",
      ]
    ignore_when_discovered: ["__init__"]
processors:
  - type: filter
    expression:
    documented_only: true
    do_not_filter_modules: false
    skip_empty_modules: true
  - type: smart
  - type: crossref
renderer:
  type: haystack_pydoc_tools.renderers.ReadmeCoreRenderer
  excerpt: Enables text generation using LLMs.
  category_slug: experiments-api
  title: Generators
  slug: experimental-generators-api
  order: 42
  markdown:
    descriptive_class_title: false
    classdef_code_block: false
    descriptive_module_title: true
    add_method_class_prefix: true
    add_member_class_prefix: false
  filename: experimental_generators_api.md
Lines changed: 3 additions & 0 deletions

# SPDX-FileCopyrightText: 2022-present deepset GmbH <[email protected]>
#
# SPDX-License-Identifier: Apache-2.0
Lines changed: 18 additions & 0 deletions

# SPDX-FileCopyrightText: 2022-present deepset GmbH <[email protected]>
#
# SPDX-License-Identifier: Apache-2.0

import sys
from typing import TYPE_CHECKING

from lazy_imports import LazyImporter

_import_structure = {
    "openai": ["OpenAIChatGenerator"],
}

if TYPE_CHECKING:
    from .openai import OpenAIChatGenerator as OpenAIChatGenerator

else:
    sys.modules[__name__] = LazyImporter(name=__name__, module_file=__file__, import_structure=_import_structure)
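The `LazyImporter` above replaces the package module in `sys.modules` so that `from ... import OpenAIChatGenerator` only pays the import cost of the `openai` submodule on first use. As a rough stdlib-only sketch of that deferred-import idea (illustrative stand-in only; the real `lazy_imports.LazyImporter` has more machinery), a `ModuleType` subclass can resolve exported names on first attribute access:

```python
# Minimal sketch of lazy submodule exports, assuming the same
# {"submodule": ["ExportedName", ...]} import-structure mapping.
import importlib
import types


class LazyModule(types.ModuleType):
    """Module proxy that imports a submodule attribute on first access."""

    def __init__(self, name: str, import_structure: dict[str, list[str]]):
        super().__init__(name)
        # Map exported attribute name -> submodule that defines it.
        self._attr_to_module = {
            attr: submodule
            for submodule, attrs in import_structure.items()
            for attr in attrs
        }

    def __getattr__(self, attr: str):
        # Only called when `attr` is not yet in the module's __dict__.
        if attr in self._attr_to_module:
            submodule = importlib.import_module(
                f"{self.__name__}.{self._attr_to_module[attr]}"
            )
            value = getattr(submodule, attr)
            setattr(self, attr, value)  # cache so later lookups skip __getattr__
            return value
        raise AttributeError(f"module {self.__name__!r} has no attribute {attr!r}")


# Demo against a real stdlib package: resolve collections.abc.Mapping lazily.
lazy = LazyModule("collections", {"abc": ["Mapping"]})
mapping_cls = lazy.Mapping
```

The same effect can also be had with a module-level `__getattr__` (PEP 562); the `sys.modules` replacement approach used in the commit works on the importing side without touching each submodule.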
Lines changed: 193 additions & 0 deletions

# SPDX-FileCopyrightText: 2022-present deepset GmbH <[email protected]>
#
# SPDX-License-Identifier: Apache-2.0

from dataclasses import replace
from typing import Any, Optional, Union

from haystack import component
from haystack.components.generators.chat.openai import OpenAIChatGenerator as BaseOpenAIChatGenerator
from haystack.dataclasses import ChatMessage, StreamingCallbackT
from haystack.tools import Tool, Toolset

from haystack_experimental.utils.hallucination_risk_calculator.dataclasses import HallucinationScoreConfig
from haystack_experimental.utils.hallucination_risk_calculator.openai_planner import calculate_hallucination_metrics


@component
class OpenAIChatGenerator(BaseOpenAIChatGenerator):
    """
    An OpenAI chat-based text generator component that supports hallucination risk scoring.

    This is based on the paper
    [LLMs are Bayesian, in Expectation, not in Realization](https://arxiv.org/abs/2507.11768).

    ## Usage Example:

    ```python
    from haystack.dataclasses import ChatMessage

    from haystack_experimental.utils.hallucination_risk_calculator.dataclasses import HallucinationScoreConfig
    from haystack_experimental.components.generators.chat.openai import OpenAIChatGenerator

    # Evidence-based Example
    llm = OpenAIChatGenerator(model="gpt-4o")
    rag_result = llm.run(
        messages=[
            ChatMessage.from_user(
                text="Task: Answer strictly based on the evidence provided below.\n"
                "Question: Who won the Nobel Prize in Physics in 2019?\n"
                "Evidence:\n"
                "- Nobel Prize press release (2019): James Peebles (1/2); Michel Mayor & Didier Queloz (1/2).\n"
                "Constraints: If evidence is insufficient or conflicting, refuse."
            )
        ],
        hallucination_score_config=HallucinationScoreConfig(skeleton_policy="evidence_erase"),
    )
    print(f"Decision: {rag_result['replies'][0].meta['hallucination_decision']}")
    print(f"Risk bound: {rag_result['replies'][0].meta['hallucination_risk']:.3f}")
    print(f"Rationale: {rag_result['replies'][0].meta['hallucination_rationale']}")
    print(f"Answer:\n{rag_result['replies'][0].text}")
    print("---")
    ```
    """

    @component.output_types(replies=list[ChatMessage])
    def run(
        self,
        messages: list[ChatMessage],
        streaming_callback: Optional[StreamingCallbackT] = None,
        generation_kwargs: Optional[dict[str, Any]] = None,
        *,
        tools: Optional[Union[list[Tool], Toolset]] = None,
        tools_strict: Optional[bool] = None,
        hallucination_score_config: Optional[HallucinationScoreConfig] = None,
    ) -> dict[str, list[ChatMessage]]:
        """
        Invokes chat completion based on the provided messages and generation parameters.

        :param messages:
            A list of ChatMessage instances representing the input messages.
        :param streaming_callback:
            A callback function that is called when a new token is received from the stream.
        :param generation_kwargs:
            Additional keyword arguments for text generation. These parameters will
            override the parameters passed during component initialization.
            For details on OpenAI API parameters, see [OpenAI documentation](https://platform.openai.com/docs/api-reference/chat/create).
        :param tools:
            A list of tools or a Toolset for which the model can prepare calls. If set, it will override the
            `tools` parameter set during component initialization. This parameter can accept either a list of
            `Tool` objects or a `Toolset` instance.
        :param tools_strict:
            Whether to enable strict schema adherence for tool calls. If set to `True`, the model will follow exactly
            the schema provided in the `parameters` field of the tool definition, but this may increase latency.
            If set, it will override the `tools_strict` parameter set during component initialization.
        :param hallucination_score_config:
            If provided, the generator will evaluate the hallucination risk of its responses using
            the OpenAIPlanner and annotate each response with hallucination metrics.
            This involves generating multiple samples and analyzing their consistency, which may increase
            latency and cost. Use this option when you need to assess the reliability of the generated content
            in scenarios where accuracy is critical.
            For details, see the [research paper](https://arxiv.org/abs/2507.11768).

        :returns:
            A dictionary with the following key:
            - `replies`: A list containing the generated responses as ChatMessage instances. If hallucination
              scoring is enabled, each message will include additional metadata:
              - `hallucination_decision`: "ANSWER" if the model decided to answer, "REFUSE" if it abstained.
              - `hallucination_risk`: The EDFL hallucination risk bound.
              - `hallucination_rationale`: The rationale behind the hallucination decision.
        """
        if len(messages) == 0:
            return {"replies": []}

        # Call parent implementation
        result = super(OpenAIChatGenerator, self).run(
            messages=messages,
            streaming_callback=streaming_callback,
            generation_kwargs=generation_kwargs,
            tools=tools,
            tools_strict=tools_strict,
        )
        completions = result["replies"]

        # Add hallucination scoring if configured
        if hallucination_score_config and messages[-1].text:
            hallucination_meta = calculate_hallucination_metrics(
                prompt=messages[-1].text, hallucination_score_config=hallucination_score_config, chat_generator=self
            )
            completions = [replace(m, _meta={**m.meta, **hallucination_meta}) for m in completions]

        return {"replies": completions}

    @component.output_types(replies=list[ChatMessage])
    async def run_async(
        self,
        messages: list[ChatMessage],
        streaming_callback: Optional[StreamingCallbackT] = None,
        generation_kwargs: Optional[dict[str, Any]] = None,
        *,
        tools: Optional[Union[list[Tool], Toolset]] = None,
        tools_strict: Optional[bool] = None,
        hallucination_score_config: Optional[HallucinationScoreConfig] = None,
    ) -> dict[str, list[ChatMessage]]:
        """
        Asynchronously invokes chat completion based on the provided messages and generation parameters.

        This is the asynchronous version of the `run` method. It has the same parameters and return values
        but can be used with `await` in async code.

        :param messages:
            A list of ChatMessage instances representing the input messages.
        :param streaming_callback:
            A callback function that is called when a new token is received from the stream.
            Must be a coroutine.
        :param generation_kwargs:
            Additional keyword arguments for text generation. These parameters will
            override the parameters passed during component initialization.
            For details on OpenAI API parameters, see [OpenAI documentation](https://platform.openai.com/docs/api-reference/chat/create).
        :param tools:
            A list of tools or a Toolset for which the model can prepare calls. If set, it will override the
            `tools` parameter set during component initialization. This parameter can accept either a list of
            `Tool` objects or a `Toolset` instance.
        :param tools_strict:
            Whether to enable strict schema adherence for tool calls. If set to `True`, the model will follow exactly
            the schema provided in the `parameters` field of the tool definition, but this may increase latency.
            If set, it will override the `tools_strict` parameter set during component initialization.
        :param hallucination_score_config:
            If provided, the generator will evaluate the hallucination risk of its responses using
            the OpenAIPlanner and annotate each response with hallucination metrics.
            This involves generating multiple samples and analyzing their consistency, which may increase
            latency and cost. Use this option when you need to assess the reliability of the generated content
            in scenarios where accuracy is critical.
            For details, see the [research paper](https://arxiv.org/abs/2507.11768).

        :returns:
            A dictionary with the following key:
            - `replies`: A list containing the generated responses as ChatMessage instances. If hallucination
              scoring is enabled, each message will include additional metadata:
              - `hallucination_decision`: "ANSWER" if the model decided to answer, "REFUSE" if it abstained.
              - `hallucination_risk`: The EDFL hallucination risk bound.
              - `hallucination_rationale`: The rationale behind the hallucination decision.
        """
        if len(messages) == 0:
            return {"replies": []}

        # Call parent implementation
        result = await super(OpenAIChatGenerator, self).run_async(
            messages=messages,
            streaming_callback=streaming_callback,
            generation_kwargs=generation_kwargs,
            tools=tools,
            tools_strict=tools_strict,
        )
        completions = result["replies"]

        # Add hallucination scoring if configured
        if hallucination_score_config and messages[-1].text:
            hallucination_meta = calculate_hallucination_metrics(
                prompt=messages[-1].text, hallucination_score_config=hallucination_score_config, chat_generator=self
            )
            completions = [replace(m, _meta={**m.meta, **hallucination_meta}) for m in completions]

        return {"replies": completions}
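The `replace(m, _meta={**m.meta, **hallucination_meta})` step works because `dataclasses.replace` builds a new instance rather than mutating the original, which is how you attach metadata to immutable message objects. A self-contained toy of that merge pattern (the `Message` class, `_meta` field, and `annotate` helper here are illustrative stand-ins, not the real haystack `ChatMessage` API):

```python
# Sketch: merging new metadata into frozen dataclass instances via replace().
from dataclasses import dataclass, field, replace
from typing import Any


@dataclass(frozen=True)
class Message:
    """Hypothetical stand-in for a frozen chat message with private metadata."""

    text: str
    _meta: dict[str, Any] = field(default_factory=dict)

    @property
    def meta(self) -> dict[str, Any]:
        return self._meta


def annotate(completions: list[Message], extra_meta: dict[str, Any]) -> list[Message]:
    # replace() returns new frozen instances; the dict merge keeps the
    # existing metadata and overlays the hallucination metrics on top.
    return [replace(m, _meta={**m.meta, **extra_meta}) for m in completions]


replies = [Message(text="James Peebles, Michel Mayor and Didier Queloz.", _meta={"model": "gpt-4o"})]
annotated = annotate(replies, {"hallucination_decision": "ANSWER", "hallucination_risk": 0.05})
```

The originals are left untouched, so callers holding references to the un-annotated replies are unaffected.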
Lines changed: 3 additions & 0 deletions

# SPDX-FileCopyrightText: 2022-present deepset GmbH <[email protected]>
#
# SPDX-License-Identifier: Apache-2.0
Lines changed: 5 additions & 0 deletions

# ruff: noqa: D103
# Original code Copyright (c) 2024 Hassana Labs
# Licensed under the MIT License (see LICENSE-MIT).
# Modified by deepset, 2025.
# Licensed under the Apache License, Version 2.0 (see LICENSE-APACHE).
