Merge pull request #1402 from jannikmaierhoefer/add-observability-docs-page

geoand · web-flow · commit a32261015ad2 · 2025-04-03T16:57:44.000+03:00
docs: add page on observability
diff --git a/docs/modules/ROOT/nav.adoc b/docs/modules/ROOT/nav.adoc
@@ -46,3 +46,4 @@
 ** xref:fault-tolerance.adoc[Fault Tolerance]
 ** xref:websockets.adoc[WebSockets]
 ** xref:enable-disable-integrations.adoc[Enabling and Disabling Integrations]
+** xref:observability.adoc[Observability]
diff --git a/docs/modules/ROOT/pages/ai-services.adoc b/docs/modules/ROOT/pages/ai-services.adoc
@@ -334,187 +334,6 @@ public class MyCustomModerationSupplier implements Supplier<ModerationModel> {
 }
 ----
 
-== Observability
-
-Observability is built into services created via `@RegisterAiService` and is provided in the following form:
-
-* Metrics are enabled when `quarkus-micrometer` is part of the application
-* Traces are enabled when `quarkus-opentelemetry` is part of the application
-  
-=== Metrics
-
-Each AI method is automatically timed and the timer data is available using the `langchain4j.aiservices.timed` metric with the appropriate tags.
-A counter is also available using the `langchain4j.aiservices.counted` metric.
-
-For example, if the AI service looks like:
-
-[source,java]
-----
-@RegisterAiService
-public interface PoemAiService {
-
-    @SystemMessage("You are a professional poet")
-    @UserMessage("Write a poem about {topic}. The poem should be {lines} lines long")
-    String writeAPoem(String topic, int lines);
-}
-----
-
-and one chooses to use `quarkus-micrometer-registry-prometheus`, then the metrics could be:
-
-[source]
-----
-# HELP langchain4j_aiservices_timed_seconds
-# TYPE langchain4j_aiservices_timed_seconds summary
-langchain4j_aiservices_timed_seconds_count{aiservice="PoemAiService",method="writeAPoem",} 1.0
-langchain4j_aiservices_timed_seconds_sum{aiservice="PoemAiService",method="writeAPoem",} 4.241446681
-# HELP langchain4j_aiservices_timed_seconds_max
-# TYPE langchain4j_aiservices_timed_seconds_max gauge
-langchain4j_aiservices_timed_seconds_max{aiservice="PoemAiService",method="writeAPoem",} 4.241446681
-
-# HELP langchain4j_aiservices_counted_total
-# TYPE langchain4j_aiservices_counted_total counter
-langchain4j_aiservices_counted_total{aiservice="PoemAiService",exception="none",method="writeAPoem",result="success",} 1.0
-----
-
-=== Tracing
-
-Each AI method creates its own span using the `langchain4j.aiservices.$interface_name.$method_name` template for the name.
-Furthermore, tool invocations also create a span using the `langchain4j.tools.$tool_name` template for the name.
-
-
-For example, if the AI service looks like:
-
-[source,java]
-----
-@RegisterAiService(tools = EmailService.class)
-public interface PoemAiService {
-
-    @SystemMessage("You are a professional poet")
-    @UserMessage("Write a poem about {topic}. The poem should be {lines} lines long. Then send this poem by email.")
-    String writeAPoem(String topic, int lines);
-
-}
-----
-
-a tool that looks like:
-
-[source,java]
-----
-@ApplicationScoped
-public class EmailService {
-
-    @Inject
-    Mailer mailer;
-
-    @Tool("send the given content by email")
-    public void sendAnEmail(String content) {
-        Log.info("Sending an email: " + content);
-        mailer.send(Mail.withText("sendMeALetter@quarkus.io", "A poem for you", content));
-    }
-
-}
-----
-
-and invocation of the AI service that looks like:
-
-[source,java]
-----
-@Path("/email-me-a-poem")
-public class EmailMeAPoemResource {
-
-    private final MyAiService service;
-
-    public EmailMeAPoemResource(MyAiService service) {
-        this.service = service;
-    }
-
-    @GET
-    public String emailMeAPoem() {
-        return service.writeAPoem("Quarkus", 4);
-    }
-
-}
-----
-
-then an example trace is:
-
-image::trace.png[width=1000,align="center"]
-
-In the trace above we can see the parent span, which corresponds to the handling of the GET HTTP request, but the really
-interesting thing is the `langchain4j.aiservices.MyAiService.writeAPoem` span, which corresponds to the invocation of the AI service.
-The child spans of this span correspond (from left to right) to calling the OpenAI API, invoking the `sendAnEmail` tool, and finally calling the OpenAI API again.
-
-==== Custom span data
-If you need custom span data, add a bean implementing `ChatModelSpanContributor`:
-[source,java]
-----
-import io.quarkiverse.langchain4j.runtime.listeners.ChatModelSpanContributor;
-import dev.langchain4j.model.chat.listener.ChatModelErrorContext;
-import dev.langchain4j.model.chat.listener.ChatModelRequestContext;
-import dev.langchain4j.model.chat.listener.ChatModelResponseContext;
-import io.opentelemetry.api.trace.Span;
-
-@ApplicationScoped
-public class CustomSpanDataContributor implements ChatModelSpanContributor {
-    public void onRequest(ChatModelRequestContext requestContext, Span currentSpan) {
-        currentSpan.setAttribute("example", "request");
-    }
-
-    public void onResponse(ChatModelResponseContext responseContext, Span currentSpan) {
-        currentSpan.setAttribute("example", "response");
-    }
-
-    public void onError(ChatModelErrorContext errorContext, Span currentSpan) {
-        currentSpan.setAttribute("example", "failure");
-    }
-}
-----
-
-==== LangFuse
-
-Traces can be exported to link:https://langfuse.com[LangFuse] simply by making the following configuration in `application.properties`:
-
-[source,properties]
-----
-quarkus.otel.exporter.otlp.headers=Authorization=Basic <base64 of public:key>
-quarkus.otel.exporter.otlp.endpoint=https://cloud.langfuse.com/api/public/otel
-quarkus.otel.exporter.otlp.traces.protocol=http/protobuf
-----
-
-=== Auditing
-
-The extension lets users audit the execution of an AI service by observing regular CDI events. The following example shows a class that observes every event.
-
-NOTE: These methods do not all need to live in the same class and the name of the class and the methods do not matter. It is only shown this way for demonstration purposes.
-
-[source,java]
-----
-@ApplicationScoped
-public class AuditingListener {
-    public void initialMessagesCreated(@Observes InitialMessagesCreatedEvent initialMessagesCreatedEvent) {
-        // Invoked when the original user and system messages have been created
-    }
-
-    public void llmInteractionComplete(@Observes LLMInteractionCompleteEvent llmInteractionCompleteEvent) {
-        // Invoked when the final result of the AiService method has been computed
-    }
-
-    public void llmInteractionFailed(@Observes LLMInteractionFailureEvent llmInteractionFailureEvent) {
-        // Invoked when there was an exception computing the result of the AiService method
-    }
-
-    public void responseFromLLMReceived(@Observes ResponseFromLLMReceivedEvent responseFromLLMReceivedEvent) {
-        // Invoked with a response from an LLM.
-        // It is important to note that this can be invoked multiple times when tools exist.
-    }
-
-    public void toolExecuted(@Observes ToolExecutedEvent toolExecutedEvent) {
-        // Invoked after a tool requested by the LLM has been executed.
-        // It is important to note that this can be invoked multiple times when tools exist.
-    }
-}
-----
-
 == Working with images
 
 An _AI Service_ can also be used when working with images, both to describe an image and to generate one.
@@ -551,3 +370,7 @@ Generating images with an _AI Service_ comes with restrictions compared to text
 * Although auditing does work, it is however limited
 * Guardrails only work on the input
 ====
+
+== Observability
+
+Refer to link:https://docs.quarkiverse.io/quarkus-langchain4j/dev/observability.html[the Observability page] to learn how to trace LangChain4j applications.
diff --git a/docs/modules/ROOT/pages/observability.adoc b/docs/modules/ROOT/pages/observability.adoc