Only construct Observation.Context for Chat and Vector ops on Model observations #1661

jxblum · 2024-11-04T00:02:36Z

Many of the AI provider ChatModels prematurely construct an Observation.Context when ChatModel Observations may not even be enabled (as defined by Micrometer in ObservationDocumentation and Observation).

Simplify and polish code in OpenAI using the API with Spring constructs and Builders.
Simplify common code expressions used in AI provider ChatModels with ValueUtils.
Apply whitespace to improve readability.

…bservations. * Simplify and polish code in OpenAI using the API with Spring constructs and Builders. * Simplify common code expressions used in AI provider ChatModels with ValueUtils. * Apply whitespace to improve readability. Closes spring-projects#1661

markpollack · 2024-11-04T15:51:13Z

I'm a bit nervous to get this in before Nov 11, but nevertheless do want to know about best practices here - there are many it seems that are a bit opaque to me. How expensive is it to create an ObservationContext? @ThomasVitale @jonatan-ivanov ?

jonatan-ivanov · 2024-11-04T22:07:44Z

...s/spring-ai-anthropic/src/main/java/org/springframework/ai/anthropic/AnthropicChatModel.java

 		ChatCompletionRequest request = createRequest(prompt, false);

-		ChatModelObservationContext observationContext = ChatModelObservationContext.builder()
+		Supplier<ChatModelObservationContext> observationContext = () -> ChatModelObservationContext.builder()


How about calling it observationContextSupplier? (There are a few other occasions.)

The point of passing the context via a Supplier when you create an Observation is exactly this; when observations are "off" (noop), it will not call the supplier so no unnecessary context objects will be created. 👍🏼

In this particular case though, since the request can be different for every call, this replaces creating a new context to creating a new Supplier which I think might be more lightweight but there is a bit of additional complexity through the noop check + insanceof + cast + using Optional so which one is more lightweight, I'm not sure, only JMH can tell I guess.

I am fine with whatever name we use. In some cases (such as Wrapper types or Wrapper-like types, e.g. Supplier, or Optional), I simply use, or prefer, the name of the thing it wraps.

My reasoning is also similar in effect to List<User> users vs. List<User> userList. I like users, particularly if the collection-type might change (e.g. Set)

jonatan-ivanov · 2024-11-04T22:20:01Z

...-core/src/main/java/org/springframework/ai/chat/observation/ChatModelObservationSupport.java

+
+		// Avoid unnecessary construction of an Optional if Observations are not enabled
+		// (aka NOOP).
+		return Observation.NOOP.equals(observation) ? Optional.empty()


There is an isNoop, when would null be passed to this method?
This might be simplified as:

public static Optional<ChatModelObservationContext> getObservationContext(Observation observation) { return observation.isNoop() : Optional.empty() ? Optional.of((ChatModelObservationContext) observation.getContext()); }

or

public static <T> Optional<T> getObservationContext(Observation observation) { return observation.isNoop() : Optional.empty() ? Optional.of((T) observation.getContext()); }

Honestly, I went back and forth on this.

I realize that in the context of Micrometer, the Observation is likely to never to be null. But, arguably, contracts can change (or break). So, over many years of systems development, I learned to code defensively... assume the worse, fail gracefully and deliberately, not unexpectedly.

GENERAL THINKING:

This will never fail (providing the equals(..) method is coded correctly), where as, object.someMethod(..) can throw a potential NPE at runtime if our library is not used properly, even despite the "nullability" contract. Additionally, the "nullability" contract is not always apparent (despite implied or explicit annotations, which rely on tooling to do the right thing, e.g. warn the user). The support class does not make any assumptions about how the Observation was created (e.g. with a ObservationDocumentation, or otherwise, or even at all). The point is, we don't know how our method will be called.

This same principle applies to methods that return arrays or collections. It is a recommended practice for classes to never return a null array or collection, but rather an empty one. Still, I never take that for granted and assume a 3rd party library (or even my own code; mistakes happen) will always do the right thing, so I guard against the possibility of null arrays/collections.

For example:

List<User> users = userService.findByName("..."); nullSafeList(users).forEach(user -> ...);

Where:

static <T> List<T> nullSafeList(List<T> list) { return list != null ? list : Collections.emptyList(); }

Rather than:

List<User> users = userService.findByName("..."); users.forEache(user -> ...);

The later is bad if the contract of UserService.findByName(..):List<User> is broken, returning null instead of empty.

This might seem like paranoia, until it isn't.

Safety first.

jonatan-ivanov · 2024-11-04T22:21:54Z

...s/spring-ai-anthropic/src/main/java/org/springframework/ai/anthropic/AnthropicChatModel.java

+			ChatResponse chatResponse = toChatResponse(completionEntity.getBody());

-				observationContext.setResponse(chatResponse);
+			ChatModelObservationSupport.getObservationContext(observation)


You can do this instead (might be simpler):

if (!observation.isNoop()) { ((ChatModelObservationContext)observation.getContext()).setResponse(chatResponse); }

I generally agree, but I'd like to see something like this in Micrometer:

interface Observation { static boolean isNoop(Observation observation) { return observation == null || observation.isNoop(); } static boolean isNotNoop(Observation observation) { return !isNoop(observation); } // ... }

jonatan-ivanov · 2024-11-04T22:30:32Z

...-core/src/main/java/org/springframework/ai/chat/observation/ChatModelObservationSupport.java

+					.map(ChatModelObservationContext.class::cast);
+	}
+
+	public static Consumer<ChatResponse> setChatResponseInObservationContext(@Nullable Observation observation) {


This might be simplified as:

public static Consumer<ChatResponse> setChatResponseInObservationContext(Observation observation) { return chatResponse -> { if (!observation.isNoop()) { ((ChatModelObservationContext) observation.getContext()).setResponse(chatResponse) } }; }

Same comment as above.

jonatan-ivanov · 2024-11-04T22:52:14Z

models/spring-ai-openai/src/main/java/org/springframework/ai/openai/OpenAiChatModel.java

 						ChatCompletionMessage.Role.valueOf(message.getMessageType().name())));
 			}
-			else if (message.getMessageType() == MessageType.ASSISTANT) {
+			else if (MessageType.ASSISTANT.equals(messageType)) {


Is this an enum? If so, wouldn't == be ok (/faster)?

Yes, it is an enum. However, it is always safer to use equals(:Object) than ==. Besides, a properly constructed equals(:Object) method covers == anyway. Let the JIT compiler optimize.

Additionally, while it may not apply in this context, serialization will most certainly invalidate the use of ==.

I'd argue that == should only be used for primitive types, and rarely, if ever Objects.

jonatan-ivanov · 2024-11-04T22:57:15Z

...-core/src/main/java/org/springframework/ai/chat/observation/ChatModelObservationSupport.java

+ * @see Observation
+ * @since 1.0.0
+ */
+public abstract class ChatModelObservationSupport {


I guess this cannot be package-private but should we somehow indicate that this is for internal use?
Also, there might be another Observations-related util class iirc.

Agreed. Documented and moved to the o.s.ai.chat.observation.support package.

jonatan-ivanov · 2024-11-04T23:00:03Z

...main/java/org/springframework/ai/vectorstore/observation/AbstractObservationVectorStore.java

+		});
+	}
+
+	private Optional<VectorStoreObservationContext> getObservationContext(@Nullable Observation observation) {


ChatModelObservationSupport can be parameterized so it will work for multiple types, This might not be needed.

I thought about that, too. In the end. I kept Chat and Vector ObservationContext treatment separate for now (i.e. no generics), which are the only 2 use cases I know of (ATM).

jxblum · 2024-11-05T17:40:39Z

@markpollack

I'm a bit nervous to get this in before Nov 11, but nevertheless do want to know about best practices here - there are many it seems that are a bit opaque to me. How expensive is it to create an ObservationContext? @ThomasVitale @jonatan-ivanov ?

Always your call, of course, but I'd hope that most concerns were covered by tests, which I always run in entirety after each change I make (PR I submit), in addition to adding additional test coverage where needed. :)

…bservations. * Simplify and polish code in OpenAI using the API with Spring constructs and Builders. * Simplify common code expressions used in AI provider ChatModels with ValueUtils. * Apply whitespace to improve readability. Closes spring-projects#1661

markpollack · 2025-03-13T12:42:04Z

@jonatan-ivanov is this in a state to be merged?

jonatan-ivanov · 2025-03-13T18:05:47Z

models/spring-ai-minimax/src/main/java/org/springframework/ai/minimax/MiniMaxChatModel.java

+				logger.warn("No choices returned for prompt: {}, because: {}}", prompt,
+						chatCompletion.baseResponse().message());


Is this valid?
Should not it be one } less at the end?

Suggested change

logger.warn("No choices returned for prompt: {}, because: {}}", prompt,

chatCompletion.baseResponse().message());

logger.warn("No choices returned for prompt: {}, because: {}", prompt,

chatCompletion.baseResponse().message());

If so, can we fix it in this PR?

Hi @jonatan-ivanov - I am not following your question here.

In source, currently (line numbers changed a bit now since I last submitted this PR), there is already 1 too many } in the logger statement.

The change I made removes the extra (unnecessary) } at the end.

I'm talking about a different line: https://github.com/spring-projects/spring-ai/pull/1661/files#diff-4b360f06ee830dd00855e2fd3e0d00b78a3aa23ca4257145e71cd2d03a7f1caeR244

Gosh, it has been so long since I looked at this PR, I had to open my IDE to see the precise changes I made since IJ does a better job at highlighting the "actual" changes vs. what appears in GitHub.

Even in my PR, though it would appear with Git blame that I changed the lines (which is technically due to formatting), I did not introduce the extra } in the logger statement. In fact, I did not even change either logging statement (2) in this method.

Once again, if you look at the actual, "current" source line compared to the same line in my topic branch for the PR (with no Git blame), they are precisely the same.

Apparently, I did not change either logger statement at all.

What did change is:

I introduced a observation local variable (here) to make this less busy.

Additionally, I introduced a space after the choices List to be consistent with the separation between the varCompletion and subsequent if block, above. See original source have no space.

This logger statement should be fixed though.

FTR, and for additional clarity, this is what I see in IJ (on my topic branch pr/observation for this PR):

So can we fix line 244 (on the screenshot) in this PR too?

jonatan-ivanov · 2025-03-13T19:31:07Z

models/spring-ai-openai/src/main/java/org/springframework/ai/openai/OpenAiChatModel.java

-				}).toList();
+			List<Generation> generations = choices.stream().map(choice -> {
+				Map<String, Object> metadata = choiceMetadata(choice, chatCompletion.id(),
+					ValueUtils.defaultToEmptyString(choice.message().role(), OpenAiApi.ChatCompletionMessage.Role::name));


Doing the null check inside and outside of this method depending on the field threw me over a bit, would not it be better to use choice as the parameter?

I don't understand your comment.

Bear with me since line numbers in the source (now) have significantly changed since the PR, but all I was doing here was...

Simplify the Map construction nested in the Lambda (original source), and refactoring into a method, thereby simplifying and making the Lambda more readable.

What puzzled me that we are doing null checks and default setting inside of choiceMetadata for certain values AND we are also doing null checks and default setting here when we call choiceMetadata (outside of it). As a user of choiceMetadata, it's not trivial what parameters should I null-check-and-set-default and what I should not since choiceMetadata will take care about them.

jonatan-ivanov · 2025-03-13T19:35:52Z

...main/java/org/springframework/ai/vectorstore/observation/AbstractObservationVectorStore.java

 	}

 	@Override
+	@SuppressWarnings("all")


Can we suppress just what is necessary?

That would be nice. Unfortunately, Java falls short on this one. :(

I often wondered the same thing, and of course, there is no shortage of advice on the topic; for example - see first answer.

This is the compiler warning (null check) visible in IJ (not sure about Eclipse):

As you can see, using the "suggestion" from SO of @SuppressWarnings("null"), the "null check" compiler warning is still present leaving the code highlighted in the IDE.

At last, when we review the JLS for Java 17 on @SuppressWarnings annotation, there is no "null" value for @SuppressWarnings and the reason the compiler warning persists.

For good measure, I even tried @SuppressWarnings("unchecked"). That creates a "Redundant expression" issue IJ. Gah! o.O

So both IJ and javac honor @SuppressWarnings("all") for better or worse.

markpollack · 2025-09-28T22:36:57Z

hi. this is quite old - which is my fault and we have other higher priority issues with observability compraed to what this is addressing. i'm going to close it. sorry it didn't work out john... my bad.

jxblum force-pushed the pr/observation branch from dc799cd to 8969261 Compare November 4, 2024 00:03

jonatan-ivanov reviewed Nov 4, 2024

View reviewed changes

jxblum force-pushed the pr/observation branch from 8969261 to b531589 Compare November 5, 2024 18:38

jonatan-ivanov approved these changes Mar 13, 2025

View reviewed changes

markpollack added the code cleanup label Sep 28, 2025

markpollack closed this Sep 28, 2025

		logger.warn("No choices returned for prompt: {}, because: {}}", prompt,
		chatCompletion.baseResponse().message());

Only construct Observation.Context for Chat and Vector ops on Model observations #1661

Only construct Observation.Context for Chat and Vector ops on Model observations #1661

Uh oh!

Conversation

jxblum commented Nov 4, 2024

Uh oh!

markpollack commented Nov 4, 2024

Uh oh!

jonatan-ivanov Nov 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jxblum Nov 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jxblum Nov 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jxblum Nov 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jxblum Nov 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jxblum Nov 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jxblum commented Nov 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

markpollack commented Mar 13, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jxblum Mar 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jxblum Mar 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

jonatan-ivanov Nov 4, 2024 •

edited

Loading

jxblum Nov 5, 2024 •

edited

Loading

jxblum Nov 5, 2024 •

edited

Loading

jxblum Nov 6, 2024 •

edited

Loading

jxblum Nov 5, 2024 •

edited

Loading

jxblum Nov 5, 2024 •

edited

Loading

jxblum commented Nov 5, 2024 •

edited

Loading

jxblum Mar 13, 2025 •

edited

Loading

jxblum Mar 13, 2025 •

edited

Loading

jxblum Mar 14, 2025 •

edited

Loading