Define API to capture metadata from AI responses #98

jxblum · 2023-11-14T01:44:47Z

This PR defines a new, strongly-typed API in Spring AI for capturing AI metadata and metrics sent in an AI response to a Prompt from an AI provider's (REST) API.

This new API includes both AI model usage metrics, such as Prompt and Generation (completion) token counts, along with AI provider access metrics, such as rate limits for both requests and tokens.

High-level feature additions in this PR include, but are not limited to:

* New AiMetadata, RateLimit and Usage interfaces making up the API.
* AiResponse now includes (optional) AiMetadata (AiMetadata.EMPTY by default).
* Implementation of the new API with OpenAI.
* A new method of testing AI provider REST API endpoints using OkHttp3 MockWebServer, Spring MockMvc and a test-class-specific @RestController to mock the AI provider's API.

For example, you can now do something like the following:

  AiResponse response = aiClient.generate(prompt);

  // process the AI's response (such as chat completion)

  AiMetadata metadata = response.getMetadata();

  long totalTokenCount = metadata.getUsage().getTotalTokens();

  // do something responsible with this information

To see a complete example, have a look at the test.

In this API, I preferred strongly-typed objects (for example, AiMetadata) over storing key-value pairs in the Map<String, Object> objects present in the AiResponse and Generation classes, since strong typing provides 1) type safety, 2) easier, more descriptive and programmatic access that allows for things like type conversion, encoding/decoding, etc., and 3) more immediately apparent metadata available from an AI provider that is uniformly accessible from Spring AI.

While this API may be more restrictive, or only capable of supporting the lowest-common denominator (LCD), we can always include support for free-form metadata, such as in the following example, which is not uncommon in Spring when you consider the PropertyResolver API, for instance:

aiMetadata.getPropertyAs("propertyName", SomeType.class);

Subclassing will also give users the ability to access AI provider-specific metadata.

In short, it really should not matter to the Spring AI developer whether metadata is stored internally in a Map<?, ?>, or by some other means.
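To make the point above concrete, here is a minimal sketch of typed access over Map-backed storage. The class names MapBackedMetadata and ExampleProviderMetadata, and the "seed" property, are hypothetical and are not part of this PR; only the getPropertyAs(name, type) call shape comes from the example above.

```java
import java.util.Map;

// Hypothetical base class (not from the PR): values are stored in a Map,
// but callers only ever use typed accessors.
class MapBackedMetadata {

    private final Map<String, Object> properties;

    MapBackedMetadata(Map<String, Object> properties) {
        // Defensive, immutable copy so the metadata cannot change after creation.
        this.properties = Map.copyOf(properties);
    }

    // Returns the named property as the requested type, or null if it is absent.
    <T> T getPropertyAs(String name, Class<T> type) {
        Object value = properties.get(name);
        if (value == null) {
            return null;
        }
        if (!type.isInstance(value)) {
            throw new IllegalArgumentException(
                    "Property '" + name + "' is not of type " + type.getSimpleName());
        }
        return type.cast(value);
    }
}

// Hypothetical provider-specific subclass: adds one strongly-typed accessor
// on top of the free-form lookup.
class ExampleProviderMetadata extends MapBackedMetadata {

    ExampleProviderMetadata(Map<String, Object> properties) {
        super(properties);
    }

    // The "seed" key is an assumption made for illustration only.
    Long getSeed() {
        return getPropertyAs("seed", Long.class);
    }
}
```

A caller never sees the Map: the storage could later be swapped for fields or a parsed JSON tree without changing either accessor.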

TODO:

* Upon initial review and discussion with both @markpollack and @tzolov, I recommend this feature be integrated and conditionally enabled based on a Spring property (for example: spring.ai.openai.metadata.capture-enabled). Spring Boot's auto-configuration (by property using @ConditionalOnProperty) can help in this regard. - DONE
* Create other implementations of the AI metadata interfaces: Azure OpenAI, HuggingFace, etc.
* Further exploration and enhancements could include integration with and exposing this AI metadata in Spring Boot Actuator.
* In addition, there may be clearer integration points directly with Micrometer as well.

jxblum · 2023-11-14T02:22:42Z

Note, I made the commits granular in this PR so that 1) the changes were easier to combine or remove as necessary (Spring Boot style) and 2) so that you could follow the progression of development (thinking, direction) in this new feature.

jxblum · 2023-11-14T21:59:40Z

I also think there is additional room for improvement in this initial implementation, which can be addressed iteratively.

…sponse metadata. Closes spring-projects#98

jxblum · 2023-11-15T00:29:00Z

The source of information (metadata) pulled from an AI response during an AI request (Prompt) using OpenAI's API comes from:

The Chat Completion object.
Along with OpenAI's documentation on Rate Limits.
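As a rough illustration of the rate-limit half of that metadata, the sketch below reads the request and token limits from HTTP response headers. The header names are the x-ratelimit-* names from OpenAI's Rate Limits documentation; the RateLimitHeaders class, its parse method, and the use of a plain Map with lower-cased keys in place of a real HTTP response are assumptions made for this example, not the code in this PR.

```java
import java.util.Map;
import java.util.OptionalLong;

// Hypothetical helper (not from the PR) that extracts numeric rate-limit
// values from response headers.
final class RateLimitHeaders {

    static final String REQUESTS_LIMIT = "x-ratelimit-limit-requests";
    static final String REQUESTS_REMAINING = "x-ratelimit-remaining-requests";
    static final String TOKENS_LIMIT = "x-ratelimit-limit-tokens";
    static final String TOKENS_REMAINING = "x-ratelimit-remaining-tokens";

    private RateLimitHeaders() {
    }

    // Assumes header names were lower-cased by the caller; returns empty when
    // the header is missing, blank, or not a whole number.
    static OptionalLong parse(Map<String, String> headers, String name) {
        String value = headers.get(name);
        if (value == null || value.isBlank()) {
            return OptionalLong.empty();
        }
        try {
            return OptionalLong.of(Long.parseLong(value.trim()));
        }
        catch (NumberFormatException ex) {
            return OptionalLong.empty();
        }
    }
}
```

Returning OptionalLong rather than throwing keeps a malformed or missing header from failing the whole AI request, which matters because rate-limit metadata is supplementary to the response itself.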

…ata collection. Closes spring-projects#98

…y Spring AI. Closes spring-projects#98

markpollack · 2023-11-15T14:38:00Z

Note, I made the commits granular in this PR so that ...

Raising the bar! My PRs are typically a mess.

markpollack · 2023-11-15T14:49:45Z

spring-ai-core/src/main/java/org/springframework/ai/client/metadata/AiMetadata.java

+	};
+
+	default RateLimit getRateLimit() {
+		throw new IllegalStateException("No AI provider rate limit metadata was provided");


I think we could use the 'null object' pattern here instead of throwing an exception, since we want to promote portability and avoid devs having to write code to handle exceptions.

class RateLimit { public static final RateLimit NULL = new RateLimit(); ... }
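A slightly fuller sketch of that suggestion, adapted to interfaces, might look like the following. The accessor names on RateLimit are simplified assumptions rather than the PR's actual signatures; the point is only that the default getRateLimit() returns a neutral object instead of throwing.

```java
import java.time.Duration;

// Simplified stand-in for the PR's RateLimit interface; the accessor names
// here are assumptions for illustration.
interface RateLimit {

    // Null Object: reports neutral values so callers need neither null checks
    // nor exception handling.
    RateLimit NULL = new RateLimit() {
        @Override
        public long getRequestsLimit() {
            return 0L;
        }

        @Override
        public long getRequestsRemaining() {
            return 0L;
        }

        @Override
        public Duration getRequestsReset() {
            return Duration.ZERO;
        }
    };

    long getRequestsLimit();

    long getRequestsRemaining();

    Duration getRequestsReset();
}

// Simplified stand-in for AiMetadata: providers without rate-limit data
// inherit the Null Object by default.
interface AiMetadata {

    default RateLimit getRateLimit() {
        return RateLimit.NULL;
    }
}
```

Code that needs to distinguish "no data" from "limit of zero" can still compare against RateLimit.NULL by identity.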

markpollack · 2023-11-15T15:07:35Z

spring-ai-core/src/main/java/org/springframework/ai/client/metadata/Usage.java

+ * @author John Blum
+ * @since 0.7.0
+ */
+public interface Usage {


I think it is possible to go with the LCD approach that is type safe, though we need more investigation.

Here is some analysis. The Hugging Face one is the most different since it supports so many models.

One idea I had was to provide some sort of type-safe, client-specific helper class that, given a HashMap, would return type-safe results for devs who want to access the raw return information, almost as if they were using the vendor-specific API.

For example, here is the 'Details' class from the Hugging Face client.

  @JsonProperty("best_of_sequences")
  private List<BestOfSequence> bestOfSequences = null;

  @JsonProperty("finish_reason")
  private FinishReason finishReason = null;

  @JsonProperty("generated_tokens")
  private Integer generatedTokens = null;

  @JsonProperty("prefill")
  private List<PrefillToken> prefill = new ArrayList<>();

  @JsonProperty("seed")
  private Long seed = null;

  @JsonProperty("tokens")
  private List<Token> tokens = new ArrayList<>();

Interestingly, they don't separate prompt tokens from generated tokens, though we might be able to calculate the prompt tokens in our own client and then subtract them from the total to estimate the number of generated tokens.

And for the Azure OpenAI client:

  @Immutable
  public final class CompletionsUsage {

      /*
       * The number of tokens generated across all completions emissions.
       */
      @Generated
      @JsonProperty(value = "completion_tokens")
      private int completionTokens;

      /*
       * The number of tokens in the provided prompts for the completions request.
       */
      @Generated
      @JsonProperty(value = "prompt_tokens")
      private int promptTokens;

      /*
       * The total number of tokens processed for the completions request and response.
       */
      @Generated
      @JsonProperty(value = "total_tokens")
      private int totalTokens;

      // ...
  }

And from the Theo Kanning OpenAI client:

  public class Usage {

      @JsonProperty("prompt_tokens")
      long promptTokens;

      @JsonProperty("completion_tokens")
      long completionTokens;

      @JsonProperty("total_tokens")
      long totalTokens;

      // ...
  }
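One possible lowest-common-denominator shape covering the three clients quoted above is sketched here. The method names and both factory methods are assumptions for illustration, not the interface as merged; fromTotal shows the subtraction idea for a provider that does not split prompt and generated tokens.

```java
// Hypothetical LCD Usage interface: every provider can report, or at least
// estimate, prompt and generation token counts.
interface Usage {

    long getPromptTokens();

    long getGenerationTokens();

    // Derived by default, so providers that report only the two parts
    // need not supply a total.
    default long getTotalTokens() {
        return getPromptTokens() + getGenerationTokens();
    }

    // For providers that report both counts directly (the OpenAI-style shape).
    static Usage of(long promptTokens, long generationTokens) {
        return new Usage() {
            @Override
            public long getPromptTokens() {
                return promptTokens;
            }

            @Override
            public long getGenerationTokens() {
                return generationTokens;
            }
        };
    }

    // For providers that expose only a total: the prompt count is computed
    // client-side and the generation count is estimated by subtraction,
    // clamped at zero in case the local estimate overshoots.
    static Usage fromTotal(long totalTokens, long estimatedPromptTokens) {
        return of(estimatedPromptTokens, Math.max(0L, totalTokens - estimatedPromptTokens));
    }
}
```

Note that a Usage built with fromTotal reports a total that may differ from the provider's figure when the clamp applies, so any estimate of this kind should be documented as approximate.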

some

Agreed, and an excellent point. I will investigate on this topic more. Thank you for the feedback and references.

I also like the general idea of a wrapper class (around the [Hash]Map) for AI provider specific metadata.

markpollack · 2023-11-15T15:12:53Z

spring-ai-core/src/main/java/org/springframework/ai/client/metadata/RateLimit.java

+ * @author John Blum
+ * @since 0.7.0
+ */
+public interface RateLimit {


Similar to the comment on Usage, I'd like to compare what is out there. This information is a bit harder to find from a quick Google search than the usage information was.

markpollack · 2023-11-15T15:14:14Z

...ng-ai-openai/src/main/java/org/springframework/ai/openai/client/metadata/OpenAiMetadata.java

+		this(id, usage, null);
+	}
+
+	protected OpenAiMetadata(String id, OpenAiUsage usage, @Nullable OpenAiRateLimit rateLimit) {


Could we avoid the @Nullable if we use the null object pattern?

markpollack · 2023-11-15T15:15:24Z

spring-ai-openai/src/main/java/org/springframework/ai/openai/client/OpenAiClient.java

+			Generation generation = new Generation(chatMessage.getContent(), Map.of("role", chatMessage.getRole()));
+			generations.add(generation);
+		}
+		return new AiResponse(generations, OpenAiMetadata.from(chatCompletionResult));


One area for the future is to have some sort of stats collection that can be sent to a dashboard. Adding in Micrometer and a Grafana dashboard could be a relatively easy win to help folks get a handle on costs.

Agreed. My initial reaction was to start minimally with Spring Boot Actuator, particularly as I am most familiar with Actuator. Perhaps we can loop in the Micrometer team for thoughts here as well.
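As a dependency-free sketch of what such stats collection could look like, the class below accumulates token counts across responses and derives a cost estimate. TokenUsageStats and its methods are hypothetical, the per-1,000-token prices are caller-supplied placeholders, and a real integration would more likely publish these values through Micrometer or Actuator rather than hold them in a plain object.

```java
import java.util.concurrent.atomic.LongAdder;

// Hypothetical in-memory accumulator (not from the PR); a stand-in for the
// counters a metrics library would provide.
final class TokenUsageStats {

    // LongAdder keeps updates cheap when many threads record usage at once.
    private final LongAdder promptTokens = new LongAdder();

    private final LongAdder generationTokens = new LongAdder();

    // Call once per AI response with the counts from its usage metadata.
    void add(long prompt, long generation) {
        promptTokens.add(prompt);
        generationTokens.add(generation);
    }

    long totalTokens() {
        return promptTokens.sum() + generationTokens.sum();
    }

    // Prices are per 1,000 tokens and are supplied by the caller; no real
    // provider pricing is assumed here.
    double estimatedCost(double pricePer1kPrompt, double pricePer1kGeneration) {
        return promptTokens.sum() / 1000.0 * pricePer1kPrompt
                + generationTokens.sum() / 1000.0 * pricePer1kGeneration;
    }
}
```

Keeping prompt and generation counts separate matters because providers commonly price the two differently.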

markpollack · 2023-11-15T15:18:26Z

...utoconfigure/src/main/java/org/springframework/ai/autoconfigure/openai/OpenAiProperties.java

 		this.baseUrl = baseUrl;
 	}

+	public String getEmbeddingApiKey() {


How did the need for this come about? Can one have a different API key for embedding vs generation/inference?

My apologies for the confusion, but I rearranged the order of the state variables in the code and that is why this appears in my commit. Here is the original definition of OpenAiProperties.

It was an organizational thing.

markpollack · 2023-11-15T15:20:47Z

spring-ai-openai/src/test/java/org/springframework/ai/openai/OpenAiMockTestConfiguration.java

+@SpringBootConfiguration
+@Profile("spring-ai-openai-mocks")
+@SuppressWarnings("unused")
+public class OpenAiMockTestConfiguration {


Could this be done at a higher level, based on the AIClient interface rather than vendor-specific interfaces? The end goal is easy mocking that is also portable across model providers.

Agreed on making this (potentially) easier for our developers to use in their testing efforts. I will need to think on this more carefully. In the meantime, I was simply addressing my infrastructure and framework testing needs.

I did call out having a spring-ai-test module in our design document as something to help developers along these lines. I think over time, especially with a few more implementations of this AI metadata model for different AI providers (e.g. Azure OpenAI and Hugging Face) under our belt, we can nail down the reusable testing components.

markpollack · 2023-11-15T15:21:13Z

...igure/src/main/java/org/springframework/ai/autoconfigure/openai/OpenAiAutoConfiguration.java

+		OkHttpClient.Builder clientBuilder = new OkHttpClient.Builder(OpenAiService.defaultClient(apiKey, duration));
+
+		if (properties.getMetadata().isRateLimitMetricsEnabled()) {
+			clientBuilder.addInterceptor(new OpenAiHttpResponseHeadersInterceptor());


…data in GenerationMetadata. Now, instead of throwing an IllegalStateException, Spring AI returns a Null Object implementation of the RateLimit and Usage metadata from GenerationMetadata. In addition, Spring AI provides abstract base classes to conveniently implement the RateLimit and Usage interfaces for new AI clients. Closes spring-projects#98

Closes spring-projects#98

jxblum · 2023-11-15T18:20:19Z

Note, I made the commits granular in this PR so that ...

Raising the bar! My PRs are typically a mess.

Thank you @markpollack. I appreciate your feedback and review on this PR.

I addressed most of your concerns and feedback across a few new commits already. Specifically, I did the following:

Renamed AiMetadata to GenerationMetadata.
Repackaged the AI metadata under org.springframework.ai.metadata.
Implemented the NULL Object pattern for RateLimit and Usage interfaces and returned the NULL value objects from GenerationMetadata instead of throwing an IllegalStateException.
Edited the Javadoc and documentation to match the API changes.

I am going to continue by building an implementation of the AI metadata API for Azure OpenAI and possibly Hugging Face.

Closes spring-projects#98

jxblum · 2023-11-15T23:56:05Z

I completed an initial implementation of the AI metadata API for Microsoft Azure OpenAI Service.

Additionally, I rebased this PR on the latest changes from main so that this PR remains in a buildable and shippable state.

See spring-projects#98

…sponse metadata. See spring-projects#98

…ata collection. See spring-projects#98

…y Spring AI. See spring-projects#98

See spring-projects#98

…data in GenerationMetadata. Now, instead of throwing an IllegalStateException, Spring AI returns a Null Object implementation of the RateLimit and Usage metadata from GenerationMetadata. In addition, Spring AI provides abstract base classes to conveniently implement the RateLimit and Usage interfaces for new AI clients. See spring-projects#98

See spring-projects#98

* Define GenerationMetadata property in AiResponse.
* Add OpenAI implementations of AiMetadata, RateLimit and Usage interfaces.
* Add REST Assured JsonPath dependency to spring-ai-openai module.
* Add OkHttp dependency to spring-ai-openai module.
* Add OkHttp Interceptor to parse OpenAI rate limit metadata from HTTP headers.
* Add OkHttp MockWebServer dependency to spring-ai-openai module, test scope.
* Add Jakarta Servlet API dependency to spring-ai-openai module, test scope.
* Add Spring Web MVC dependency to spring-ai-open-ai module, test scope.
* Define OpenAI API response headers in an Enum.
* Add OpenAI test configuration using mock objects.
* Add integration test to assert successful extraction of OpenAI API response metadata.
* Include Spring Boot auto-configuration for (conditional) OpenAI metadata collection.
* Edit documentation and include information on AI metadata collected by Spring AI.
* Provide AI metadata implementation for Microsoft Azure OpenAI Service.
* Capture optional PromptMetadata in AiResponse.
* Define metadata for an AI generation choice.
* Capture AI choice metadata in Generation.
* Integrate ChoiceMetadata into AiResponse returned by OpenAI.

Fixes spring-projects#98

markpollack · 2023-11-21T22:29:40Z

removed some of the older fields that were intended to capture metadata about requests.

merged as 37a4884

jxblum changed the title ~~Define API for capturing AI metadata from AI responses~~ Define API to capture AI metadata from AI responses Nov 14, 2023

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 14, 2023

Add integration test to assert successful extraction of OpenAI API re…

f6ec9a6

…sponse metadata. Closes spring-projects#98

jxblum force-pushed the pr/ai-metadata branch from 308cf66 to f6ec9a6 Compare November 14, 2023 23:16

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 15, 2023

Add integration test to assert successful extraction of OpenAI API re…

f418362

…sponse metadata. Closes spring-projects#98

jxblum force-pushed the pr/ai-metadata branch from f6ec9a6 to f418362 Compare November 15, 2023 00:13

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 15, 2023

Include Spring Boot auto-configuration for (conditional) OpenAI metad…

c7502da

…ata collection. Closes spring-projects#98

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 15, 2023

Edit documentation and include information on AI metadata collected b…

99f031f

…y Spring AI. Closes spring-projects#98

jxblum changed the title ~~Define API to capture AI metadata from AI responses~~ Define API to capture metadata from AI responses Nov 15, 2023

markpollack reviewed Nov 15, 2023

View reviewed changes

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 15, 2023

Add integration test to assert successful extraction of OpenAI API re…

fa73db8

…sponse metadata. Closes spring-projects#98

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 15, 2023

Include Spring Boot auto-configuration for (conditional) OpenAI metad…

0c9b35d

…ata collection. Closes spring-projects#98

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 15, 2023

Edit documentation and include information on AI metadata collected b…

ee5b965

…y Spring AI. Closes spring-projects#98

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 15, 2023

Rename AiMetadata to GenerationMetadata.

a2d27d3

Closes spring-projects#98

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 15, 2023

Repackage AI metadata under org.springframework.ai.metadata.

7ac2060

Closes spring-projects#98

jxblum force-pushed the pr/ai-metadata branch from 99f031f to 7ac2060 Compare November 15, 2023 18:02

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 15, 2023

Edit documentation on AI metadata to match API.

15e9224

Closes spring-projects#98

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 15, 2023

Edit documentation on AI metadata to match API.

6dd3159

Closes spring-projects#98

jxblum force-pushed the pr/ai-metadata branch from 15e9224 to 6dd3159 Compare November 15, 2023 18:27

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 16, 2023

Add integration test to assert successful extraction of OpenAI API re…

5cf0d04

…sponse metadata. Closes spring-projects#98

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 16, 2023

Include Spring Boot auto-configuration for (conditional) OpenAI metad…

97ecc3d

…ata collection. Closes spring-projects#98

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 16, 2023

Edit documentation and include information on AI metadata collected b…

146eb8b

…y Spring AI. Closes spring-projects#98

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 16, 2023

Rename AiMetadata to GenerationMetadata.

c2397cf

Closes spring-projects#98

jxblum added 18 commits November 20, 2023 19:47

Add OkHttp MockWebServer dependency to spring-ai-openai module.

df6513e

Add Jakarta Servlet API dependency to spring-ai-openai module.

b18d0cb

Add Spring Web MVC dependency to spring-ai-open-ai module.

90f4395

Define OpenAI API response headers in an Enum.

1a0adf4

See spring-projects#98

Add OpenAI test configuration using mock objects.

d7d125b

See spring-projects#98

Add integration test to assert successful extraction of OpenAI API re…

b9af509

…sponse metadata. See spring-projects#98

Include Spring Boot auto-configuration for (conditional) OpenAI metad…

ddc57fe

…ata collection. See spring-projects#98

Edit documentation and include information on AI metadata collected b…

af118c2

…y Spring AI. See spring-projects#98

Rename AiMetadata to GenerationMetadata.

c7a1364

See spring-projects#98

Repackage AI metadata under org.springframework.ai.metadata.

714db57

See spring-projects#98

Edit documentation on AI metadata to match API.

ea9cea8

See spring-projects#98

Provide AI metadata implementation for Microsoft Azure OpenAI Service.

2c994e2

See spring-projects#98

Define metadata for an AI prompt.

9e16051

See spring-projects#98

Rename getMetadata() method to getGenerationMetadata() in AiResponse.

583fdc2

See spring-projects#98

Capture optional PromptMetadata in AiResponse.

f0ccfda

See spring-projects#98

Capture prompt metadata from Azure OpenAI.

0babe21

See spring-projects#98

Define metadata for an AI generation choice.

402e847

See spring-projects#98

jxblum force-pushed the pr/ai-metadata branch from 86329df to 9a14aca Compare November 21, 2023 03:57

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 21, 2023

Capture AI generation choice metadata in Generation.

76ffaee

See spring-projects#98

jxblum added a commit to jxblum/spring-ai that referenced this pull request Nov 21, 2023

Integrate ChoiceMetadata into AiResponse returned by OpenAI.

9a14aca

See spring-projects#98

jxblum added 2 commits November 20, 2023 20:06

Capture AI choice metadata in Generation.

bf84a40

See spring-projects#98

Integrate ChoiceMetadata into AiResponse returned by OpenAI.

9ea9c42

See spring-projects#98

jxblum force-pushed the pr/ai-metadata branch from 9a14aca to 9ea9c42 Compare November 21, 2023 04:06

markpollack added this to the 0.8.0 milestone Nov 21, 2023

markpollack closed this Nov 21, 2023
