
Commit b86772d

ilayaperumalg authored and markpollack committed

Add reference doc for Usage changes

- Add documentation for the recent changes on Usage handling
- Add details on how to refer to model-specific native usage
- Update upgrade notes
1 parent 16a596f commit b86772d

File tree: 2 files changed, +130 -19 lines changed
Lines changed: 100 additions & 0 deletions
@@ -0,0 +1,100 @@
= Using Chat/Embedding Response Usage

== Overview

Spring AI has enhanced its Model Usage handling by introducing a `getNativeUsage()` method in the `Usage` interface and providing a `DefaultUsage` implementation.
This change simplifies how different AI models can track and report their usage metrics while maintaining consistency across the framework.

== Key Changes

=== Usage Interface Enhancement

The `Usage` interface now includes a new method:

```java
Object getNativeUsage();
```

This method allows access to the model-specific native usage data, enabling more detailed usage tracking when needed.
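For orientation, here is a simplified sketch of what the enhanced `Usage` contract looks like. It is abridged, not the full source: it shows only the accessors used in the example below, and the `Integer` return types are assumed from the M6 `DefaultUsage` change described in the upgrade notes.

```java
// Abridged sketch of the Usage contract, not the complete interface.
// Integer return types assume the M6 move from Long to Integer token counts.
public interface Usage {

    Integer getPromptTokens();

    Integer getCompletionTokens(); // formerly getGenerationTokens()

    Integer getTotalTokens();

    // New method: the provider-specific usage object, e.g. OpenAiApi.Usage for OpenAI
    Object getNativeUsage();
}
```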
=== Using with ChatClient

Here's a complete example showing how to track usage with OpenAI's ChatClient:

```java
@SpringBootConfiguration
public class Configuration {

    @Bean
    public OpenAiApi chatCompletionApi() {
        return new OpenAiApi(System.getenv("OPENAI_API_KEY"));
    }

    @Bean
    public OpenAiChatModel openAiClient(OpenAiApi openAiApi) {
        return new OpenAiChatModel(openAiApi);
    }

}

@Service
public class ChatService {

    private final OpenAiChatModel chatModel;

    public ChatService(OpenAiChatModel chatModel) {
        this.chatModel = chatModel;
    }

    public void demonstrateUsage() {
        // Create a chat prompt
        Prompt prompt = new Prompt("What is the weather like today?");

        ChatClient chatClient = ChatClient.builder(this.chatModel).build();

        // Get the chat response through the ChatClient fluent API
        ChatResponse response = chatClient.prompt(prompt)
                .call()
                .chatResponse();

        // Access the usage information
        Usage usage = response.getMetadata().getUsage();

        // Get standard usage metrics
        System.out.println("Prompt Tokens: " + usage.getPromptTokens());
        System.out.println("Completion Tokens: " + usage.getCompletionTokens());
        System.out.println("Total Tokens: " + usage.getTotalTokens());

        // Access native OpenAI usage data with detailed token information
        if (usage.getNativeUsage() instanceof org.springframework.ai.openai.api.OpenAiApi.Usage) {
            org.springframework.ai.openai.api.OpenAiApi.Usage nativeUsage =
                    (org.springframework.ai.openai.api.OpenAiApi.Usage) usage.getNativeUsage();

            // Detailed prompt token information
            System.out.println("Prompt Tokens Details:");
            System.out.println("- Audio Tokens: " + nativeUsage.promptTokensDetails().audioTokens());
            System.out.println("- Cached Tokens: " + nativeUsage.promptTokensDetails().cachedTokens());

            // Detailed completion token information
            System.out.println("Completion Tokens Details:");
            System.out.println("- Reasoning Tokens: " + nativeUsage.completionTokenDetails().reasoningTokens());
            System.out.println("- Accepted Prediction Tokens: " + nativeUsage.completionTokenDetails().acceptedPredictionTokens());
            System.out.println("- Audio Tokens: " + nativeUsage.completionTokenDetails().audioTokens());
            System.out.println("- Rejected Prediction Tokens: " + nativeUsage.completionTokenDetails().rejectedPredictionTokens());
        }
    }
}
```

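If you are working with the `ChatModel` directly rather than through the `ChatClient` builder, the same metadata is available on the `ChatResponse` it returns. A minimal sketch, reusing the `chatModel` field and prompt from the example above:

```java
// Low-level alternative: call the model directly and read the same Usage metadata
ChatResponse response = this.chatModel.call(new Prompt("What is the weather like today?"));
Usage usage = response.getMetadata().getUsage();
System.out.println("Total Tokens: " + usage.getTotalTokens());
```
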
== Benefits

* **Standardization**: Provides a consistent way to handle usage across different AI models
* **Flexibility**: Supports model-specific usage data through the native usage feature
* **Simplification**: Reduces boilerplate code with the default implementation
* **Extensibility**: Easy to extend for specific model requirements while maintaining compatibility

=== Type Safety Considerations

When working with native usage data, consider type casting carefully:

```java
// Safe way to access native usage
if (usage.getNativeUsage() instanceof org.springframework.ai.openai.api.OpenAiApi.Usage) {
    org.springframework.ai.openai.api.OpenAiApi.Usage nativeUsage =
            (org.springframework.ai.openai.api.OpenAiApi.Usage) usage.getNativeUsage();
    // Work with native usage data
}
```
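On Java 17, Spring AI's baseline, pattern matching for `instanceof` provides the same safety with less ceremony. A brief sketch, reusing the `usage` variable from the example above:

```java
// Same check as above, using pattern matching for instanceof (Java 16+),
// so no explicit cast is needed
if (usage.getNativeUsage() instanceof org.springframework.ai.openai.api.OpenAiApi.Usage nativeUsage) {
    System.out.println("Cached Prompt Tokens: " + nativeUsage.promptTokensDetails().cachedTokens());
}
```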

spring-ai-docs/src/main/antora/modules/ROOT/pages/upgrade-notes.adoc

Lines changed: 30 additions & 19 deletions
@@ -3,23 +3,41 @@
 
 == Upgrading to 1.0.0.M6
 
+
 === Changes to Usage Interface and DefaultUsage Implementation
 
-* In the link:https://github.com/spring-projects/spring-ai/blob/main/spring-ai-core/src/main/java/org/springframework/ai/chat/metadata/Usage.java[Usage] interface, changed the method name from `getGenerationTokens` to `getCompletionTokens` as that is more common terminology in the industry.
-* In link:https://github.com/spring-projects/spring-ai/blob/main/spring-ai-core/src/main/java/org/springframework/ai/chat/metadata/DefaultUsage.java[DefaultUsage], changed parameter types from `Long` to `Integer` for all token-related fields:
+The `Usage` interface and its default implementation `DefaultUsage` have undergone the following changes:
+
+1. Method Rename:
+* `getGenerationTokens()` is now `getCompletionTokens()`
+
+2. Type Changes:
+* All token count fields in `DefaultUsage` changed from `Long` to `Integer`:
 ** `promptTokens`
 ** `completionTokens` (formerly `generationTokens`)
 ** `totalTokens`
-* Constructor changes in link:https://github.com/spring-projects/spring-ai/blob/main/spring-ai-core/src/main/java/org/springframework/ai/chat/metadata/DefaultUsage.java[DefaultUsage]:
-** Changed parameter types from `Long` to `Integer`
-** Renamed `generationTokens` parameter to `completionTokens`
-** Removed constructors that used `Long` parameters
 
-=== JSON Serialization Changes
+==== Required Actions
+
+* Replace all calls to `getGenerationTokens()` with `getCompletionTokens()`
+
+* Update `DefaultUsage` constructor calls:
+[source,java]
+----
+// Old (M5)
+new DefaultUsage(Long promptTokens, Long generationTokens, Long totalTokens)
+
+// New (M6)
+new DefaultUsage(Integer promptTokens, Integer completionTokens, Integer totalTokens)
+----
+
+NOTE: For more information on handling Usage, refer to xref:api/usage-handling.adoc[here]
+
+==== JSON Serialization/Deserialization Changes
+While M6 maintains backward compatibility for JSON deserialization of the `generationTokens` field, this field will be removed in M7. Any persisted JSON documents using the old field name should be updated to use `completionTokens`.
 
-* The JSON format now uses `completionTokens` instead of `generationTokens`
-* For backward compatibility, JSON deserialization still supports the old format with `generationTokens`
-* Example of new JSON format:
+Example of the new JSON format:
 [source,json]
 ----
 {
@@ -29,22 +47,15 @@
 }
 ----
 
-=== Migration Guide
-
-1. Update your code to use `getCompletionTokens()` instead of `getGenerationTokens()`
-2. When creating new instances of link:https://github.com/spring-projects/spring-ai/blob/main/spring-ai-core/src/main/java/org/springframework/ai/chat/metadata/DefaultUsage.java[DefaultUsage], use `Integer` instead of `Long` for token counts. This is a breaking change and requires updating all constructor calls.
-3. When constructing link:https://github.com/spring-projects/spring-ai/blob/main/spring-ai-core/src/main/java/org/springframework/ai/chat/metadata/DefaultUsage.java[DefaultUsage], use the parameter name `completionTokens` instead of `generationTokens`
-4. If you're serializing `DefaultUsage` objects to JSON, update your code to use the new field names
-
 
 === Removal of deprecated Amazon Bedrock chat models
 
 Starting 1.0.0-M6, Spring AI transitioned to using Amazon Bedrock's Converse API for all Chat conversation implementations in Spring AI.
 All the Amazon Bedrock Chat models are removed except the Embedding models for Cohere and Titan.
 
-=== Migration Guide
+NOTE: Refer to xref:api/chat/bedrock-converse.adoc[Bedrock Converse] documentation for using the chat models.
+
 
-Refer to xref:api/chat/bedrock-converse.adoc[Bedrock Converse] documentation for using the chat models.
 
 == Upgrading to 1.0.0.M5
 