feat(anthropic): add support for prompt caching with fix #4139
Open
Claudio-code wants to merge 10 commits into spring-projects:main from Claudio-code:fix-gh-1413-pr-2
+1,553
−261
Signed-off-by: Soby Chacko <[email protected]> Signed-off-by: “claudio-code” <[email protected]>
Signed-off-by: Alexandros Pappas <[email protected]> update documentation Signed-off-by: Alexandros Pappas <[email protected]> Signed-off-by: “claudio-code” <[email protected]>
Signed-off-by: “claudio-code” <[email protected]>
Signed-off-by: leijendary <[email protected]> Remove references to spring-ai-core module in jdbc chat memory Signed-off-by: “claudio-code” <[email protected]>
Signed-off-by: Eddú Meléndez <[email protected]> Signed-off-by: “claudio-code” <[email protected]>
Implements Anthropic's prompt caching feature to improve token efficiency.
- Adds cache control support in AnthropicApi and AnthropicChatModel
- Creates AnthropicCacheType enum with EPHEMERAL cache type
- Extends AbstractMessage and UserMessage to support cache parameters
- Updates Usage tracking to include cache-related token metrics
- Adds integration test to verify prompt caching functionality
This implementation follows Anthropic's prompt caching API (beta-2024-07-31) which allows for more efficient token usage by caching frequently used prompts.
Signed-off-by: “claudio-code” <[email protected]>
…ing-projectsgh-1413-pr-2 Signed-off-by: “claudio-code” <[email protected]>
Signed-off-by: “claudio-code” <[email protected]>
Signed-off-by: “claudio-code” <[email protected]>
Signed-off-by: “claudio-code” <[email protected]>
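For readers unfamiliar with the feature the main commit describes, here is a minimal, dependency-free sketch of the wire shape involved. It is not the code in this PR: `PromptCacheSketch`, `CacheType`, `CacheAwareUsage`, and `textBlock` are hypothetical names chosen for illustration, standing in for the `AnthropicCacheType` enum and the Usage changes the commit mentions. It assumes only what Anthropic's API documents: a content block may carry `"cache_control": {"type": "ephemeral"}`, and the response usage reports `cache_creation_input_tokens` and `cache_read_input_tokens` alongside `input_tokens`.

```java
import java.util.Locale;

public class PromptCacheSketch {

    // Hypothetical stand-in for the enum the commit calls AnthropicCacheType.
    enum CacheType {
        EPHEMERAL;

        // The API expects the lowercase value "ephemeral".
        String wireValue() {
            return name().toLowerCase(Locale.ROOT);
        }
    }

    // Hypothetical usage holder; the two cache fields mirror
    // cache_creation_input_tokens and cache_read_input_tokens in the response.
    record CacheAwareUsage(int inputTokens, int outputTokens,
                           int cacheCreationInputTokens, int cacheReadInputTokens) {

        // Prompt-side total: uncached input plus cache writes plus cache reads.
        int totalPromptTokens() {
            return inputTokens + cacheCreationInputTokens + cacheReadInputTokens;
        }
    }

    // Builds a text content block as JSON; cache_control is added only
    // when a cache type is supplied.
    static String textBlock(String text, CacheType cacheType) {
        StringBuilder json = new StringBuilder();
        json.append("{\"type\":\"text\",\"text\":\"").append(escape(text)).append("\"");
        if (cacheType != null) {
            json.append(",\"cache_control\":{\"type\":\"")
                .append(cacheType.wireValue())
                .append("\"}");
        }
        return json.append("}").toString();
    }

    // Minimal JSON string escaping, sufficient for this illustration only.
    static String escape(String s) {
        return s.replace("\\", "\\\\").replace("\"", "\\\"").replace("\n", "\\n");
    }

    public static void main(String[] args) {
        // A large, reusable block marked as a cache breakpoint.
        System.out.println(textBlock("Long reference document...", CacheType.EPHEMERAL));
        // A short, per-request block left uncached.
        System.out.println(textBlock("What does section 2 say?", null));
        // A cache hit: most prompt tokens are reported as cache reads.
        CacheAwareUsage usage = new CacheAwareUsage(12, 80, 0, 2048);
        System.out.println(usage.totalPromptTokens());
    }
}
```

In a real integration the JSON would come from the project's serializer rather than string concatenation; the hand-built string here only keeps the sketch self-contained.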
@Claudio-code Something doesn't look right with the PR. The PR contains a few unrelated commits. Could you rebase your PR branch against the latest main and then add the changes related to the anthropic prompt caching as a single commit on top? Make sure to use DCO on that commit. Thanks!
Adds a fix to the cache implementation from feat(anthropic): add support for prompt caching #2669.
Adds support for caching assistant messages.
Still to do: add support for system messages.
Still to do: add support for tool caching.