feat(anthropic): add support for prompt caching with fix #4139

Open
wants to merge 10 commits into
base: main

Conversation

Claudio-code
Contributor

@Claudio-code Claudio-code commented Aug 13, 2025

sobychacko and others added 7 commits August 13, 2025 19:31
Signed-off-by: Soby Chacko <[email protected]>
Signed-off-by: “claudio-code” <[email protected]>
Signed-off-by: Alexandros Pappas <[email protected]>

update documentation

Signed-off-by: Alexandros Pappas <[email protected]>
Signed-off-by: “claudio-code” <[email protected]>
Signed-off-by: leijendary <[email protected]>

Remove references to spring-ai-core module in jdbc chat memory

Signed-off-by: “claudio-code” <[email protected]>
Signed-off-by: Eddú Meléndez <[email protected]>
Signed-off-by: “claudio-code” <[email protected]>
Implements Anthropic's prompt caching feature to improve token efficiency.

- Adds cache control support in AnthropicApi and AnthropicChatModel
- Creates AnthropicCacheType enum with EPHEMERAL cache type
- Extends AbstractMessage and UserMessage to support cache parameters
- Updates Usage tracking to include cache-related token metrics
- Adds integration test to verify prompt caching functionality

This implementation follows Anthropic's prompt caching API (beta header prompt-caching-2024-07-31), which
reduces token cost and latency by caching frequently reused prompt prefixes.
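
For readers unfamiliar with the feature, here is a minimal sketch of what the bullets above describe at the wire level. It is not the code in this PR: `PromptCacheSketch`, `CacheType`, `textBlock`, and `isCacheHit` are hypothetical names chosen for illustration, and plain maps stand in for the real Spring AI request types. The only parts taken from Anthropic's API are the `cache_control` object with type `ephemeral` on a content block and the `cache_creation_input_tokens` / `cache_read_input_tokens` usage fields.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: names are illustrative and do not come from the PR.
class PromptCacheSketch {

    // Stand-in for an enum such as the AnthropicCacheType the PR describes.
    enum CacheType {
        EPHEMERAL("ephemeral");

        private final String value;

        CacheType(String value) {
            this.value = value;
        }

        // Builds the {"type": "ephemeral"} object attached to a content block.
        Map<String, Object> cacheControl() {
            return Map.of("type", value);
        }
    }

    // Builds a text content block; a non-null cacheType marks it as a cache breakpoint.
    static Map<String, Object> textBlock(String text, CacheType cacheType) {
        Map<String, Object> block = new LinkedHashMap<>();
        block.put("type", "text");
        block.put("text", text);
        if (cacheType != null) {
            block.put("cache_control", cacheType.cacheControl());
        }
        return block;
    }

    // Treats a response as a cache hit when some input tokens were read from the cache.
    static boolean isCacheHit(Map<String, Integer> usage) {
        return usage.getOrDefault("cache_read_input_tokens", 0) > 0;
    }

    public static void main(String[] args) {
        // A long, stable system prompt is the typical candidate for caching.
        List<Map<String, Object>> system = List.of(
                textBlock("Long, stable instructions reused across requests.", CacheType.EPHEMERAL));
        System.out.println(system);

        // Usage values here are made up for the example, not real API output.
        System.out.println(isCacheHit(Map.of(
                "cache_creation_input_tokens", 0,
                "cache_read_input_tokens", 1200)));
    }
}
```

On a first request the API would be expected to report the cached prefix under `cache_creation_input_tokens`; later requests that reuse the same prefix should report it under `cache_read_input_tokens`, which is presumably what the integration test mentioned above checks.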

Signed-off-by: “claudio-code” <[email protected]>
@Claudio-code Claudio-code marked this pull request as ready for review August 13, 2025 22:51
Signed-off-by: “claudio-code” <[email protected]>
Signed-off-by: “claudio-code” <[email protected]>
Signed-off-by: “claudio-code” <[email protected]>
@sobychacko
Contributor

@Claudio-code Something doesn't look right with the PR. The PR contains a few unrelated commits. Could you rebase your PR branch against the latest main and then add the changes related to the anthropic prompt caching as a single commit on top? Make sure to use DCO on that commit. Thanks!

6 participants