Skip to content

Commit 990da8d

Browse files
committed
Add note on exact caching
1 parent 4924dfd commit 990da8d

File tree

1 file changed

+4
-1
lines changed
  • app/_kong_plugins/ai-semantic-cache

1 file changed

+4
-1
lines changed

app/_kong_plugins/ai-semantic-cache/index.md

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -59,7 +59,7 @@ faqs:
5959
failed to create memorydb instance failed to create index: LIMIT Number of indexes (11) exceeds the limit (10)
6060
```
6161
62-
This means that the hardcoded MemoryDB instance limit has been reached.
62+
This means that the hardcoded MemoryDB instance limit has been reached.
6363
To resolve this, create more MemoryDB instances to handle multiple {{page.name}} plugin instances.
6464
---
6565

@@ -141,6 +141,9 @@ The AI Semantic Cache plugin improves how AI systems provide responses by using
141141

142142
Together, these caching methods enhance the efficiency and relevance of AI responses, making interactions faster and more contextually accurate.
143143

144+
{:.info}
145+
> When Exact Caching is enabled, the AI Semantic Cache plugin may still return results for queries that are similar but not identical.This is expected behavior: the plugin performs similarity-based caching regardless of the Exact Caching setting.
146+
144147
### Headers sent to the client
145148

146149
When the AI Semantic Cache plugin is active, {{site.base_gateway}} sends additional headers

0 commit comments

Comments
 (0)