Skip to content

Commit dc15c3a

Browse files
authored
Merge pull request #1867 from redis/fix-search-endpoint
Langcache: Fix a few /v1/{cahceId}/entires/search endpoints
2 parents 866d9aa + 8789800 commit dc15c3a

File tree

2 files changed

+2
-2
lines changed

2 files changed

+2
-2
lines changed

content/develop/ai/langcache/_index.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -52,15 +52,15 @@ The following diagram displays how you can integrate LangCache into your GenAI a
5252
{{< image filename="images/rc/langcache-process.png" alt="The LangCache process diagram." >}}
5353

5454
1. A user sends a prompt to your AI app.
55-
1. Your app sends the prompt to LangCache through the `POST /v1/caches/{cacheId}/search` endpoint.
55+
1. Your app sends the prompt to LangCache through the `POST /v1/caches/{cacheId}/entries/search` endpoint.
5656
1. LangCache calls an embedding model service to generate an embedding for the prompt.
5757
1. LangCache searches the cache to see if a similar response already exists by matching the embeddings of the new query with the stored embeddings.
5858
1. If a semantically similar entry is found (also known as a cache hit), LangCache gets the cached response and returns it to your app. Your app can then send the cached response back to the user.
5959
1. If no match is found (also known as a cache miss), your app receives an empty response from LangCache. Your app then queries your chosen LLM to generate a new response.
6060
1. Your app sends the prompt and the new response to LangCache through the `POST /v1/caches/{cacheId}/entries` endpoint.
6161
1. LangCache stores the embedding with the new response in the cache for future use.
6262

63-
See the [LangCache API reference]({{< relref "/develop/ai/langcache/api-reference" >}}) for more information on how to use the LangCache API.
63+
See the [LangCache API and SDK examples]({{< relref "/develop/ai/langcache/api-examples" >}}) for more information on how to use the LangCache API.
6464

6565
## Get started
6666

-10.9 KB
Loading

0 commit comments

Comments
 (0)