You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It’s useful to chunk large documents into smaller pieces when creating embeddings.
24
+
An embedding is a text’s semantic representation capturing the meaning of what the text is about.
25
+
If the given text is long and contains too many diverse subjects, the informative value of its embedding deteriorates.
26
+
17
27
## Description
18
28
19
-
The user question is embedded using the same embedder that has been used before to create the chunk embeddings. A vector similarity search is executed on the chunk embeddings to retrieve k (number previously configured by developer / user) most similar chunks.
29
+
The user question is embedded using the same embedder that has been used before to create the chunk embeddings.
30
+
A vector similarity search is executed on the chunk embeddings to retrieve k (number previously configured by developer / user) most similar chunks.
20
31
21
32
## Usage
22
33
23
-
This pattern is useful if the user asks for specific information about a topic that exists in one or more (but not too many) chunks. The question should not require complex aggregations or knowledge about the whole dataset. Since the pattern only contains a vector similarity search it is easy to understand, implement and get started with.
34
+
This pattern is useful if the user asks for specific information about a topic that exists in one or more (but not too many) chunks.
35
+
The question should not require complex aggregations or knowledge about the whole dataset.
36
+
Since the pattern only contains a vector similarity search it is easy to understand, implement and get started with.
37
+
38
+
## Required pre-processing
39
+
40
+
Split documents into chunks and use an embedding model to embed the text content of the chunks.
41
+
See [chunking](/guides/chunking).
42
+
43
+
## Retrieval Query
44
+
45
+
No additional query is necessary since the Neo4j Vector retriever retrieves similar chunks by default.
24
46
25
47
## Further reading
26
48
@@ -29,6 +51,7 @@ This pattern is useful if the user asks for specific information about a topic t
0 commit comments