[DOCS][101] Add BYO vectors ingestion tutorial #115112

leemthompo · 2024-10-18T14:26:50Z

👁️ URL preview

Adds a new bite-sized tutorial to Search your data > Semantic search
This is a toy example to learn syntax of ingesting a set of existing vectors. Tries to add enough links to relevant material for follow-up without too much cognitive overload.
Don't want to overload with information about the knn search side of things, but still making sure users can get where they need to next if they wanna drill down.

github-actions · 2024-10-18T14:27:04Z

Documentation preview:

✨ Changed pages

kderusso

Nice work!

kderusso · 2024-10-18T17:50:27Z

docs/reference/search/search-your-data/ingest-vectors.asciidoc

+<titleabbrev>Bring your own vector embeddings</titleabbrev>
++++
+
+This tutorial demonstrates how to index documents that already have dense vector embeddings into {es}.


Is it worth adding an example for sparse_vector embeddings here as well?

I think it would be best to keep this tightly focused to dense vectors and investigate demand for sparse vector equivalent going forward.

docs/reference/search/search-your-data/ingest-vectors.asciidoc

kderusso · 2024-10-18T17:54:22Z

docs/reference/search/search-your-data/ingest-vectors.asciidoc

+[[bring-your-own-vectors-search-documents]]
+=== Step 3: Search documents with embeddings
+
+Now you can query these document vectors using a <<knn-retriever,`knn` retriever>>.


Nice to see retriever examples! 🎉

docs/reference/search/search-your-data/ingest-vectors.asciidoc

Added explanation for dims parameter Separated single and bulk document indexing examples Improved explanations and wording throughout Added tip for beginners about semantic search Mentioned client-side vector generation as an alternative

jeffvestal · 2024-10-23T14:20:57Z

docs/reference/search/search-your-data/ingest-vectors.asciidoc

+[TIP]
+====
+The `dense_vector` type supports quantization to reduce the memory footprint required when searching float vectors.
+Learn more about balancing performance and accuracy in <<dense-vector-quantization,Dense vector quantization>>.


Is it worth mentioning that we auto-quantize with int8_hnsw by default for dense_vector field type?

Good idea! 👍

elasticsearchmachine · 2024-10-24T11:27:14Z

Pinging @elastic/es-docs (Team:Docs)

szabosteve

This is awesome, LGTM!

elasticsearchmachine · 2024-10-24T16:02:35Z

💔 Backport failed

The backport operation could not be completed due to the following error:

An unexpected error occurred when attempting to backport this PR.

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 115112

(cherry picked from commit d500daf)

leemthompo · 2024-10-24T16:03:35Z

💚 All backports created successfully

Status	Branch	Result
✅	8.16
✅	8.15

Questions ?

Please refer to the Backport tool documentation

(cherry picked from commit d500daf)

leemthompo · 2024-10-24T16:19:54Z

💚 All backports created successfully

Status	Branch	Result
✅	8.x

Questions ?

Please refer to the Backport tool documentation

(cherry picked from commit d500daf)

leemthompo added 2 commits October 18, 2024 16:23

[DOCS] Add BYO vectors ingestion tutorial

887e3c3

Del whitespace

a1ec5a2

leemthompo added the >docs General docs changes label Oct 18, 2024

leemthompo self-assigned this Oct 18, 2024

elasticsearchmachine added the v9.0.0 label Oct 18, 2024

Comment out flakey test, update ids

6490c55

kderusso reviewed Oct 18, 2024

View reviewed changes

leemthompo added 2 commits October 22, 2024 13:13

Updates, address feedback suggestions

602a311

Added explanation for dims parameter Separated single and bulk document indexing examples Improved explanations and wording throughout Added tip for beginners about semantic search Mentioned client-side vector generation as an alternative

Add semantic search SVG overview

0be4cb0

jeffvestal reviewed Oct 23, 2024

View reviewed changes

Mention default int8_hnsw quantization

03da7c9

leemthompo added auto-backport Automatically create backport pull requests when merged v8.15.0 v8.16.0 Team:Docs Meta label for docs team labels Oct 24, 2024

leemthompo marked this pull request as ready for review October 24, 2024 11:26

leemthompo requested a review from kderusso October 24, 2024 11:26

leemthompo requested a review from szabosteve October 24, 2024 13:59

szabosteve approved these changes Oct 24, 2024

View reviewed changes

leemthompo changed the title ~~[DOCS] Add BYO vectors ingestion tutorial~~ [DOCS][101] Add BYO vectors ingestion tutorial Oct 24, 2024

leemthompo merged commit d500daf into elastic:main Oct 24, 2024
5 checks passed

elasticsearchmachine added the backport pending label Oct 24, 2024

leemthompo mentioned this pull request Oct 24, 2024

[8.16] [DOCS][101] Add BYO vectors ingestion tutorial (#115112) #115573

Merged

leemthompo added a commit to leemthompo/elasticsearch that referenced this pull request Oct 24, 2024

[DOCS][101] Add BYO vectors ingestion tutorial (elastic#115112)

3bc9219

(cherry picked from commit d500daf)

leemthompo mentioned this pull request Oct 24, 2024

[8.15] [DOCS][101] Add BYO vectors ingestion tutorial (#115112) #115574

Merged

leemthompo added a commit to leemthompo/elasticsearch that referenced this pull request Oct 24, 2024

[DOCS][101] Add BYO vectors ingestion tutorial (elastic#115112)

82fcf17

(cherry picked from commit d500daf)

leemthompo deleted the byo-vectors branch October 24, 2024 16:05

leemthompo added the v8.17.0 label Oct 24, 2024

leemthompo mentioned this pull request Oct 24, 2024

[8.x] [DOCS][101] Add BYO vectors ingestion tutorial (#115112) #115576

Merged

leemthompo added a commit to leemthompo/elasticsearch that referenced this pull request Oct 24, 2024

[DOCS][101] Add BYO vectors ingestion tutorial (elastic#115112)

2c53a75

(cherry picked from commit d500daf)

elasticsearchmachine pushed a commit that referenced this pull request Oct 24, 2024

[DOCS][101] Add BYO vectors ingestion tutorial (#115112) (#115574)

4a32bf0

(cherry picked from commit d500daf)

elasticsearchmachine pushed a commit that referenced this pull request Oct 24, 2024

[DOCS][101] Add BYO vectors ingestion tutorial (#115112) (#115573)

addd7d6

(cherry picked from commit d500daf)

elasticsearchmachine pushed a commit that referenced this pull request Oct 24, 2024

[DOCS][101] Add BYO vectors ingestion tutorial (#115112) (#115576)

e847481

(cherry picked from commit d500daf)

georgewallace pushed a commit to georgewallace/elasticsearch that referenced this pull request Oct 25, 2024

[DOCS][101] Add BYO vectors ingestion tutorial (elastic#115112)

407b86b

jfreden pushed a commit to jfreden/elasticsearch that referenced this pull request Nov 4, 2024

[DOCS][101] Add BYO vectors ingestion tutorial (elastic#115112)

e14b329

[DOCS][101] Add BYO vectors ingestion tutorial #115112

[DOCS][101] Add BYO vectors ingestion tutorial #115112

Uh oh!

Conversation

leemthompo commented Oct 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

👁️ URL preview

Uh oh!

github-actions bot commented Oct 18, 2024

Uh oh!

kderusso left a comment

Choose a reason for hiding this comment

Uh oh!

kderusso Oct 18, 2024

Choose a reason for hiding this comment

Uh oh!

leemthompo Oct 22, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

kderusso Oct 18, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jeffvestal Oct 23, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

leemthompo Oct 23, 2024

Choose a reason for hiding this comment

Uh oh!

elasticsearchmachine commented Oct 24, 2024

Uh oh!

szabosteve left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

elasticsearchmachine commented Oct 24, 2024

💔 Backport failed

Uh oh!

leemthompo commented Oct 24, 2024

💚 All backports created successfully

Questions ?

Uh oh!

leemthompo commented Oct 24, 2024

💚 All backports created successfully

Questions ?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

leemthompo commented Oct 18, 2024 •

edited

Loading

jeffvestal Oct 23, 2024 •

edited

Loading