Add getting started with AI search

lcawl · lcawl · commit 0ef8120527f5 · 2025-06-23T17:29:04.000-07:00
diff --git a/solutions/search/serverless-elasticsearch-get-started-semantic.md b/solutions/search/serverless-elasticsearch-get-started-semantic.md
@@ -0,0 +1,185 @@
+---
+navigation_title: Get started with semantic search
+applies_to:
+  serverless:
+products:
+  - id: cloud-serverless
+---
+# Build an AI-powered search experience in {{es-serverless}}
+
+<!--
+As you ramp up on Elastic, you’ll use the Elasticsearch Relevance Engine™ (ESRE), designed to power AI search applications. With ESRE, you can take advantage of a suite of developer tools including Elastic’s textual search, vector database, and our proprietary transformer model for semantic search.
+-->
+
+Elastic offers a variety of search techniques, starting with BM25, the industry standard for textual search. It provides precise matching for specific searches, matching exact keywords, and it improves with tuning.
+
+<!--
+As you get started on vector search, keep in mind there are two forms of vector search: “dense” (aka, kNN vector search) and “sparse."
+TBD: Which type is implemented when you use semantic_text field?
+-->
+
+Elastic also offers an out-of-the-box Learned Sparse Encoder model for semantic search.
+This model outperforms on a variety of data sets, such as financial data, weather records, and question-answer pairs, among others.
+The model is built to provide great relevance across domains, without the need for additional fine tuning.
+
+<!--
+Check out this interactive demo to see how search results are more relevant when you test Elastic's Learned Sparse Encoder model against Elastic’s textual BM25 algorithm.
+
+In addition, Elastic also supports dense vectors to implement similarity search on unstructured data beyond text, such as videos, images, and audio.
+-->
+
+The advantage of semantic search and vector search is that these technologies allow you to use intuitive language in your search queries.
+For example, if you want to search for workplace guidelines on a second income, you could search for "side hustle", which is not a term you're likely to see in a formal HR document.
+
+## Setup
+
+In this guide, we'll demonstrate how to ingest data with the Elastic web crawler then try out some queries.
+If you want to play along, follow these setup steps. Otherwise, you can jump ahead to the query examples.
+
+1. Optional: Create an {{ecloud}} project, in particular an {{es-serverless}} project that is optimized for vectors. 
+   If you want to perform the steps in this guide you'll need:
+
+   * The {{es}} URL: the endpoint to which you will send your data
+   * The API Key: the easiest of the authentication methods
+   % TBD: Is this mandatory? Clarify value.
+
+1. Optional: Add data.
+   In this guide, the data is derived from a live website, the [{{es}} Labs](https://www.elastic.co/search-labs).
+
+   1. Let’s create an {{es}} index named `elasticsearch-labs-blog`.
+   1. Create mappings for a text field and a semantic text field.
+      <!--
+      Now before writing data to the index, let’s do a small configuration to ensure you have semantic search right from the start. Click on Mappings and + Add field, create a text field called `body`, this is where the crawler will put the content of the web pages it reads. Next, add a semantic text type field, let’s give it a very creative name: `semantic_text`.
+      
+      By using both a text field and semantic_text field, the process combines the strengths of traditional keyword search and advanced semantic search. This hybrid search provides comprehensive search capabilities, ensuring that users can find relevant information based on both the raw text and its underlying meaning.
+      -->
+   1. Ingest data with the Elastic Open Web Crawler.
+      <!--
+      You will need Docker to use the Open Web Crawler. Here is a simple config file, it tells the crawler to read the https://www.elastic.co/search-labs blog and write it to the `elasticsearch-labs-blog` index at `elasticsearch.host` using the `elasticsearch.api_key`... then create a `docker-compose.yml` file. Start the service with `docker-compose up -d` then start the crawling process with `docker-compose exec -it crawler bin/crawler crawl /app/config/crawler-config-blog.yml`. After a few minutes you should have the whole {{es}} labs blogs indexed.
+      
+      What just happened? The blog content was indexed to the `body` field, then this content was transformed into a sparse vector inside the `semantic_text` field. This transformation involved two main steps. First, the content was divided into smaller, manageable chunks to ensure that the text is broken down into meaningful segments, which can be more effectively processed and searched. Next, each chunk of text was transformed into a sparse vector representation using text expansion techniques. This step leverages ELSER (Elastic Search Engine for Relevance) to convert the text into a format that captures the semantic meaning, enabling more accurate and relevant search results.
+      -->
+
+## Test a keyword query
+
+% TBD Move below the semantic search example? Assumes people are already familiar with old search options?
+
+Now it’s time to search for the information you’re looking for.
+If you’re a developer who’s implementing search for a web application, you can use the Console/Dev Tools to test and refine search results from your indexed data.
+
+Let’s start with a simple `multi_match query`, which will match the text against the “title” and “body” fields.
+Since this is a classic lexical search (not semantic yet) the results of a query like “how to implement multilingual search” will match the words you are providing.
+
+```console
+GET elasticsearch-labs-blog/_search
+{
+  "_source": ["title"],
+  "query": {
+    "multi_match": {
+      "query": "how to implement multilingual search",
+      "fields": ["title","body"]
+    }
+  }
+}
+```
+
+In this case the top 5 matches are good, but not great.
+
+```txt
+"Multilingual vector search with the E5 embedding model"
+"Scalar quantization optimized for vector databases"
+"How to migrate your Ruby app from OpenSearch to Elasticsearch"
+"How to search languages with compound words"
+"How To"
+```
+
+## Test a semantic search query
+
+Now try the same but with a semantic query, it will translate the text “how to implement multilingual search?” automatically into a vector representation and perform the query against the `semantic_text` field.
+
+```console
+GET elasticsearch-labs-blog/_search
+{
+ "_source": ["title"],
+ "query": {
+   "semantic": {
+     "field": "semantic_text",
+     "query": "how to implement multilingual search?"
+   }
+ }
+}
+```
+
+The 5 top results you get back from this semantic search look much better.
+
+```txt
+"Multilingual vector search with the E5 embedding model"
+"How to search languages with compound words"
+"Dataset translation with LangChain, Python & Vector Database for multilingual insights"
+"Building multilingual RAG with Elastic and Mistral"
+"Agentic RAG with Elasticsearch & Langchain"
+```
+
+## Test a hybrid search query
+
+A more advanced example using Reciprocal Rank Fusion (RRF) is a technique used in hybrid retrieval systems to improve the relevance of search results.
+It combines different retrieval methods, such as lexical (traditional) search and semantic search, to enhance the overall search performance.
+
+By leveraging RRF, the query ensures that the final list of documents is a balanced mix of the top results from both retrieval methods, thereby improving the overall relevance and diversity of the search results.
+This fusion technique mitigates the limitations of individual retrieval methods, providing a more comprehensive and accurate set of results.
+
+% TBD: Are there reasons for not using hybrid search? e.g. additional storage space, speed
+
+```console
+GET elasticsearch-labs-blog/_search
+{
+ "_source": [
+   "title"
+ ],
+ "retriever": {
+   "rrf": {
+     "retrievers": [
+       {
+         "standard": {
+           "query": {
+             "multi_match": {
+               "fields": ["title","body"],
+               "query": "how to implement multilingual search"
+             }
+           }
+         }
+       },
+       {
+         "standard": {
+           "query": {
+             "semantic": {
+               "field": "semantic_text",
+               "query": "how to implement multilingual search"
+             }
+           }
+         }
+       }
+     ]
+   }
+ }
+}
+```
+
+The top 5 hits with hybrid search contain very good results, all highly relevant to how you can implement a multilingual search with Elasticsearch:
+
+```txt
+"Multilingual vector search with the E5 embedding model"
+"How to search languages with compound words"
+"Dataset translation with LangChain, Python & Vector Database for multilingual insights"
+"Building multilingual RAG with Elastic and Mistral"
+"Evaluating scalar quantization in Elasticsearch"
+```
+
+## Next steps
+
+Thanks for taking the time to set up semantic search for your data with {{ecloud}}.
+
+<!--
+Ready to get started? Spin up a free 14-day trial on Elastic Cloud or try out these 15 minute hands-on learning on Search AI 101.
+TBD: Link to other quickstarts or to the deeper semantic search options.
+-->
diff --git a/solutions/toc.yml b/solutions/toc.yml
@@ -8,6 +8,7 @@ toc:
           - file: search/run-elasticsearch-locally.md
           - file: search/serverless-elasticsearch-get-started.md
           - file: search/search-connection-details.md
+          - file: search/serverless-elasticsearch-get-started-semantic.md
       - file: search/api-quickstarts.md
         children:
           - file: search/elasticsearch-basics-quickstart.md