MicrosoftDocs
diff --git a/‎articles/search/TOC.yml
Lines changed: 18 additions & 16 deletions b/‎articles/search/TOC.yml
Lines changed: 18 additions & 16 deletions
diff --git a/‎articles/search/hybrid-search-overview.md
Lines changed: 132 additions & 0 deletions b/‎articles/search/hybrid-search-overview.md
Lines changed: 132 additions & 0 deletions
diff --git a/‎articles/search/hybrid-search-ranking.md
Lines changed: 72 additions & 0 deletions b/‎articles/search/hybrid-search-ranking.md
Lines changed: 72 additions & 0 deletions
@@ -123,32 +123,34 @@
     href: samples-rest.md
 - name: Concepts
   items:
-  - name: Retrieval Augmented Generation (RAG)
-    href: retrieval-augmented-generation-overview.md
   - name: Search
     items:
     - name: Full-text search
-      items:
-      - name: Overview
-        href: search-lucene-query-architecture.md
-      - name: Query types and composition
-        href: search-query-overview.md
-      - name: Relevance scoring
-        href: index-similarity-and-scoring.md
+      href: search-lucene-query-architecture.md
     - name: Vector search
-      items:
-      - name: Overview
-        href: vector-search-overview.md
-      - name: Vector index size limit
-        href: vector-search-index-size.md
-      - name: Vector query execution
-        href: vector-search-ranking.md
+      href: vector-search-overview.md
+    - name: Hybrid search
+      href: hybrid-search-overview.md
+    - name: Retrieval Augmented Generation (RAG)
+      href: retrieval-augmented-generation-overview.md
+    - name: Other query types
+      href: search-query-overview.md
+  - name: Relevance
+    items:
+    - name: Scoring in keyword queries (BM25)
+      href: index-similarity-and-scoring.md
+    - name: Scoring in vector queries
+      href: vector-search-ranking.md
+    - name: Scoring in hybrid queries (RRF)
+      href: hybrid-search-ranking.md
     - name: Semantic search
       href: semantic-search-overview.md
   - name: Indexing
     items:
     - name: Search indexes
       href: search-what-is-an-index.md
+    - name: Vector index size limit
+      href: vector-search-index-size.md
     - name: Import
       href: search-what-is-data-import.md
     - name: Import Data wizard
 
@@ -0,0 +1,132 @@
+---
+title: Hybrid search
+titleSuffix: Azure Cognitive Search
+description: Describes concepts and architecture of hybrid query processing and document retrieval. Hybrid queries combine vector search and full text search.
+
+author: robertklee
+ms.author: robertlee
+ms.service: cognitive-search
+ms.topic: conceptual
+ms.date: 09/27/2023
+---
+
+# Hybrid search using vectors and full text in Azure Cognitive Search
+
+> [!IMPORTANT]
+> Hybrid search uses the [vector features](vector-search-overview.md) currently in public preview under [supplemental terms of use](https://azure.microsoft.com/support/legal/preview-supplemental-terms/).
+
+Hybrid search is a combination of full text and vector queries that execute against a search index that contains both searchable plain text content and generated embeddings. For query purposes, hybrid search is:
+
++ A single query request that includes `search` and `vectors` parameters, multiple vector queries, or one vector query targeting multiple fields
++ Parallel query execution
++ Merged results in the query response, scored using [Reciprocal Rank Fusion (RRF)](hybrid-search-ranking.md)
+
+This article explains the concepts, benefits, and limitations of hybrid search.
+
+## How does hybrid search work?
+
+In Azure Cognitive Search, vector indexes containing embeddings can live alongside textual and numerical fields allowing you to issue hybrid full text and vector queries. Hybrid queries can take advantage of existing functionality like filtering, faceting, sorting, scoring profiles, and [semantic ranking](semantic-search-overview.md) in a single search request.
+
+Hybrid search combines results from both full text and vector queries, which use different ranking functions such as BM25 and cosine similarity. To present these results in a single ranked list, a method of merging the ranked result lists is needed.
+
+## Structure of a hybrid query
+
+Hybrid search is predicated on having a search index that contains fields of various types, including plain text and numbers, geo coordinates for geospatial search, and vectors for a mathematical representation of a chunk of text or image, audio, and video. You can use almost all query capabilities in Cognitive Search with a vector query, except for client-side interactions such as  autocomplete and suggestions.
+
+A representative hybrid query might be as follows (notice the vector is trimmed for brevity):
+
+```http
+POST https://{{searchServiceName}}.search.windows.net/indexes/hotels-vector-quickstart/docs/search?api-version=2023-07-01-Preview
+  content-type: application/JSON
+{
+    "count": true,
+    "search": "historic hotel walk to restaurants and shopping",
+    "select": "HotelId, HotelName, Category, Description, Address/City, Address/StateProvince",
+    "filter": "geo.distance(Location, geography'POINT(-77.03241 38.90166)') le 300",
+    "facets": [ "Address/StateProvince"], 
+    "vectors": [
+        {
+            "value": [ <array of embeddings> ]
+            "k": 7,
+            "fields": "DescriptionVector"
+        },
+        {
+            "value": [ <array of embeddings> ]
+            "k": 7,
+            "fields": "Description_frVector"
+        }
+    ],
+    "queryType": "semantic",
+    "queryLanguage": "en-us",
+    "semanticConfiguration": "my-semantic-config"
+}
+```
+
+Key points include:
+
++ `search` specifies a full text search query.
++ `vectors` for vector queries, which can be multiple, targeting multiple vector fields. If the embedding space includes multi-lingual content, vector queries can find the match with no language analyzers or translation required.
++ `select` specifies which fields to return in results, which can be text fields that are human readable.
++ `filters` can specify geospatial search or other include and exclude criteria, such as whether parking is included. The geospatial query in this example finds hotels within a 300-kilometer radius of Washington D.C.
++ `facets` can be used to compute facet buckets over results that are returned from hybrid queries.
++ `queryType=semantic` invokes semantic ranking, applying machine reading comprehension to surface more relevant search results.
+
+Filters and facets target data structures within the index that are distinct from the inverted indexes used for full text search and the vector indexes used for vector search. As such, when filters and faceted operations execute, the search engine can apply the operational result to the hybrid search results in the response.
+
+Notice how there's no `orderby` in the query. Explicit sort orders override relevanced-ranked results, so if you want similarity and BM25 relevance, omit sorting in your query.
+
+A response from the above query might look like this:
+
+```http
+{
+    "@odata.count": 3,
+    "@search.facets": {
+        "Address/StateProvince": [
+            {
+                "count": 1,
+                "value": "NY"
+            },
+            {
+                "count": 1,
+                "value": "VA"
+            }
+        ]
+    },
+    "value": [
+        {
+            "@search.score": 0.03333333507180214,
+            "@search.rerankerScore": 2.5229012966156006,
+            "HotelId": "49",
+            "HotelName": "Old Carrabelle Hotel",
+            "Description": "Spacious rooms, glamorous suites and residences, rooftop pool, walking access to shopping, dining, entertainment and the city center.",
+            "Category": "Luxury",
+            "Address": {
+                "City": "Arlington",
+                "StateProvince": "VA"
+            }
+        },
+        {
+            "@search.score": 0.032522473484277725,
+            "@search.rerankerScore": 2.111117362976074,
+            "HotelId": "48",
+            "HotelName": "Nordick's Motel",
+            "Description": "Only 90 miles (about 2 hours) from the nation's capital and nearby most everything the historic valley has to offer.  Hiking? Wine Tasting? Exploring the caverns?  It's all nearby and we have specially priced packages to help make our B&B your home base for fun while visiting the valley.",
+            "Category": "Boutique",
+            "Address": {
+                "City": "Washington D.C.",
+                "StateProvince": null
+            }
+        }
+    ]
+}
+```
+
+## Benefits
+
+Hybrid search combines the strengths of vector search and keyword search. The advantage of vector search is finding information that's similar to your search query, even if there are no keyword matches in the inverted index. The advantage of keyword or full text search is precision, and the ability to apply semantic ranking that improves the quality of the initial results. Some scenarios, such as product codes, highly specialized jargon, dates, etc. can perform better with keyword search because it can identify exact matches.
+
+Benchmark testing on real-world and benchmark datasets indicates that hybrid retrieval with semantic ranking offers significant benefits in search relevance.
+
+## See also
+
+[Outperform vector search with hybrid retrieval and ranking (Tech blog)](https://techcommunity.microsoft.com/t5/azure-ai-services-blog/azure-cognitive-search-outperforming-vector-search-with-hybrid/ba-p/3929167)
@@ -0,0 +1,72 @@
+---
+title: Hybrid search scoring (RRF)
+titleSuffix: Azure Cognitive Search
+description: Describes the Reciprocal Rank Fusion (RRF) algorithm used to unify search scores from parallel queries in Azure Cognitive Search.
+
+author: yahnoosh
+ms.author: jlembicz
+ms.service: cognitive-search
+ms.topic: conceptual
+ms.date: 09/27/2023
+---
+
+# Relevance scoring in hybrid search using Reciprocal Rank Fusion (RRF)
+
+> [!IMPORTANT]
+> Hybrid search uses the [vector features](vector-search-overview.md) currently in public preview under [supplemental terms of use](https://azure.microsoft.com/support/legal/preview-supplemental-terms/).
+
+For hybrid search scoring, Cognitive Search uses the Reciprocal Rank Fusion (RRF) algorithm. RRF combines the results of different search methods - such as vector search and full text search or multiple vector queries executing in parallel - to produce a single relevance score. RRF is based on the concept of *reciprocal rank*, which is the inverse of the rank of the first relevant document in a list of search results. 
+
+The goal of the technique is to take into account the position of the items in the original rankings, and give higher importance to items that are ranked higher in multiple lists. This can help improve the overall quality and reliability of the final ranking, making it more useful for the task of fusing multiple ordered search results.
+
+In Azure Cognitive Search, RRF is used whenever there are two or more queries that execute in parallel. Each query produces a ranked result set, and RRF is used to merge and homogenize the rankings into a single result set, returned in the query response.
+
+## How RRF ranking works
+
+RRF works by taking the search results from multiple methods, assigning a reciprocal rank score to each document in the results, and then combining the scores to create a new ranking. The concept is that documents appearing in the top positions across multiple search methods are likely to be more relevant and should be ranked higher in the combined result.
+
+Here's a simple explanation of the RRF process:
+
+1. Obtain ranked search results from multiple queries executing in parallel for full text search and vector search.
+
+1. Assign reciprocal rank scores for result in each of the ranked lists. RRF generates a new **`@search.score`** for each match in each result set. For each document in the search results, we assign a reciprocal rank score based on its position in the list. The score is calculated as `1/(rank + k)`, where `rank` is the position of the document in the list, and `k` is a constant, which was experimentally observed to perform best if it's set to a small value like 60. **Note that this `k` value is a constant in the RRF algorithm and entirely separate from the `k` that controls the number of nearest neighbors.**
+
+1. Combine scores. For each document, the engine sums the reciprocal rank scores obtained from each search system, producing a combined score for each document. 
+
+1. Rank documents based on combined scores and sort them. The resulting list is the fused ranking. 
+
+Only fields marked as `searchable` in the index are used for scoring. Only fields marked as `retrievable`, or fields that are specified in `searchFields` in the query, are returned in search results, along with their search score.
+
+### Parallel query execution
+
+RRF is used anytime there's more than one query execution. The following examples illustrate query patterns where parallel query execution occurs:
+
++ A full text query, plus one vector query (simple hybrid scenario), equals two query executions.
++ A full text query, plus one vector query targeting two vector fields, equals three query executions.
++ A full text query, plus two vector queries targeting five vector fields, equals 11 query executions
+
+## Scores in a hybrid search results
+
+Whenever results are ranked, **`@search.score`** property contains the value used to order the results. Scores are generated by ranking algorithms that vary for each method. Each algorithm has its own range and magnitude.
+
+The following chart identifies the scoring property returned on each match, algorithm, and range of scores for each relevance ranking algorithm. 
+
+| Search method | Parameter | Scoring algorithm | Range |
+|---------------|-----------|-------------------|-------|
+| full-text search | `@search.score` | BM25 algorithm | No upper limit. |
+| vector search | `@search.score` | HNSW algorithm, using the similarity metric specified in the HNSW configuration. | 0.333 - 1.00 (Cosine), 0 to 1 for Euclidean and DotProduct. | 
+| hybrid search | `@search.score` | RRF algorithm | Upper limit is only bounded by the number of queries being fused, with each query contributing a maximum of approximately 1 to the RRF score. |
+| semantic ranking | `@search.rerankerScore` | Semantic ranking | 1.00 - 4.00 |
+
+Semantic ranking doesn't participate in RRF. Its score (`@search.rerankerScore`) is always reported separately in the query response. Semantic ranking can rerank full text and hybrid search results, assuming those results include fields having semantically rich content.
+
+## Number of ranked results in a hybrid query response
+
+By default, if you aren't using pagination, the search engine returns the top 50 highest ranking matches for full text search, and it returns `k` matches for vector search. In a hybrid query, `top` determines the number of results in the response. Based on defaults, the top 50 highest ranked matches of the unified result set are returned. Full text search is subject to a maximum limit of 1,000 matches (see [API response limits](search-limits-quotas-capacity.md#api-response-limits)). Once 1,000 matches are found, the search engine no longer looks for more.
+
+You can use `top`, `skip`, and `next` for paginated results. Paging results is how you determine the number of results on each logical page and navigate through the full payload. For more information, see [How to work with search results](search-pagination-page-layout.md).
+
+## See also
+
++ [Learn more about hybrid search](hybrid-search-overview.md)
++ [Learn more about vector search](vector-search-overview.md)