You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/search/search-analyzers.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ manager: nitinme
7
7
ms.author: heidist
8
8
ms.service: azure-ai-search
9
9
ms.topic: conceptual
10
-
ms.date: 05/23/2024
10
+
ms.date: 01/16/2025
11
11
ms.custom:
12
12
- devx-track-csharp
13
13
- ignite-2023
@@ -22,9 +22,9 @@ An *analyzer* is a component of the [full text search engine](search-lucene-quer
22
22
+ Lower-case any upper-case words
23
23
+ Reduce words into primitive root forms for storage efficiency and so that matches can be found regardless of tense
24
24
25
-
Analysis applies to `Edm.String` fields that are marked as "searchable", which indicates full text search.
25
+
The output of a lexical analyzer is a sequence of [tokens](https://suif.stanford.edu/dragonbook/lecture-notes/Stanford-CS143/03-Lexical-Analysis.pdf).
26
26
27
-
For fields of this configuration, analysis occurs during indexing when tokens are created, and then again during query execution when queries are parsed and the engine scans for matching tokens. A match is more likely to occur when the same analyzer is used for both indexing and queries, but you can set the analyzer for each workload independently, depending on your requirements.
27
+
Lexical analysis applies to `Edm.String` fields that are marked as "searchable", which indicates full text search. For fields of this configuration, analysis occurs during indexing when tokens are created, and then again during query execution when queries are parsed and the engine scans for matching tokens. A match is more likely to occur when the same analyzer is used for both indexing and queries, but you can set the analyzer for each workload independently, depending on your requirements.
28
28
29
29
Query types that are *not* full text search, such as filters or fuzzy search, don't go through the analysis phase on the query side. Instead, the parser sends those strings directly to the search engine, using the pattern that you provide as the basis for the match. Typically, these query forms require whole-string tokens to make pattern matching work. To ensure whole term tokens are preserved during indexing, you might need [custom analyzers](index-add-custom-analyzers.md). For more information about when and why query terms are analyzed, see [Full text search in Azure AI Search](search-lucene-query-architecture.md).
|[Azure.Search.Documents 11](/dotnet/api/overview/azure/search.documents-readme)| Active | New client library from the Azure .NET SDK team, initially released July 2020. See the [Change Log](https://github.com/Azure/azure-sdk-for-net/blob/Azure.Search.Documents_11.3.0/sdk/search/Azure.Search.Documents/CHANGELOG.md) for information about minor releases. |
63
-
|[Microsoft.Azure.Search 10](https://www.nuget.org/packages/Microsoft.Azure.Search/)| Retired | Released May 2019. This is the last version of the Microsoft.Azure.Search package and it's now deprecated. It's succeeded by Azure.Search.Documents. |
64
-
|[Microsoft.Azure.Management.Search 4.0.0](https://www.nuget.org/packages/Microsoft.Azure.Management.Search/4.0.0)| Active | Targets the Management REST api-version=2020-08-01. |
65
-
|[Microsoft.Azure.Management.Search 3.0.0](https://www.nuget.org/packages/Microsoft.Azure.Management.Search/3.0.0)| Retired | Targets the Management REST api-version=2015-08-19. |
60
+
| SDK version | Status | Change log | Description |
|[Azure.Search.Documents 11](/dotnet/api/overview/azure/search.documents-readme)| Active |[Change Log](https://github.com/Azure/azure-sdk-for-net/blob/main/sdk/search/Azure.Search.Documents/CHANGELOG.md)| APIs for data plane operations on a service, such as read-write operations on content and objects. |
63
+
|[Azure.ResourceManager.Search](https://www.nuget.org/packages/Microsoft.Azure.Management.Search/4.0.0)| Active |[Change Log](https://github.com/Azure/azure-sdk-for-net/blob/main/sdk/search/Azure.ResourceManager.Search/CHANGELOG.md)| APIs for control plane operations on the search service. |
|[Java azure-search-documents 11](/java/api/overview/azure/search-documents-readme)| Active | Use the `azure-search-documents` client library for data plane operations. |
72
-
|[Java Management Client 1.35.0](/java/api/overview/azure/search/management)| Active | Use the `azure-mgmt-search` client library for control plane operations. |
|[azure-search-documents 11](/java/api/overview/azure/search-documents-readme)| Active |[Change Log](https://github.com/Azure/azure-sdk-for-java/blob/main/sdk/search/azure-search-documents/CHANGELOG.md) Use the `azure-search-documents` client library for data plane operations. |
70
+
|[azure-resourcemanager-search 2](/java/api/overview/azure/resourcemanager-search-readme)| Active |[Change Log](https://github.com/Azure/azure-sdk-for-java/blob/main/sdk/resourcemanager/azure-resourcemanager-search/CHANGELOG.md)|Use the `azure-resourcemanager-search` client library for control plane operations. |
|[JavaScript @azure/search-documents11.0](/javascript/api/overview/azure/search-documents-readme)| Active | Use the `@azure/search-documents` client library for data plane operations. |
79
-
|[JavaScript @azure/arm-search](https://www.npmjs.com/package/@azure/arm-search)| Active | Use the `@azure/arm-search`client library for control plane operations. |
|[@azure/search-documents12](/javascript/api/overview/azure/search-documents-readme)| Active|[Change Log](https://github.com/Azure/azure-sdk-for-js/blob/main/sdk/search/search-documents/CHANGELOG.md)| Use the `@azure/search-documents` client library for data plane operations. |
77
+
|[@azure/arm-search 4](/javascript/api/overview/azure/arm-search-readme)| Active |[Change Log](https://github.com/Azure/azure-sdk-for-js/blob/main/sdk/search/arm-search/CHANGELOG.md)|Use the `@azure/arm-search`package for control plane operations. |
|[Python azure-search-documents 11.0](/python/api/azure-search-documents)| Active | Use the `azure-search-documents` client library for data plane operations. |
86
-
|[Python azure-mgmt-search 8.0](https://pypi.org/project/azure-mgmt-search/)| Active | Use the `azure-mgmt-search` client library for control plane operations. |
|[azure-search-documents 11](/python/api/overview/azure/search-documents-readme)| Active|[Change Log](https://github.com/Azure/azure-sdk-for-python/blob/main/sdk/search/azure-search-documents/CHANGELOG.md)| Use the `azure-search-documents` client library for data plane operations. |
84
+
|[azure-mgmt-search 9](https://pypi.org/project/azure-mgmt-search/)| Active|[Change Log](https://github.com/Azure/azure-sdk-for-python/blob/main/sdk/search/azure-mgmt-search/CHANGELOG.md)| Use the `azure-mgmt-search` client library for control plane operations. |
Copy file name to clipboardExpand all lines: articles/search/search-faq-frequently-asked-questions.yml
+6-6Lines changed: 6 additions & 6 deletions
Original file line number
Diff line number
Diff line change
@@ -9,7 +9,7 @@ metadata:
9
9
ms.author: heidist
10
10
ms.service: azure-ai-search
11
11
ms.topic: faq
12
-
ms.date: 05/28/2024
12
+
ms.date: 01/16/2025
13
13
title: Azure AI Search Frequently Asked Questions
14
14
summary: Find answers to commonly asked questions about Azure AI Search.
15
15
@@ -36,7 +36,7 @@ sections:
36
36
answer: |
37
37
For vectors, the embedding models you use determines the linguistic experience.
38
38
39
-
For nonvector strings and numbers, the default analyzer used for tokenization is standard Lucene and it is language agnostic. Otherwise, language support is expressed through [language analyzers](index-add-language-analyzers.md#supported-language-analyzers) that apply linguistic rules to inbound (indexing) and outbound (queries) content. Some features, such as [speller](speller-how-to-add.md#supported-languages), are limited to a subset of languages.
39
+
For nonvector strings and numbers, the default analyzer used for tokenization is standard Lucene and it's language agnostic. Otherwise, language support is expressed through [language analyzers](index-add-language-analyzers.md#supported-language-analyzers) that apply linguistic rules to inbound (indexing) and outbound (queries) content. Some features, such as [speller](speller-how-to-add.md#supported-languages) and [query rewrite](semantic-how-to-query-rewrite.md), are limited to a subset of languages.
40
40
41
41
- question: |
42
42
How do I integrate search into my solution?
@@ -101,12 +101,12 @@ sections:
101
101
- question: |
102
102
Does Azure AI Search support vector search?
103
103
answer: |
104
-
Azure AI Search supports vector indexing and retrieval. It can vectorize query strings and content if you use the preview and beta libraries.
104
+
Azure AI Search supports vector indexing and retrieval. It can chunk and vectorize query strings and content if you use [integrated vectorization](vector-search-integrated-vectorization.md) and take a dependency on indexers and skillsets.
105
105
106
106
- question: |
107
107
How does vector search work in Azure AI Search?
108
108
answer: |
109
-
With standalone vector search, you first use an embedding model to transform content into a vector representation within an embedding space. You can then provide these vectors in a document payload to the search index for indexing. To serve search requests, you use the same deep neural network (DNN) from indexing to transform the search query into a vector representation, and vector search finds the most similar vectors and return the corresponding documents.
109
+
With standalone vector search, you first use an embedding model to transform content into a vector representation within an embedding space. You can then provide these vectors in a document payload to the search index for indexing. To serve search requests, you use the same embedding model to transform the search query into a vector representation, and vector search finds the most similar vectors and return the corresponding documents.
110
110
111
111
In Azure AI Search, you can index vector data as fields in documents alongside textual and other types of content. There are [multiple data types](/rest/api/searchservice/supported-data-types#edm-data-types-for-vector-fields) for vector fields.
112
112
@@ -130,7 +130,7 @@ sections:
130
130
- question: |
131
131
Why do I see different vector index size limits between my new search services and existing search services?
132
132
answer: |
133
-
We're rolling out improved vector index size limits worldwide for new search services, but we're still building out infrastructure capacity in certain regions. New search services created in supported regions will see increased vector index size limits. Unfortunately, we can't migrate existing services to the new limits.
133
+
Azure AI Search rolled out improved vector index size limits worldwide for new search services, but [some regions experience capacity constraints](search-region-support.md), and some regions don't have the required infrastructure. New search services created in supported regions should see increased vector index size limits. Unfortunately, we can't migrate existing services to the new limits. Also, only vector indexes that use the Hierarchical Navigable Small World (HNSW) algorithm report on vector index size in the Azure portal. If your index uses exhaustive KNN, vector index size is reported as zero, even though the index contains vectors.
134
134
135
135
- question: |
136
136
How do I enable vector search on a search index?
@@ -141,7 +141,7 @@ sections:
141
141
142
142
* Add a "vectorSearch" section to the index schema specifying the configuration used by vector search fields, including the parameters of the Approximate Nearest Neighbor algorithm used, like HNSW.
143
143
144
-
* Use the latest stable version[**2024-07-01**](/rest/api/searchservice), or an Azure SDK to create or update the index, load documents, and issue queries.
144
+
* Use the latest stable version[**2024-07-01**](/rest/api/searchservice), or an Azure SDK to create or update the index, load documents, and issue queries. For more information, see [Create a vector index](vector-search-how-to-create-index.md).
Copy file name to clipboardExpand all lines: articles/search/search-howto-index-json-blobs.md
+3-5Lines changed: 3 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -11,7 +11,7 @@ ms.service: azure-ai-search
11
11
ms.custom:
12
12
- ignite-2023
13
13
ms.topic: how-to
14
-
ms.date: 06/25/2024
14
+
ms.date: 01/16/2025
15
15
---
16
16
17
17
# Index JSON blobs and files in Azure AI Search
@@ -39,7 +39,6 @@ Within the indexer definition, you can optionally set [field mappings](search-in
39
39
> [!NOTE]
40
40
> When a JSON parsing mode is used, Azure AI Search assumes that all blobs use the same parser (either for **`json`**, **`jsonArray`** or **`jsonLines`**). If you have a mix of different file types in the same data source, consider using [file extension filters](search-blob-storage-integration.md#controlling-which-blobs-are-indexed) to control which files are imported.
41
41
42
-
43
42
The following sections describe each mode in more detail. If you're unfamiliar with indexer clients and concepts, see [Create a search indexer](search-howto-create-indexers.md). You should also be familiar with the details of [basic blob indexer configuration](search-howto-indexing-azure-blob-storage.md), which isn't repeated here.
44
43
45
44
<aname="parsing-single-blobs"></a>
@@ -76,8 +75,7 @@ api-key: [admin key]
76
75
```
77
76
78
77
> [!NOTE]
79
-
> As with all indexers, if fields do not clearly match, you should expect to explicitly specify individual [field mappings](search-indexer-field-mappings.md) unless you are using the implicit fields mappings available for blob content and metadata, as described in [basic blob indexer configuration](search-howto-indexing-azure-blob-storage.md).
80
-
78
+
> As with all indexers, if fields don't clearly match, you should expect to explicitly specify individual [field mappings](search-indexer-field-mappings.md) unless you're using the implicit fields mappings available for blob content and metadata, as described in [basic blob indexer configuration](search-howto-indexing-azure-blob-storage.md).
81
79
82
80
### json example (single hotel JSON files)
83
81
@@ -208,7 +206,7 @@ You can also refer to individual array elements by using a zero-based index. For
208
206
```
209
207
210
208
> [!NOTE]
211
-
> If "sourceFieldName" refers to a property that doesn't exist in the JSON blob, that mapping is skipped without an error. This behavior allows indexing to continue for JSON blobs that have a different schema (which is a common use case). Because there is no validation check, check the mappings carefully for typos so that you aren't losing documents for the wrong reason.
209
+
> If "sourceFieldName" refers to a property that doesn't exist in the JSON blob, that mapping is skipped without an error. This behavior allows indexing to continue for JSON blobs that have a different schema (which is a common use case). Because there's no validation check, check the mappings carefully for typos so that you aren't losing documents for the wrong reason.
0 commit comments