You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/search/search-howto-create-indexers.md
+17-5Lines changed: 17 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -96,7 +96,7 @@ AI enrichment is out of scope for this article. For more information, start with
96
96
97
97
+ Be under the [maximum limits](search-limits-quotas-capacity.md#indexer-limits) for your service tier. The Free tier allows three objects of each type and 1-3 minutes of indexer processing or 3-10 if there's a skillset.
98
98
99
-
## Prepare data
99
+
## Prepare external data
100
100
101
101
Indexers work with data sets. When you run an indexer, it connects to your data source, retrieves the data from the container or folder, optionally serializes it into JSON before passing it to the search engine for indexing. This section describes the requirements of incoming data for text-based indexing.
102
102
@@ -115,21 +115,33 @@ Cognitive Search can't search over binary data in any format, although it can ex
115
115
116
116
Given that indexers don't fix data problems, other forms of data cleansing or manipulation might be needed. For more information, you should refer to the product documentation of your [Azure database product](../index.yml?product=databases).
117
117
118
+
## Prepare a data source
119
+
120
+
Indexers require a data source that specifies the type, location, and connection information.
121
+
122
+
1. Make sure you're using a [supported data source type](search-indexer-overview.md#supported-data-sources).
123
+
124
+
1.[Create a data source](/rest/api/searchservice/create-data-source). The following list is a few of the more frequently used data sources:
Recall that indexers pass off the search documents to the search engine for indexing. Just as indexers have properties that determine execution behavior, an index schema has properties that profoundly affect how strings are indexed (only strings are analyzed and tokenized).
132
+
Indexers also require a search index. Recall that indexers pass data off to the search engine for indexing. Just as indexers have properties that determine execution behavior, an index schema has properties that profoundly affect how strings are indexed (only strings are analyzed and tokenized).
121
133
122
134
1. Start with [Create a search index](search-how-to-create-search-index.md).
123
135
124
-
1. Set up the fields collection.
136
+
1. Set up the fields collection and field attributes.
125
137
126
-
The output of an indexer is a search index, and the attributed fields in the index will receive the incoming data. Fields are the only receptors of external content. Depending on how the fields are attributed, the values for each field will be analyzed, tokenized, or stored as verbatim strings for filters, fuzzy search, and typeahead queries.
138
+
Fields are the only receptors of external content. Depending on how the fields are attributed in the schema, the values for each field will be analyzed, tokenized, or stored as verbatim strings for filters, fuzzy search, and typeahead queries.
127
139
128
140
Indexers can automatically map source fields to target index fields when the names and types are equivalent. If a field can't be implicitly mapped, remember that you can [define an explicit field mapping](search-indexer-field-mappings.md) that tells the indexer how to route the content.
129
141
130
142
1. Review the analyzer assignments on each field. Analyzers can transform strings. As such, indexed strings might be different from what you passed in. You can evaluate the effects of analyzers using [Analyze Text (REST)](/rest/api/searchservice/test-analyzer). For more information about analyzers, see [Analyzers for text processing](search-analyzers.md).
131
143
132
-
In terms of how indexers interact with an index, an indexer only checks field names and types. There's no validation step that ensures incoming content is correct for the corresponding search field in the index.
144
+
During indexing, an indexer only checks field names and types. There's no validation step that ensures incoming content is correct for the corresponding search field in the index.
0 commit comments