Skip to content

Commit e91337e

Browse files
authored
Merge pull request #187883 from HeidiSteen/heidist-fresh
[azure search] managed identity doc update
2 parents 8c9ee8d + e56649f commit e91337e

17 files changed

+437
-486
lines changed

articles/search/TOC.yml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -209,7 +209,7 @@
209209
href: search-security-manage-encryption-keys.md
210210
- name: Find encrypted objects
211211
href: search-security-get-encryption-keys.md
212-
- name: Inbound security
212+
- name: Inbound connections
213213
items:
214214
- name: Use key authentication
215215
href: search-security-api-keys.md
@@ -223,8 +223,10 @@
223223
href: service-create-private-endpoint.md
224224
- name: Troubleshoot private connections
225225
href: troubleshoot-shared-private-link-resources.md
226-
- name: Outbound security (indexers)
226+
- name: Outbound connections
227227
items:
228+
- name: Connect using a managed identity
229+
href: search-howto-managed-identities-data-sources.md
228230
- name: Connect over an IP range
229231
href: search-indexer-howto-access-ip-restricted.md
230232
- name: Connect as a trusted service
@@ -323,8 +325,6 @@
323325
href: search-howto-index-json-blobs.md
324326
- name: Connect with managed identities
325327
items:
326-
- name: Managed identities overview
327-
href: search-howto-managed-identities-data-sources.md
328328
- name: Azure Storage
329329
href: search-howto-managed-identities-storage.md
330330
- name: Azure Cosmos DB
18.6 KB
Loading
40.2 KB
Loading
15.3 KB
Loading
-6.41 KB
Loading
-3.81 KB
Loading
7.58 KB
Loading

articles/search/search-file-storage-integration.md

Lines changed: 7 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -27,6 +27,8 @@ This article supplements [**Create an indexer**](search-howto-create-indexers.md
2727

2828
+ Files containing text. If you have binary data, you can include [AI enrichment](cognitive-search-concept-intro.md) for image analysis.
2929

30+
+ Read permissions on Azure Storage. A "full access" connection string includes a key that grants access to the content, but if you're using Azure roles instead, make sure the [search service managed identity](search-howto-managed-identities-data-sources.md) has **Data and Reader** permissions.
31+
3032
## Supported document formats
3133

3234
The Azure Files indexer can extract text from the following document formats:
@@ -62,16 +64,16 @@ A data source definition can also include [soft deletion policies](search-howto-
6264

6365
Indexers can connect to a file share using the following connections.
6466

65-
| Managed identity connection string |
66-
|------------------------------------|
67-
|`{ "connectionString" : "ResourceId=/subscriptions/<your subscription ID>/resourceGroups/<your resource group name>/providers/Microsoft.Storage/storageAccounts/<your storage account name>/;" }`|
68-
|This connection string does not require an account key, but you must have previously configured a search service to [connect using a managed identity](search-howto-managed-identities-storage.md).|
69-
7067
| Full access storage account connection string |
7168
|-----------------------------------------------|
7269
|`{ "connectionString" : "DefaultEndpointsProtocol=https;AccountName=<your storage account>;AccountKey=<your account key>;" }` |
7370
| You can get the connection string from the Storage account page in Azure portal by selecting **Access keys** in the left navigation pane. Make sure to select a full connection string and not just a key. |
7471

72+
| Managed identity connection string |
73+
|------------------------------------|
74+
|`{ "connectionString" : "ResourceId=/subscriptions/<your subscription ID>/resourceGroups/<your resource group name>/providers/Microsoft.Storage/storageAccounts/<your storage account name>/;" }`|
75+
|This connection string does not require an account key, but you must have previously configured a search service to [connect using a managed identity](search-howto-managed-identities-data-sources.md).|
76+
7577
| Storage account shared access signature** (SAS) connection string |
7678
|-------------------------------------------------------------------|
7779
| `{ "connectionString" : "BlobEndpoint=https://<your account>.blob.core.windows.net/;SharedAccessSignature=?sv=2016-05-31&sig=<the signature>&spr=https&se=<the validity end time>&srt=co&ss=b&sp=rl;" }` |

articles/search/search-howto-index-azure-data-lake-storage.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ manager: nitinme
99

1010
ms.service: cognitive-search
1111
ms.topic: how-to
12-
ms.date: 01/19/2022
12+
ms.date: 02/11/2022
1313
---
1414

1515
# Index data from Azure Data Lake Storage Gen2
@@ -28,9 +28,9 @@ For a code sample in C#, see [Index Data Lake Gen2 using Azure AD](https://githu
2828

2929
+ [Access tiers](../storage/blobs/access-tiers-overview.md) for ADLS Gen2 include hot, cool, and archive. Only hot and cool can be accessed by search indexers.
3030

31-
+ Blobs containing text. If you have binary data, you can include [AI enrichment](cognitive-search-concept-intro.md) for image analysis.
31+
+ Blobs containing text. If you have binary data, you can include [AI enrichment](cognitive-search-concept-intro.md) for image analysis. Note that blob content cannot exceed the [indexer limits](search-limits-quotas-capacity.md#indexer-limits) for your search service tier.
3232

33-
Note that blob content cannot exceed the [indexer limits](search-limits-quotas-capacity.md#indexer-limits) for your search service tier.
33+
+ Read permissions on Azure Storage. A "full access" connection string includes a key that grants access to the content, but if you're using Azure roles instead, make sure the [search service managed identity](search-howto-managed-identities-data-sources.md) has **Storage Blob Data Reader** permissions.
3434

3535
## Access control
3636

@@ -75,16 +75,16 @@ A data source definition can also include [soft deletion policies](search-howto-
7575

7676
Indexers can connect to a blob container using the following connections.
7777

78-
| Managed identity connection string |
79-
|------------------------------------|
80-
|`{ "connectionString" : "ResourceId=/subscriptions/<your subscription ID>/resourceGroups/<your resource group name>/providers/Microsoft.Storage/storageAccounts/<your storage account name>/;" }`|
81-
|This connection string does not require an account key, but you must have previously configured a search service to [connect using a managed identity](search-howto-managed-identities-storage.md).|
82-
8378
| Full access storage account connection string |
8479
|-----------------------------------------------|
8580
|`{ "connectionString" : "DefaultEndpointsProtocol=https;AccountName=<your storage account>;AccountKey=<your account key>;" }` |
8681
| You can get the connection string from the Storage account page in Azure portal by selecting **Access keys** in the left navigation pane. Make sure to select a full connection string and not just a key. |
8782

83+
| Managed identity connection string |
84+
|------------------------------------|
85+
|`{ "connectionString" : "ResourceId=/subscriptions/<your subscription ID>/resourceGroups/<your resource group name>/providers/Microsoft.Storage/storageAccounts/<your storage account name>/;" }`|
86+
|This connection string does not require an account key, but you must have previously configured a search service to [connect using a managed identity](search-howto-managed-identities-data-sources.md).|
87+
8888
| Storage account shared access signature** (SAS) connection string |
8989
|-------------------------------------------------------------------|
9090
| `{ "connectionString" : "BlobEndpoint=https://<your account>.blob.core.windows.net/;SharedAccessSignature=?sv=2016-05-31&sig=<the signature>&spr=https&se=<the validity end time>&srt=co&ss=b&sp=rl;" }` |
@@ -190,7 +190,7 @@ By default, most blobs are indexed as a single search document in the index, inc
190190
+ [Indexing JSON blobs](search-howto-index-json-blobs.md)
191191
+ [Indexing CSV blobs](search-howto-index-csv-blobs.md)
192192

193-
A compound or embedded document (such as a ZIP archive, a Word document with embedded Outlook email containing attachments, or a .MSG file with attachments) is also indexed as a single document. For example, all images extracted from the attachments of an .MSG file will be returned in the normalized_images field. If you have images, consider adding [AI enrichment](cognitive-search-concept-intro.md) to get more search utility from that content.
193+
A compound or embedded document (such as a ZIP archive, a Word document with embedded Outlook email containing attachments, or an .MSG file with attachments) is also indexed as a single document. For example, all images extracted from the attachments of an .MSG file will be returned in the normalized_images field. If you have images, consider adding [AI enrichment](cognitive-search-concept-intro.md) to get more search utility from that content.
194194

195195
Textual content of a document is extracted into a string field named "content".
196196

@@ -256,7 +256,7 @@ Add the following metadata properties and values to blobs in Blob Storage. When
256256
| Property name | Property value | Explanation |
257257
| ------------- | -------------- | ----------- |
258258
| `AzureSearch_Skip` |`"true"` |Instructs the blob indexer to completely skip the blob. Neither metadata nor content extraction is attempted. This is useful when a particular blob fails repeatedly and interrupts the indexing process. |
259-
| `AzureSearch_SkipContent` |`"true"` |This is equivalent of `"dataToExtract" : "allMetadata"` setting described [above](#PartsOfBlobToIndex) scoped to a particular blob. |
259+
| `AzureSearch_SkipContent` |`"true"` |This is equivalent to the `"dataToExtract" : "allMetadata"` setting described [above](#PartsOfBlobToIndex) scoped to a particular blob. |
260260

261261
## How to index large datasets
262262

articles/search/search-howto-indexing-azure-blob-storage.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ manager: nitinme
99

1010
ms.service: cognitive-search
1111
ms.topic: how-to
12-
ms.date: 01/19/2022
12+
ms.date: 02/11/2022
1313
---
1414

1515
# Index data from Azure Blob Storage
@@ -28,9 +28,9 @@ This article supplements [**Create an indexer**](search-howto-create-indexers.md
2828

2929
+ [Access tiers](../storage/blobs/access-tiers-overview.md) for Blob storage include hot, cool, and archive. Only hot and cool can be accessed by search indexers.
3030

31-
+ Blobs containing text. If you have binary data, you can include [AI enrichment](cognitive-search-concept-intro.md) for image analysis.
31+
+ Blobs containing text. If you have binary data, you can include [AI enrichment](cognitive-search-concept-intro.md) for image analysis. Note that blob content cannot exceed the [indexer limits](search-limits-quotas-capacity.md#indexer-limits) for your search service tier.
3232

33-
Note that blob content cannot exceed the [indexer limits](search-limits-quotas-capacity.md#indexer-limits) for your search service tier.
33+
+ Read permissions on Azure Storage. A "full access" connection string includes a key that grants access to the content, but if you're using Azure roles instead, make sure the [search service managed identity](search-howto-managed-identities-data-sources.md) has **Storage Blob Data Reader** permissions.
3434

3535
<a name="SupportedFormats"></a>
3636

@@ -69,16 +69,16 @@ A data source definition can also include [soft deletion policies](search-howto-
6969

7070
Indexers can connect to a blob container using the following connections.
7171

72-
| Managed identity connection string |
73-
|------------------------------------|
74-
|`{ "connectionString" : "ResourceId=/subscriptions/<your subscription ID>/resourceGroups/<your resource group name>/providers/Microsoft.Storage/storageAccounts/<your storage account name>/;" }`|
75-
|This connection string does not require an account key, but you must have previously configured a search service to [connect using a managed identity](search-howto-managed-identities-storage.md).|
76-
7772
| Full access storage account connection string |
7873
|-----------------------------------------------|
7974
|`{ "connectionString" : "DefaultEndpointsProtocol=https;AccountName=<your storage account>;AccountKey=<your account key>;" }` |
8075
| You can get the connection string from the Storage account page in Azure portal by selecting **Access keys** in the left navigation pane. Make sure to select a full connection string and not just a key. |
8176

77+
| Managed identity connection string |
78+
|------------------------------------|
79+
|`{ "connectionString" : "ResourceId=/subscriptions/<your subscription ID>/resourceGroups/<your resource group name>/providers/Microsoft.Storage/storageAccounts/<your storage account name>/;" }`|
80+
|This connection string does not require an account key, but you must have previously configured a search service to [connect using a managed identity](search-howto-managed-identities-data-sources.md).|
81+
8282
| Storage account shared access signature** (SAS) connection string |
8383
|-------------------------------------------------------------------|
8484
| `{ "connectionString" : "BlobEndpoint=https://<your account>.blob.core.windows.net/;SharedAccessSignature=?sv=2016-05-31&sig=<the signature>&spr=https&se=<the validity end time>&srt=co&ss=b&sp=rl;" }` |
@@ -184,7 +184,7 @@ By default, most blobs are indexed as a single search document in the index, inc
184184
+ [Indexing JSON blobs](search-howto-index-json-blobs.md)
185185
+ [Indexing CSV blobs](search-howto-index-csv-blobs.md)
186186

187-
A compound or embedded document (such as a ZIP archive, a Word document with embedded Outlook email containing attachments, or a .MSG file with attachments) is also indexed as a single document. For example, all images extracted from the attachments of an .MSG file will be returned in the normalized_images field. If you have images, consider adding [AI enrichment](cognitive-search-concept-intro.md) to get more search utility from that content.
187+
A compound or embedded document (such as a ZIP archive, a Word document with embedded Outlook email containing attachments, or an .MSG file with attachments) is also indexed as a single document. For example, all images extracted from the attachments of an .MSG file will be returned in the normalized_images field. If you have images, consider adding [AI enrichment](cognitive-search-concept-intro.md) to get more search utility from that content.
188188

189189
Textual content of a document is extracted into a string field named "content".
190190

@@ -250,7 +250,7 @@ Add the following metadata properties and values to blobs in Blob Storage. When
250250
| Property name | Property value | Explanation |
251251
| ------------- | -------------- | ----------- |
252252
| "AzureSearch_Skip" |`"true"` |Instructs the blob indexer to completely skip the blob. Neither metadata nor content extraction is attempted. This is useful when a particular blob fails repeatedly and interrupts the indexing process. |
253-
| "AzureSearch_SkipContent" |`"true"` |This is equivalent of "dataToExtract" : "allMetadata" setting described [above](#PartsOfBlobToIndex) scoped to a particular blob. |
253+
| "AzureSearch_SkipContent" |`"true"` |This is equivalent to the `"dataToExtract" : "allMetadata"` setting described [above](#PartsOfBlobToIndex) scoped to a particular blob. |
254254

255255
## How to index large datasets
256256

0 commit comments

Comments
 (0)