Skip to content

Commit 635027d

Browse files
Merge pull request #278377 from gmndrg/main
Update search-howto-index-changed-deleted-blobs.md
2 parents 701309b + 01bbf83 commit 635027d

File tree

1 file changed

+6
-1
lines changed

1 file changed

+6
-1
lines changed

articles/search/search-howto-index-changed-deleted-blobs.md

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.service: cognitive-search
1010
ms.custom:
1111
- ignite-2023
1212
ms.topic: how-to
13-
ms.date: 12/19/2023
13+
ms.date: 06/14/2024
1414
---
1515

1616
# Change and delete detection using indexers for Azure Storage in Azure AI Search
@@ -24,15 +24,20 @@ There are two ways to implement a soft delete strategy:
2424
+ [Native blob soft delete](#native-blob-soft-delete), applies to Blob Storage only
2525
+ [Soft delete using custom metadata](#soft-delete-using-custom-metadata)
2626

27+
The deletion detection strategy should be applied from the very first indexer run. If you didn't establish the deletion policy prior to the initial run, any documents that were deleted before the policy was implemented will remain in your index, even if you add the policy to the indexer later and reset it. If this has occurred, it is suggested that you create a new index using a new indexer, ensuring the deletion policy is in place from the beginning.
28+
29+
2730
## Prerequisites
2831

2932
+ Use an Azure Storage indexer for [Blob Storage](search-howto-indexing-azure-blob-storage.md), [Table Storage](search-howto-indexing-azure-tables.md), [File Storage](search-howto-indexing-azure-tables.md), or [Data Lake Storage Gen2](search-howto-index-azure-data-lake-storage.md)
3033

3134
+ Use consistent document keys and file structure. Changing document keys or directory names and paths (applies to ADLS Gen2) breaks the internal tracking information used by indexers to know which content was indexed, and when it was last indexed.
3235

36+
3337
> [!NOTE]
3438
> ADLS Gen2 allows directories to be renamed. When a directory is renamed, the timestamps for the blobs in that directory do not get updated. As a result, the indexer will not re-index those blobs. If you need the blobs in a directory to be reindexed after a directory rename because they now have new URLs, you will need to update the `LastModified` timestamp for all the blobs in the directory so that the indexer knows to re-index them during a future run. The virtual directories in Azure Blob Storage cannot be changed, so they do not have this issue.
3539
40+
3641
## Native blob soft delete
3742

3843
For this deletion detection approach, Azure AI Search depends on the [native blob soft delete](../storage/blobs/soft-delete-blob-overview.md) feature in Azure Blob Storage to determine whether blobs have transitioned to a soft deleted state. When blobs are detected in this state, a search indexer uses this information to remove the corresponding document from the index.

0 commit comments

Comments
 (0)