Merge pull request #5462 from PatrickFarley/aoai-build

JamesJBarnett · web-flow · commit 6d62fe29c1b0 · 2025-07-03T13:58:58.000-07:00
re-add pii filter docs
diff --git a/articles/ai-foundry/openai/concepts/content-filter-personal-information.md b/articles/ai-foundry/openai/concepts/content-filter-personal-information.md
@@ -0,0 +1,26 @@
+---
+title: Personally identifiable information (PII) filter
+description: Learn about the Personally Identifiable Information (PII) filter for identifying and flagging known personal information in large language model outputs.
+author: PatrickFarley
+ms.author: pafarley
+ms.date: 07/03/2025
+ms.topic: conceptual
+ms.service: azure-ai-openai
+---
+
+# Personally identifiable information (PII) filter
+
+Personally identifiable information (PII) refers to any information that can be used to identify a particular individual, such as a name, address, phone number, email address, social security number, driver's license number, passport number, or similar information.
+
+PII detection helps prevent PII from being exposed or shared, which protects users from identity theft, financial fraud, and other privacy violations.
+
+In the context of large language models (LLMs), PII detection analyzes the text content of LLM completions. The PII filter scans model output to identify known personal information. When PII is identified, it can be flagged for further review, or the output can be blocked. The filter is designed to help organizations prevent the generation of content that closely matches sensitive personal information.
+
+
+## PII types
+
+There are many different types of PII, and you can specify which types you want to filter. The set of PII types that can be detected by the filter matches the set that's defined in the [Azure AI Language docs](/azure/ai-services/language-service/personally-identifiable-information/concepts/entity-categories).
+
+## Filtering modes
+
+The PII filter operates in one of two modes. **Annotate** mode flags PII that's returned in the model output. **Annotate and Block** mode blocks the entire output if PII is detected. You can set the filtering mode individually for each PII category.
diff --git a/articles/ai-foundry/openai/concepts/content-filter.md b/articles/ai-foundry/openai/concepts/content-filter.md
@@ -47,6 +47,8 @@ The following table summarizes the risk categories supported by Azure OpenAI's c
 | [Groundedness](/azure/ai-services/openai/concepts/content-filter-groundedness)<sup>2</sup> | Groundedness detection flags whether the text responses of large language models (LLMs) are grounded in the source materials provided by the users. Ungrounded material refers to instances where the LLMs produce information that is non-factual or inaccurate from what was present in the source materials. Requires [document embedding and formatting](./content-filter-document-embedding.md). |
 | [Protected Material for Text](/azure/ai-services/openai/concepts/content-filter-protected-material)<sup>1</sup> | Protected material text describes known text content (for example, song lyrics, articles, recipes, and selected web content) that can be outputted by large language models.|
 | [Protected Material for Code](/azure/ai-services/openai/concepts/content-filter-protected-material) | Protected material code describes source code that matches a set of source code from public repositories, which can be outputted by large language models without proper citation of source repositories.|
+| [Personally identifiable information (PII)](/azure/ai-services/openai/concepts/content-filter-personal-information) | Personally identifiable information (PII) refers to any information that can be used to identify a particular individual. PII detection analyzes text content in LLM completions and flags or blocks output that contains PII. |
+
 
 <sup>1</sup> If you're an owner of text material and want to submit text content for protection, [file a request](https://aka.ms/protectedmaterialsform).
 
diff --git a/articles/ai-foundry/openai/how-to/content-filters.md b/articles/ai-foundry/openai/how-to/content-filters.md
@@ -48,6 +48,8 @@ You can configure the following filter categories in addition to the default har
 | Protected material - code |GA| On | Completion | Filters protected code or gets the example citation and license information in annotations for code snippets that match any public code sources, powered by GitHub Copilot. For more information about consuming annotations, see the [Protected material concepts guide](/azure/ai-services/openai/concepts/content-filter-protected-material) |
 | Protected material - text | GA| On | Completion | Identifies and blocks known text content from being displayed in the model output (for example, song lyrics, recipes, and selected web content).  |
 | Groundedness | Preview |Off | Completion |Detects whether the text responses of large language models (LLMs) are grounded in the source materials provided by the users. Ungroundedness refers to instances where the LLMs produce information that is non-factual or inaccurate from what was present in the source materials. Requires: [Document embedding and formatting](/azure/ai-services/openai/concepts/content-filter?tabs=warning%2Cuser-prompt%2Cpython-new#embedding-documents-in-your-prompt).|
+| Personally identifiable information (PII) | Preview | Off | Completion | Filters information that can be used to identify a particular individual, such as a name, address, phone number, email address, social security number, driver's license number, passport number, or similar information. |
+
 
 [!INCLUDE [create-content-filter](../../../ai-foundry/includes/create-content-filter.md)]
 
diff --git a/articles/ai-foundry/openai/whats-new.md b/articles/ai-foundry/openai/whats-new.md
@@ -20,6 +20,10 @@ This article provides a summary of the latest releases and major documentation u
 
 ## June 2025
 
+### PII detection content filter
+
+Personally identifiable information (PII) detection is now available as a built-in content filter. This feature allows you to identify and block sensitive information in LLM outputs, enhancing data privacy. For more information, see the [PII detection](./concepts/content-filter-personal-information.md) documentation.
+
 ### codex-mini & o3-pro models released
 
 - `codex-mini` and `o3-pro` are now available. To learn more, see the [getting started with reasoning models page](./how-to/reasoning.md)
diff --git a/articles/ai-foundry/toc.yml b/articles/ai-foundry/toc.yml
@@ -237,6 +237,8 @@ items:
             href: ./openai/concepts/content-filter-groundedness.md
           - name: Protected material detection
             href: ./openai/concepts/content-filter-protected-material.md
+          - name: Personally identifiable information (PII) detection
+            href: ./openai/concepts/content-filter-personal-information.md
           - name: Content filter configurability
             href: ./openai/concepts/content-filter-configurability.md
           - name: Content filter annotations