Skip to content

Commit 6d62fe2

Browse files
Merge pull request #5462 from PatrickFarley/aoai-build
re-add pii filter docs
2 parents 62591cc + 2667ae4 commit 6d62fe2

File tree

5 files changed

+36
-0
lines changed

5 files changed

+36
-0
lines changed
Lines changed: 26 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,26 @@
1+
---
2+
title: Personally Identifiable Information (PII) Filter
3+
description: Learn about the Personally Identifiable Information (PII) filter for identifying and flagging known personal information in large language model outputs.
4+
author: PatrickFarley
5+
ms.author: pafarley
6+
ms.date: 07/03/2025
7+
ms.topic: conceptual
8+
ms.service: azure-ai-openai
9+
---
10+
11+
# Personally identifiable information (PII) filter
12+
13+
Personally identifiable information (PII) refers to any information that can be used to identify a particular individual, such as a name, address, phone number, email address, social security number, driver's license number, passport number, or similar information.
14+
15+
PII detection is used to prevent PII from being exposed or shared, protecting users from identity theft, financial fraud, or other types of privacy violations.
16+
17+
In the context of large language models (LLMs), PII detection involves analyzing text content in LLM completions. When PII has been identified, it can be flagged for further review, or the output can be blocked. The PII filter scans the output of LLMs to identify and flag known personal information. It's designed to help organizations prevent the generation of content that closely matches sensitive personal information.
18+
19+
20+
## PII types
21+
22+
There are many different types of PII, and you can specify which types you want to filter. The set of PII types that can be detected by the filter matches the set that's defined in the [Azure AI Language docs](/azure/ai-services/language-service/personally-identifiable-information/concepts/entity-categories).
23+
24+
## Filtering modes
25+
26+
The PII filter can be configured to operate in two modes. **Annotate** mode flags PII that's returned in the model output. **Annotate and Block** mode blocks the entire output if PII is detected. The filtering mode can be set for each PII category individually.

articles/ai-foundry/openai/concepts/content-filter.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -47,6 +47,8 @@ The following table summarizes the risk categories supported by Azure OpenAI's c
4747
| [Groundedness](/azure/ai-services/openai/concepts/content-filter-groundedness)<sup>2</sup> | Groundedness detection flags whether the text responses of large language models (LLMs) are grounded in the source materials provided by the users. Ungrounded material refers to instances where the LLMs produce information that is non-factual or inaccurate from what was present in the source materials. Requires [document embedding and formatting](./content-filter-document-embedding.md). |
4848
| [Protected Material for Text](/azure/ai-services/openai/concepts/content-filter-protected-material)<sup>1</sup> | Protected material text describes known text content (for example, song lyrics, articles, recipes, and selected web content) that can be outputted by large language models.|
4949
| [Protected Material for Code](/azure/ai-services/openai/concepts/content-filter-protected-material) | Protected material code describes source code that matches a set of source code from public repositories, which can be outputted by large language models without proper citation of source repositories.|
50+
| [Personally identifiable information (PII)](/azure/ai-services/openai/concepts/content-filter-personal-information) | Personally identifiable information (PII) refers to any information that can be used to identify a particular individual. PII detection involves analyzing text content in LLM completions and filtering any PII that was returned. |
51+
5052

5153
<sup>1</sup> If you're an owner of text material and want to submit text content for protection, [file a request](https://aka.ms/protectedmaterialsform).
5254

articles/ai-foundry/openai/how-to/content-filters.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -48,6 +48,8 @@ You can configure the following filter categories in addition to the default har
4848
| Protected material - code |GA| On | Completion | Filters protected code or gets the example citation and license information in annotations for code snippets that match any public code sources, powered by GitHub Copilot. For more information about consuming annotations, see the [Protected material concepts guide](/azure/ai-services/openai/concepts/content-filter-protected-material) |
4949
| Protected material - text | GA| On | Completion | Identifies and blocks known text content from being displayed in the model output (for example, song lyrics, recipes, and selected web content). |
5050
| Groundedness | Preview |Off | Completion |Detects whether the text responses of large language models (LLMs) are grounded in the source materials provided by the users. Ungroundedness refers to instances where the LLMs produce information that is non-factual or inaccurate from what was present in the source materials. Requires: [Document embedding and formatting](/azure/ai-services/openai/concepts/content-filter?tabs=warning%2Cuser-prompt%2Cpython-new#embedding-documents-in-your-prompt).|
51+
| Personally identifiable information (PII) | Preview | Off | Completion | Filters information that can be used to identify a particular individual, such as a name, address, phone number, email address, social security number, driver's license number, passport number, or similar information. |
52+
5153

5254
[!INCLUDE [create-content-filter](../../../ai-foundry/includes/create-content-filter.md)]
5355

articles/ai-foundry/openai/whats-new.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -20,6 +20,10 @@ This article provides a summary of the latest releases and major documentation u
2020

2121
## June 2025
2222

23+
### PII detection content filter
24+
25+
Personally identifiable information (PII) detection is now available as a built-in content filter. This feature allows you to identify and block sensitive information in LLM outputs, enhancing data privacy. For more information, see the [PII detection](./concepts/content-filter-personal-information.md) documentation.
26+
2327
### codex-mini & o3-pro models released
2428

2529
- `codex-mini` and `o3-pro` are now available. To learn more, see the [getting started with reasoning models page](./how-to/reasoning.md)

articles/ai-foundry/toc.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -237,6 +237,8 @@ items:
237237
href: ./openai/concepts/content-filter-groundedness.md
238238
- name: Protected material detection
239239
href: ./openai/concepts/content-filter-protected-material.md
240+
- name: Personally identifiable information (PII) detection
241+
href: ./openai/concepts/content-filter-personal-information.md
240242
- name: Content filter configurability
241243
href: ./openai/concepts/content-filter-configurability.md
242244
- name: Content filter annotations

0 commit comments

Comments
 (0)