
Commit 272b6c3

committed
Content safety update
Updated model catalog content safety page and created include file for content safety harms and categories
1 parent e88aa0c commit 272b6c3


2 files changed: +40 −2 lines changed

Lines changed: 30 additions & 0 deletions
@@ -0,0 +1,30 @@
---
title: include file
description: include file
ms.reviewer: osiotugo
ms.author: osiotugo
ms.service: azure-ai-studio
ms.topic: include
ms.date: 01/28/2025
ms.custom: include
---

## Understand harm categories

### Harm categories

| Category | Description | API term |
| --------- | ------------------- | --- |
| Hate and Fairness | Hate and fairness harms refer to any content that attacks or uses discriminatory language with reference to a person or identity group, based on certain differentiating attributes of these groups. <br><br>This includes, but is not limited to:<ul><li>Race, ethnicity, nationality</li><li>Gender identity groups and expression</li><li>Sexual orientation</li><li>Religion</li><li>Personal appearance and body size</li><li>Disability status</li><li>Harassment and bullying</li></ul> | `Hate` |
| Sexual | Sexual describes language related to anatomical organs and genitals, romantic relationships, and sexual acts, including acts portrayed in erotic or affectionate terms and acts portrayed as an assault or a forced sexual act against one's will. <br><br>This includes, but is not limited to:<ul><li>Vulgar content</li><li>Prostitution</li><li>Nudity and pornography</li><li>Abuse</li><li>Child exploitation, child abuse, child grooming</li></ul> | `Sexual` |
| Violence | Violence describes language related to physical actions intended to hurt, injure, damage, or kill someone or something, and describes weapons, guns, and related entities. <br><br>This includes, but isn't limited to: <ul><li>Weapons</li><li>Bullying and intimidation</li><li>Terrorist and violent extremism</li><li>Stalking</li></ul> | `Violence` |
| Self-Harm | Self-harm describes language related to physical actions intended to purposely hurt, injure, or damage one's body, or to kill oneself. <br><br>This includes, but isn't limited to: <ul><li>Eating disorders</li><li>Bullying and intimidation</li></ul> | `SelfHarm` |

### Severity levels

| Level | Description |
| --- | --- |
| Safe | Content might be related to the violence, self-harm, sexual, or hate categories, but the terms are used in general, journalistic, scientific, medical, or similar professional contexts that are appropriate for most audiences. |
| Low | Content that expresses prejudiced, judgmental, or opinionated views; includes offensive use of language or stereotyping; or explores a fictional world (for example, gaming or literature) with depictions at low intensity. |
| Medium | Content that uses offensive, insulting, mocking, intimidating, or demeaning language toward specific identity groups; includes depictions of seeking and executing harmful instructions, fantasies, glorification, or promotion of harm at medium intensity. |
| High | Content that displays explicit and severe harmful instructions, actions, damage, or abuse; includes endorsement, glorification, or promotion of severe harmful acts, extreme or illegal forms of harm, radicalization, or nonconsensual power exchange or abuse. |
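The two tables above define a small, fixed vocabulary. As a minimal sketch, the category-to-API-term mapping below comes directly from the table, while the numeric severity values (0, 2, 4, 6 for the four levels) are an assumption about the service's trimmed severity scale that you should verify against the Azure AI Content Safety documentation:

```python
# Mapping from the documented harm categories to their API terms (from the table above).
HARM_CATEGORY_API_TERMS = {
    "Hate and Fairness": "Hate",
    "Sexual": "Sexual",
    "Violence": "Violence",
    "Self-Harm": "SelfHarm",
}

# Assumption: the trimmed severity scale maps the four documented levels
# to the numeric values 0, 2, 4, and 6.
SEVERITY_LEVELS = {0: "Safe", 2: "Low", 4: "Medium", 6: "High"}

def severity_label(score: int) -> str:
    """Return the severity label for a numeric severity score,
    rounding down to the nearest documented level."""
    for level in sorted(SEVERITY_LEVELS, reverse=True):
        if score >= level:
            return SEVERITY_LEVELS[level]
    return "Safe"
```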

articles/ai-studio/how-to/model-catalog-content-safety.md

Lines changed: 10 additions & 2 deletions
@@ -20,9 +20,9 @@ In this article, learn about content safety capabilities for models from the mod

## Content filter defaults

- Azure AI uses a default configuration of [Azure AI Content Safety](/azure/ai-services/content-safety/overview) content filters that detect harmful content across four categories hate, self-harm, sexual, and violence for models deployed via serverless APIs. To learn more about content filtering (preview), see [Harm categories in Azure AI Content Safety](/azure/ai-services/content-safety/concepts/harm-categories).
+ Azure AI uses a default configuration of [Azure AI Content Safety](/azure/ai-services/content-safety/overview) content filters that detect harmful content across four categories: hate and fairness, self-harm, sexual, and violence, for models deployed via serverless APIs. To learn more about content filtering (preview), see [Harm categories in Azure AI Content Safety](/azure/ai-services/content-safety/concepts/harm-categories).

- The default content filtering configuration for text models is set to filter at the medium severity threshold, filtering any detected content at this level or higher. For image models, the default content filtering configuration is set at the low configuration threshold, filtering at this level or higher. Models deployed using the [Azure AI model inference service]() can create configurable filters by clicking the **Content filters** tab within the **Safety + security** page.
+ The default content filtering configuration for text models filters at the medium severity threshold, catching any detected content at that level or higher. For image models, the default configuration filters at the low severity threshold, catching content at that level or higher. For models deployed using the [Azure AI model inference service](/articles/ai-foundry/model-inference/how-to/configure-content-filters.md), you can configure filters on the **Content filters** tab of the **Safety + security** page.

> [!TIP]
> Content filtering (preview) is not available for certain model types that are deployed via serverless APIs. These model types include embedding models and time series models.
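The threshold behavior described above (filter at a given severity level or higher) reduces to a simple comparison. A minimal sketch, assuming numeric severity scores where higher means more severe; the function name and the 0/2/4/6 scale are illustrative, not part of the service API:

```python
# Illustrative thresholds mirroring the configuration levels described above.
# Assumption: severities are reported on the trimmed 0, 2, 4, 6 scale.
THRESHOLDS = {"low": 2, "medium": 4, "high": 6}

def is_blocked(detected_severity: int, threshold: str = "medium") -> bool:
    """Block content whose detected severity is at the threshold or higher."""
    return detected_severity >= THRESHOLDS[threshold]
```

Under this sketch, a text model at the default medium threshold blocks severity 4 and 6 but passes severity 0 and 2, while an image model at the default low threshold also blocks severity 2.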
@@ -34,6 +34,14 @@ Content filtering (preview) occurs synchronously as the service processes prompt
Suppose you decide to use an API other than the [Azure AI Model Inference API](/azure/ai-studio/reference/reference-model-inference-api) to work with a model that is deployed via a serverless API. In such a situation, content filtering (preview) isn't enabled unless you implement it separately by using Azure AI Content Safety. To get started with Azure AI Content Safety, see [Quickstart: Analyze text content](/azure/ai-services/content-safety/quickstart-text). You run a higher risk of exposing users to harmful content if you don't use content filtering (preview) when working with models that are deployed via serverless APIs.
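Implementing filtering separately means calling the Content Safety text analysis operation yourself. The sketch below only builds the request, it sends nothing over the network; the `contentsafety/text:analyze` path and `api-version` value are assumptions to verify against the quickstart linked above:

```python
import json

def build_analyze_text_request(endpoint: str, key: str, text: str):
    """Build the URL, headers, and JSON body for a text analysis call.

    The path and api-version below are assumptions taken from the Azure AI
    Content Safety quickstart; verify them against the linked documentation.
    """
    url = f"{endpoint.rstrip('/')}/contentsafety/text:analyze?api-version=2023-10-01"
    headers = {
        "Ocp-Apim-Subscription-Key": key,  # resource key authentication
        "Content-Type": "application/json",
    }
    body = json.dumps({"text": text})
    return url, headers, body
```

You would pass the returned URL, headers, and body to any HTTP client, then inspect the per-category severities in the response before forwarding the prompt or completion to users.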
+ [!INCLUDE [content-safety-harm-categories](/articles/ai-studio/ai-services/includes/content-safety-harm-categories.md)]

## How charges are calculated

Pricing details are viewable at [Azure AI Content Safety pricing](https://azure.microsoft.com/pricing/details/cognitive-services/content-safety/). Charges are incurred when Azure AI Content Safety validates the prompt or completion. If Azure AI Content Safety blocks the prompt or completion, you're charged for both the evaluation of the content and the inference calls.
+ ## Related content
+
+ - [How to configure content filters (preview) for models in Azure AI services](/articles/ai-foundry/model-inference/how-to/configure-content-filters.md)
+ - [Azure AI Content Safety Overview](/articles/ai-services/content-safety/overview.md)
+ - [Model catalog and collections in Azure AI Foundry portal](/articles/ai-studio/how-to/model-catalog-overview.md)
