Skip to content

Commit fdb70e1

Browse files
committed
ready for review
1 parent da6c284 commit fdb70e1

File tree

15 files changed

+854
-564
lines changed

15 files changed

+854
-564
lines changed

articles/ai-services/.openpublishing.redirection.ai-services.json

Lines changed: 15 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -814,6 +814,21 @@
814814
"source_path_from_root": "/articles/ai-services/ai-services-and-ecosystem.md",
815815
"redirect_url": "/azure/ai-services/what-are-ai-services",
816816
"redirect_document_id": false
817+
},
818+
{
819+
"source_path_from_root": "/articles/ai-services/language-service/personally-identifiable-information/how-to-call.md",
820+
"redirect_url": "/azure/ai-services/language-service/personally-identifiable-information/how-to/redact-text-pii",
821+
"redirect_document_id": true
822+
},
823+
{
824+
"source_path_from_root": "/articles/ai-services/language-service/personally-identifiable-information/how-to-call-for-conversations.md",
825+
"redirect_url": "/azure/ai-services/language-service/personally-identifiable-information/how-to/redact-conversation-pii",
826+
"redirect_document_id": true
827+
},
828+
{
829+
"source_path_from_root": "/articles/ai-services/language-service/native-document-support/use-native-documents.md",
830+
"redirect_url": "/azure/ai-services/language-service/native-document-support/overview",
831+
"redirect_document_id": true
817832
}
818833
]
819834
}

articles/ai-services/language-service/native-document-support/managed-identities.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@ ms.date: 11/21/2024
1212

1313
# Managed identities for Language resources
1414

15-
Managed identities for Azure resources are service principals that create a Microsoft Entra identity and specific permissions for Azure managed resources. Managed identities are a safer way to grant access to storage data and replace the requirement for you to include shared access signature tokens (SAS) with your [source and target container URLs](use-native-documents.md#create-azure-blob-storage-containers).
15+
Managed identities for Azure resources are service principals that create a Microsoft Entra identity and specific permissions for Azure managed resources. Managed identities are a safer way to grant access to storage data and replace the requirement for you to include shared access signature tokens (SAS) with your [source and target container URLs](overview.md#create-azure-blob-storage-containers).
1616

1717
:::image type="content" source="media/managed-identity-flow.png" alt-text="Screenshot of managed identity flow (RBAC).":::
1818

@@ -24,7 +24,7 @@ Managed identities for Azure resources are service principals that create a Micr
2424

2525
> [!IMPORTANT]
2626
>
27-
> * When using managed identities, don't include a SAS token URL with your HTTP requests—your requests will fail. Using managed identities replaces the requirement for you to include shared access signature tokens (SAS) with your [source and target container URLs](use-native-documents.md#create-azure-blob-storage-containers).
27+
> * When using managed identities, don't include a SAS token URL with your HTTP requests—your requests will fail. Using managed identities replaces the requirement for you to include shared access signature tokens (SAS) with your [source and target container URLs](overview.md#create-azure-blob-storage-containers).
2828
>
2929
> * To use managed identities for Language operations, you must [create your Language resource](https://ms.portal.azure.com/#create/Microsoft.CognitiveServicesTextAnalytics) in a specific geographic Azure region such as **East US**. If your Language resource region is set to **Global**, then you can't use managed identity authentication. You can, however, still use [Shared Access Signature (SAS) tokens](shared-access-signatures.md).
3030
>
@@ -135,4 +135,4 @@ You must grant the Language resource access to your storage account before it ca
135135
## Next steps
136136

137137
> [!div class="nextstepaction"]
138-
> [Get started with native document support](use-native-documents.md#include-native-documents-with-an-http-request)
138+
> [Get started with native document support](overview.md#include-native-documents-with-an-http-request)
Lines changed: 76 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,76 @@
1+
---
2+
title: Native document support for Azure AI Language (preview)
3+
titleSuffix: Azure AI services
4+
description: How to use native document with Azure AI Languages Personally Identifiable Information and Summarization capabilities.
5+
author: laujan
6+
manager: nitinme
7+
ms.service: azure-ai-language
8+
ms.custom:
9+
- ignite-2024
10+
ms.topic: how-to
11+
ms.date: 02/19/2025
12+
ms.author: lajanuar
13+
---
14+
15+
<!-- markdownlint-disable MD033 -->
16+
<!-- markdownlint-disable MD051 -->
17+
<!-- markdownlint-disable MD024 -->
18+
<!-- markdownlint-disable MD036 -->
19+
<!-- markdownlint-disable MD049 -->
20+
<!-- markdownlint-disable MD001 -->
21+
22+
# Native document support for Azure AI Language (preview)
23+
24+
> [!IMPORTANT]
25+
>
26+
> * Azure AI Language public preview releases provide early access to features that are in active development.
27+
> * Features, approaches, and processes can change, before General Availability (GA), based on user feedback.
28+
29+
Azure AI Language is a cloud-based service that applies Natural Language Processing (NLP) features to text-based data. The native document support capability enables you to send API requests asynchronously, using an HTTP POST request body to send your data and HTTP GET request query string to retrieve the status results. Your processed documents are located in your Azure Blob Storage target container.
30+
31+
A native document refers to the file format used to create the original document such as Microsoft Word (docx) or a portable document file (pdf). Native document support eliminates the need for text preprocessing before using Azure AI Language resource capabilities. Currently, native document support is available for the following capabilities:
32+
33+
* [Personally Identifiable Information (PII)](../personally-identifiable-information/overview.md). The PII detection feature can identify, categorize, and redact sensitive information in unstructured text. The `PiiEntityRecognition` API supports native document processing.
34+
35+
* [Document summarization](../summarization/overview.md). Document summarization uses natural language processing to generate extractive (salient sentence extraction) or abstractive (contextual word extraction) summaries for documents. Both `AbstractiveSummarization` and `ExtractiveSummarization` APIs support native document processing.
36+
37+
## Supported document formats
38+
39+
Applications use native file formats to create, save, or open native documents. Currently **PII** and **Document summarization** capabilities supports the following native document formats:
40+
41+
|File type|File extension|Description|
42+
|---------|--------------|-----------|
43+
|Text| `.txt`|An unformatted text document.|
44+
|Adobe PDF| `.pdf`|A portable document file formatted document.|
45+
|Microsoft Word| `.docx`|A Microsoft Word document file.|
46+
47+
## Input guidelines
48+
49+
***Supported file formats***
50+
51+
|Type|support and limitations|
52+
|---|---|
53+
|**PDFs**| Fully scanned PDFs aren't supported.|
54+
|**Text within images**| Digital images with embedded text aren't supported.|
55+
|**Digital tables**| Tables in scanned documents aren't supported.|
56+
57+
***Document Size***
58+
59+
|Attribute|Input limit|
60+
|---|---|
61+
|**Total number of documents per request** |**≤ 20**|
62+
|**Total content size per request**| **≤ 10 MB**|
63+
64+
## Request headers and parameters
65+
66+
|parameter |Description |
67+
|---------|---------|
68+
|`-X POST <endpoint>` | Specifies your Language resource endpoint for accessing the API. |
69+
|`--header Content-Type: application/json` | The content type for sending JSON data. |
70+
|`--header "Ocp-Apim-Subscription-Key:<key>` | Specifies the Language resource key for accessing the API. |
71+
|`-data` | The JSON file containing the data you want to pass with your request. |
72+
73+
## Related content
74+
75+
> [!div class="nextstepaction"]
76+
> [PII detection overview](../personally-identifiable-information/overview.md "Learn more about Personally Identifiable Information detection.") [Document Summarization overview](../summarization/overview.md "Learn more about automatic document summarization.")

articles/ai-services/language-service/native-document-support/shared-access-signatures.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -130,5 +130,5 @@ That's it! You learned how to create SAS tokens to authorize how clients access
130130
## Next steps
131131

132132
> [!div class="nextstepaction"]
133-
> [Learn more about native document support](use-native-documents.md "Learn how to process and analyze native documents.") [Learn more about granting access with SAS ](/azure/storage/common/storage-sas-overview "Grant limited access to Azure Storage resources using shared access SAS.")
133+
> [Learn more about native document support](overview.md "Learn how to process and analyze native documents.") [Learn more about granting access with SAS ](/azure/storage/common/storage-sas-overview "Grant limited access to Azure Storage resources using shared access SAS.")
134134
>

articles/ai-services/language-service/overview.md

Lines changed: 13 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -17,23 +17,23 @@ Azure AI Language is a cloud-based service that provides Natural Language Proces
1717

1818
## Available features
1919

20-
This Language service unifies the following previously available Azure AI services: Text Analytics, QnA Maker, and LUIS. If you need to migrate from these services, see [the migration section](#migrate-from-text-analytics-qna-maker-or-language-understanding-luis) below.
20+
This Language service unifies the following previously available Azure AI services: Text Analytics, QnA Maker, and LUIS. If you need to migrate from these services, see [the migration section](#migrate-from-text-analytics-qna-maker-or-language-understanding-luis).
2121

2222
The Language service also provides several new features as well, which can either be:
2323

24-
* Preconfigured, which means the AI models that the feature uses are not customizable. You just send your data, and use the feature's output in your applications.
25-
* Customizable, which means you'll train an AI model using our tools to fit your data specifically.
24+
* Preconfigured, which means the AI models that the feature uses aren't customizable. You just send your data, and use the feature's output in your applications.
25+
* Customizable, which means you train an AI model using our tools to fit your data specifically.
2626

2727
> [!TIP]
28-
> Unsure which feature to use? See [Which Language service feature should I use?](#which-language-service-feature-should-i-use) to help you decide.
28+
> Unsure which feature to use? See [Which Language service feature should I use](#which-language-service-feature-should-i-use) to help you decide.
2929
30-
[**Language Studio**](./language-studio.md) enables you to use the below service features without needing to write code.
30+
[**Language Studio**](./language-studio.md) enables you to use the following service features without needing to write code.
3131

3232
### Named Entity Recognition (NER)
3333

3434
:::row:::
3535
:::column span="":::
36-
:::image type="content" source="media/studio-examples/named-entity-recognition.png" alt-text="A screenshot of a named entity recognition example." lightbox="media/studio-examples/named-entity-recognition.png":::
36+
:::image type="content" source="media/studio-examples/named-entity-recognition.png" alt-text="A screenshot of a named entity recognition example." lightbox="media/studio-examples/named-entity-recognition.png":::
3737
:::column-end:::
3838
:::column span="":::
3939
[Named entity recognition](./named-entity-recognition/overview.md) is a preconfigured feature that categorizes entities (words or phrases) in unstructured text across several predefined category groups. For example: people, events, places, dates, [and more](./named-entity-recognition/concepts/named-entity-categories.md).
@@ -48,7 +48,7 @@ The Language service also provides several new features as well, which can eithe
4848
:::image type="content" source="media/studio-examples/personal-information-detection.png" alt-text="A screenshot of a PII detection example." lightbox="media/studio-examples/personal-information-detection.png":::
4949
:::column-end:::
5050
:::column span="":::
51-
[PII detection](./personally-identifiable-information/overview.md) is a preconfigured feature that identifies, categorizes, and redacts sensitive information in both [unstructured text documents](./personally-identifiable-information/how-to-call.md), and [conversation transcripts](./personally-identifiable-information/how-to-call-for-conversations.md). For example: phone numbers, email addresses, forms of identification, [and more](./personally-identifiable-information/concepts/entity-categories.md).
51+
[PII detection](./personally-identifiable-information/overview.md) is a preconfigured feature that identifies, categorizes, and redacts sensitive information in [unstructured text documents](./personally-identifiable-information/how-to/redact-text-pii.md), [conversation transcripts](./personally-identifiable-information/how-to/redact-conversation-pii.md), and [native documents](./personally-identifiable-information/how-to/redact-document-pii.md). PII includes phone numbers, email addresses, forms of identification, [and more](./personally-identifiable-information/concepts/entity-categories.md).
5252

5353
:::column-end:::
5454
:::row-end:::
@@ -182,7 +182,7 @@ The Language service also provides several new features as well, which can eithe
182182

183183
## Which Language service feature should I use?
184184

185-
This section will help you decide which Language service feature you should use for your application:
185+
This section helps you decide which Language service feature you should use for your application:
186186

187187
|What do you want to do? |Document format |Your best solution | Is this solution customizable?* |
188188
|---------|---------|---------|---------|
@@ -200,23 +200,23 @@ This section will help you decide which Language service feature you should use
200200
| Predict the intention of user inputs and extract information from them. | Unstructured user inputs | [Conversational language understanding](./conversational-language-understanding/overview.md) ||
201201
| Connect apps from conversational language understanding, LUIS, and question answering. | Unstructured user inputs | [Orchestration workflow](./orchestration-workflow/overview.md) ||
202202

203-
\* If a feature is customizable, you can train an AI model using our tools to fit your data specifically. Otherwise a feature is preconfigured, meaning the AI models it uses cannot be changed. You just send your data, and use the feature's output in your applications.
203+
\* If a feature is customizable, you can train an AI model using our tools to fit your data specifically. Otherwise a feature is preconfigured, meaning the AI models it uses can't be changed. You just send your data, and use the feature's output in your applications.
204204

205205
## Migrate from Text Analytics, QnA Maker, or Language Understanding (LUIS)
206206

207207
Azure AI Language unifies three individual language services in Azure AI services - Text Analytics, QnA Maker, and Language Understanding (LUIS). If you have been using these three services, you can easily migrate to the new Azure AI Language. For instructions see [Migrating to Azure AI Language](concepts/migrate.md).
208208

209209
## Tutorials
210210

211-
After you've had a chance to get started with the Language service, try our tutorials that show you how to solve various scenarios.
211+
After you get started with the Language service quickstarts, try our tutorials that show you how to solve various scenarios.
212212

213213
* [Extract key phrases from text stored in Power BI](key-phrase-extraction/tutorials/integrate-power-bi.md)
214214
* [Use Power Automate to sort information in Microsoft Excel](named-entity-recognition/tutorials/extract-excel-information.md)
215215
* [Use Flask to translate text, analyze sentiment, and synthesize speech](/training/modules/python-flask-build-ai-web-app/)
216216
* [Use Azure AI services in canvas apps](/powerapps/maker/canvas-apps/cognitive-services-api?context=/azure/ai-services/language-service/context/context)
217-
* [Create a FAQ Bot](question-answering/tutorials/bot-service.md)
217+
* [Create an FAQ Bot](question-answering/tutorials/bot-service.md)
218218

219-
## Additional code samples
219+
## Code samples
220220

221221
You can find more code samples on GitHub for the following languages:
222222

@@ -237,7 +237,7 @@ Use Language service containers to deploy API features on-premises. These Docker
237237

238238
## Responsible AI
239239

240-
An AI system includes not only the technology, but also the people who will use it, the people who will be affected by it, and the environment in which it is deployed. Read the following articles to learn about responsible AI use and deployment in your systems:
240+
An AI system includes not only the technology, but also the people who use it, the people affected by it, and the deployment environment. Read the following articles to learn about responsible AI use and deployment in your systems:
241241

242242
* [Transparency note for the Language service](/legal/cognitive-services/text-analytics/transparency-note)
243243
* [Integration and responsible use](/legal/cognitive-services/text-analytics/guidance-integration-responsible-use)

0 commit comments

Comments
 (0)