articles/ai-services/document-intelligence/faq.yml
13 additions & 13 deletions
@@ -30,15 +30,15 @@ sections:
 
 There are no changes to pricing. The names Cognitive Services and Applied AI Services continue to be used in Azure billing, cost analysis, price lists, and price APIs.
 
-There are no breaking changes to APIs or client libraries (SDKs). REST APIs and SDK versions 2024-02-29-preview, 2023-10-31-preview, and later are renamed `document intelligence`.
+There are no breaking changes to APIs or client libraries. REST APIs and SDK versions 2024-02-29-preview, 2023-10-31-preview, and going forward are renamed `document intelligence`.
 
 Some platforms are still awaiting the renaming update. In Microsoft documentation, all mentions of Form Recognizer and Document Intelligence refer to the same Azure service.
 
 - question: |
 How is Document Intelligence related to document generative AI?
 answer: |
 
-You can use a document generative AI solution to chat with your documents, generate captivating content from those documents, and access Azure OpenAI Service models on your data. With Azure AI Document Intelligence and Azure OpenAI combined, you can build an enterprise application to seamlessly interact with your documents by using natural languages, easily find answers and gain valuable insights, and generate new and engaging content from your existing documents. Find more details in the [technical community blog](https://techcommunity.microsoft.com/t5/azure-ai-services-blog/document-generative-ai-the-power-of-azure-ai-document/ba-p/3875015).
+You can use a document generative AI solution to chat with your documents, generate captivating content from those documents, and access Azure OpenAI Service models on your data. With Azure AI Document Intelligence and Azure OpenAI combined, you can build an enterprise application to seamlessly interact with your documents. Find more details in the [technical community blog](https://techcommunity.microsoft.com/t5/azure-ai-services-blog/document-generative-ai-the-power-of-azure-ai-document/ba-p/3875015).
 
 - question: |
 How is Document Intelligence related to retrieval-augmented generation?
@@ -158,7 +158,7 @@ sections:
 What is a bounding box?
 answer: |
 
-A bounding box (`polygon` in v3.0 and later versions) is an abstract rectangle that surrounds text elements in a document or form used as a reference point for object detection.
+A bounding box (`polygon` in v3.0 and later versions) is an abstract rectangle that surrounds text elements in a document used as a reference point for object detection.
 
 The bounding box specifies position by using an x and y coordinate plane presented in an array of four numerical pairs. Each pair represents a corner of the box in the following order: upper left, upper right, lower right, lower left.
 
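Reviewer note: the corner ordering described in this hunk can be illustrated with a minimal Python sketch. It assumes the four pairs arrive as one flat list of eight numbers (`x1, y1, ..., x4, y4`); `Point`, `CORNER_ORDER`, and `polygon_to_corners` are hypothetical names chosen for illustration, not part of any Document Intelligence client library.

```python
from typing import NamedTuple, Sequence


class Point(NamedTuple):
    x: float
    y: float


# Corner order as stated in the FAQ text: upper left, upper right,
# lower right, lower left.
CORNER_ORDER = ("upper_left", "upper_right", "lower_right", "lower_left")


def polygon_to_corners(polygon: Sequence[float]) -> dict[str, Point]:
    """Map a flat 8-number polygon onto named corners (hypothetical helper)."""
    if len(polygon) != 8:
        raise ValueError("expected four (x, y) pairs, i.e. eight numbers")
    # Even indexes hold x values, odd indexes hold y values.
    pairs = zip(polygon[0::2], polygon[1::2])
    return {name: Point(x, y) for name, (x, y) in zip(CORNER_ORDER, pairs)}


# Illustrative values only; units depend on the analyzed document.
corners = polygon_to_corners([1.0, 1.0, 4.0, 1.0, 4.0, 2.0, 1.0, 2.0])
width = corners["upper_right"].x - corners["upper_left"].x
height = corners["lower_left"].y - corners["upper_left"].y
```

For an axis-aligned box, the width and height fall out of the upper-left corner and its two neighbours; a rotated box would need the distance between corners instead.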
@@ -217,7 +217,7 @@ sections:
 I have multiple pages in a document. Why are only two pages analyzed in Document Intelligence Studio?
 answer: |
 
-For free-tier (F0) resources, only the first two pages are analyzed whether you're using Document Intelligence Studio, the REST API, or SDKs.
+For free-tier (F0) resources, only the first two pages are analyzed whether you're using Document Intelligence Studio, the REST API, or client libraries.
 
 In Document Intelligence Studio, select the **Settings** (gear) button, select the **Resources** tab, and check the price tier that you're using to analyze the documents. If you want to analyze all pages in a document, change to a paid (S0) resource.
 
@@ -262,9 +262,9 @@ sections:
 
 "Yes. Document Intelligence Studio has separate URL endpoints for sovereign cloud regions:"
 
-- "URL for the Azure US Government cloud (Azure Fairfax): [Document Intelligence Studio US Government](https://formrecognizer.appliedai.azure.us/studio)".
+- "URL for the Azure US Government cloud (Azure Fairfax): [Document Intelligence Studio US Government](https://formrecognizer.appliedai.azure.us/studio)."
 
-- "URL Microsoft Azure operated by 21Vianet (Azure in China): [Document Intelligence Studio China](https://formrecognizer.appliedai.azure.cn/studio)."
+- "URL Microsoft Azure operated by 21Vianet (Azure China): [Document Intelligence Studio China](https://formrecognizer.appliedai.azure.cn/studio)."
-Where can I find the supported API version for the latest programming language SDKs?
+Where can I find the supported API version for the latest programming language client libraries?
 answer: |
 
 This table provides links to the latest SDK versions and shows the relationship between supported Document Intelligence SDK and API versions:
@@ -325,7 +325,7 @@ sections:
 How can I specify a range of pages to be analyzed in a document?
 answer: |
 
-Use the `pages` parameter (supported in v2.1, v3.0, and later versions of the REST API) to specify pages for multiple-page PDF and TIFF documents. Accepted input includes the following ranges:
+Use the `pages` parameter (supported in v2.1, v3.0, and later versions of the REST API) and specify pages for multiple-page PDF and TIFF documents. Accepted input includes the following ranges:
 
 - Single pages. For example, if you specify `1, 2`, pages 1 and 2 are processed.
 - Finite ranges. For example, if you specify `2-5`, pages 2 to 5 are processed.
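Reviewer note: the two `pages` forms visible in this hunk (single pages and finite ranges) can be sketched in Python as follows. `expand_pages` is a hypothetical helper written for illustration. It is not how the service parses the parameter, and it deliberately handles only the two forms shown here.

```python
def expand_pages(pages: str) -> list[int]:
    """Expand a `pages` string such as "1, 2" or "2-5" into page numbers.

    Hypothetical helper: only single pages and finite ranges are handled.
    Any other form raises ValueError.
    """
    selected: set[int] = set()
    for part in pages.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            # Finite range, for example "2-5" covers pages 2, 3, 4, and 5.
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
            if start > end:
                raise ValueError(f"range start exceeds end: {part!r}")
            selected.update(range(start, end + 1))
        else:
            # Single page, for example "1".
            selected.add(int(part))
    return sorted(selected)


single_pages = expand_pages("1, 2")
finite_range = expand_pages("2-5")
```

A helper like this could be used on the client side to check a page selection before sending the string unchanged as the `pages` value.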
@@ -368,7 +368,7 @@ sections:
 
 - When the service analyzes Microsoft Word and HTML files that the read and layout models support, it counts pages in blocks of 3,000 characters each. For example, if your document contains 7,000 characters, the two pages with 3,000 characters each and one page with 1,000 characters add up to a total of three pages.
 
-- When you're using the read or layout model to analyze Microsoft Word, Excel, PowerPoint, and HTML files, embedded or linked images aren't supported. So the service doesn't count them as added images.
+- The read and layout models don't support analysis of embedded or linked images in Microsoft Word, Excel, PowerPoint, and HTML files. Therefore, service doesn't count them as added images.
 
 - Training a custom model is always free with Document Intelligence. You're charged only when the service uses a model to analyze a document.
 
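Reviewer note: the 3,000-character page rule quoted in this hunk amounts to a ceiling division. The Python sketch below reproduces the 7,000-character example from the text. `CHARS_PER_PAGE` and `count_pages` are hypothetical names, and treating an empty document as zero pages is an assumption that the source does not address.

```python
CHARS_PER_PAGE = 3_000  # block size stated in the FAQ text


def count_pages(char_count: int) -> int:
    """Count pages in blocks of CHARS_PER_PAGE characters (hypothetical helper).

    A partially filled final block still counts as one page.
    """
    if char_count < 0:
        raise ValueError("char_count must not be negative")
    # Ceiling division without floating point: -(-a // b) == ceil(a / b).
    return -(-char_count // CHARS_PER_PAGE)


# The FAQ example: 7,000 characters = 3,000 + 3,000 + 1,000.
pages_for_example = count_pages(7_000)
```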
@@ -449,7 +449,7 @@ sections:
 
 Document Intelligence doesn't have an explicit retrain operation. Each train operation generates a new model.
 
-If you find that your model needs retraining, add more samples to your training dataset and train a new model.
+If you find that your model needs to retrain, add more samples to your training dataset and train a new model.
 
 - question: |
 How many custom models can I compose into a single custom model?
@@ -554,7 +554,7 @@ sections:
 
 [Disconnected containers](../../ai-services/containers/disconnected-containers.md) enable you to use APIs that are disconnected from the internet. [Billing information](../../ai-services/containers/disconnected-container-faq.yml#how-does-billing-work) isn't sent via the internet. Instead, you're charged based on a purchased commitment tier. Currently, disconnected container usage is available for Document Intelligence custom and invoice models.
 
-The model capabilities provided in connected and disconnected containers are the same and are supported by Document Intelligence v2.1.
+The Document Intelligence v2.1 model capabilities for connected and disconnected containers are the same.
 
 - question: |
 What data do connected containers send to the cloud?
@@ -565,7 +565,7 @@ sections:
 For an example of the information that connected containers send to Microsoft for billing, see the [Azure AI container FAQ](../../ai-services/containers/disconnected-container-faq.yml#how-does-billing-work).
 
 - question: |
-Why am I receiving the error "Container isn't in a valid state. Subscription validation failed with status 'OutOfQuota' API key is out of quota"?
+Why am I receiving the error *Container isn't in a valid state. Subscription validation failed with status 'OutOfQuota' API key is out of quota*?
 answer: |
 
 Document Intelligence connected containers send billing information to Azure by using a Document Intelligence resource on your Azure account. You could get this message if the containers can't communicate with the billing endpoint.
@@ -624,7 +624,7 @@ sections:
 - question: |
 Where can I find more solutions to my Azure AI Document Intelligence questions?
 answer: |
-[Microsoft Q&A](/answers/topics/azure-form-recognizer.html) is the home for technical questions and answers at Microsoft. You can filter queries that are specific to Document Intelligence.
+[Microsoft Q & A](/answers/topics/azure-form-recognizer.html) is the home for technical questions and answers at Microsoft. You can filter queries that are specific to Document Intelligence.
 
 - question: |
 What should I do if the service doesn't recognize specific text, or recognizes it incorrectly, when I'm labeling documents?
@@ -50,43 +50,43 @@ Configure each of the resources to ensure that the resources can communicate wit
 
 * Configure the Document Intelligence Studio to use the newly created Document Intelligence resource by accessing the settings page and selecting the resource.
 
-* Validate that the configuration works by selecting the Read API and analyzing a sample document. If the resource was configured correctly, the request successfully completes.
+* Ensure and validate that the configuration works by selecting the Read API and analyzing a sample document. If the resource was configured correctly, the request successfully completes.
 
 * Add a training dataset to a container in the Storage account you created.
 
 * Select the custom model tile to create a custom project. Ensure that you select the same Document Intelligence resource and the storage account you created in the previous step.
 
 * Select the container with the training dataset you uploaded in the previous step. Ensure that if the training dataset is within a folder, the folder path is set appropriately.
 
-* If you have the required permissions, the Studio sets the CORS setting required to access the storage account. If you don't have the permissions, you need to ensure that the CORS settings are configured on the Storage account before you can proceed.
+* Ensure that you have the required permissions, the Studio sets the CORS setting required to access the storage account. If you don't have the permissions, you need to make certain that the CORS settings are configured on the Storage account before you can proceed.
 
-* Validate that the Studio is configured to access your training data, if you can see your documents in the labeling experience, all the required connections are established.
+* Ensure and validate that the Studio is configured to access your training data. If you can see your documents in the labeling experience, all the required connections are established.
 
 You now have a working implementation of all the components needed to build a Document Intelligence solution with the default security model:
 
 :::image type="content" source="media/managed-identities/default-config-di.png" alt-text="Screenshot of default security configuration.":::
 
 Next, complete the following steps:
 
-* Setup managed identity on the Document Intelligence resource.
+* Configure managed identity on the Document Intelligence resource.
 
 * Secure the storage account to restrict traffic from only specific virtual networks and IP addresses.
 
 * Configure the Document Intelligence managed identity to communicate with the storage account.
 
-* Disable public access to the Document Intelligence resource and create a private endpoint to make it accessible from only specific virtual networks and IP addresses.
+* Disable public access to the Document Intelligence resource and create a private endpoint. Your resource is then only accessible from specific virtual networks and IP addresses.
 
 * Add a private endpoint for the storage account in a selected virtual network.
 
-* Validate that you can train models and analyze documents from within the virtual network.
+* Ensure and validate that you can train models and analyze documents from within the virtual network.
 
 ## Setup managed identity for Document Intelligence
 
 Navigate to the Document Intelligence resource in the Azure portal and select the **Identity** tab. Toggle the **System assigned** managed identity to **On** and save the changes:
 
 :::image type="content" source="media/managed-identities/v2-fr-mi.png" alt-text="Screenshot of configure managed identity.":::
 
-## Secure the Storage account to limit traffic
+## Secure the Storage account
 
 Start configuring secure communications by navigating to the **Networking** tab on your **Storage account** in the Azure portal.
 
@@ -141,7 +141,7 @@ Great! You configured your Document Intelligence resource to use a managed ident
 > When you try the [Document Intelligence Studio](https://formrecognizer.appliedai.azure.com/studio), you'll see the READ API and other prebuilt models don't require storage access to process documents. However, training a custom model requires additional configuration because the Studio can't directly communicate with a storage account.
 > You can enable storage access by selecting **Add your client IP address** from the **Networking** tab of the storage account to configure your machine to access the storage account via IP allowlisting.
 
-## Configure private endpoints for access from VNETs
+## Configure private endpoints for access from `VNET`s
 
 > [!NOTE]
 >
@@ -252,7 +252,7 @@ To validate your deployment, you can deploy a virtual machine (VM) to the virtua
 
 1. Configure a [Data Science VM](https://azuremarketplace.microsoft.com/marketplace/apps/microsoft-dsvm.dsvm-win-2019?tab=Overview) in the virtual network.
 
-1. Remotely connect into the VM from your desktop to launch a browser session to access Document Intelligence Studio.
+1. Remotely connect into the VM from your desktop and launch a browser session that accesses Document Intelligence Studio.
 
 1. Analyze requests and the training operations should now work successfully.
 
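Reviewer note: alongside the browser check in the steps above, a quick script run on the VM could confirm that the resource's hostname resolves to a private address. This is an assumption about what one would typically expect once a private endpoint and its DNS configuration are in place. It is a minimal sketch, not an official validation procedure. `resolves_to_private_ip` is a hypothetical helper, and the hostname in the usage comment is a placeholder.

```python
import ipaddress
import socket
from typing import Callable


def resolves_to_private_ip(
    hostname: str,
    resolve: Callable[[str], str] = socket.gethostbyname,
) -> bool:
    """Return True if `hostname` resolves to a private IP address.

    Hypothetical helper: `resolve` is injectable so the check can be
    exercised without network access.
    """
    address = ipaddress.ip_address(resolve(hostname))
    return address.is_private


# Usage from a VM inside the virtual network (placeholder hostname):
#   resolves_to_private_ip("<your-resource-name>.cognitiveservices.azure.com")
```

A `False` result from inside the virtual network would suggest looking at the private DNS configuration before debugging the Studio itself.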
@@ -267,13 +267,13 @@ That's it! You can now configure secure access for your Document Intelligence re
-1. Make sure the client computer can access Document Intelligence resource and storage account, either they are in the same VNET, or client IP address is allowed in **Networking > Firewalls and virtual networks** setting page of both Document Intelligence resource and storage account.
+1. Make sure the client computer can access Document Intelligence resource and storage account, either they are in the same `VNET`, or client IP address is allowed in **Networking > Firewalls and virtual networks** setting page of both Document Intelligence resource and storage account.
 
 ***AuthorizationFailure**:
 
 :::image type="content" source="media/managed-identities/auth-failure.png" alt-text="Screenshot of authorization failure error.":::
 
-**Resolution**: Make sure the client computer can access Document Intelligence resource and storage account, either they are in the same VNET, or client IP address is allowed in **Networking > Firewalls and virtual networks** setting page of both Document Intelligence resource and storage account.
+**Resolution**: Make sure the client computer can access Document Intelligence resource and storage account, either they are in the same `VNET`, or client IP address is allowed in **Networking > Firewalls and virtual networks** setting page of both Document Intelligence resource and storage account.