articles/ai-services/document-intelligence/quickstarts/includes/java-sdk.md
Lines changed: 3 additions & 3 deletions
@@ -6,7 +6,7 @@ author: laujan
 manager: nitinme
 ms.service: azure-ai-document-intelligence
 ms.topic: include
-ms.date: 12/17/2024
+ms.date: 02/10/2025
 ms.author: lajanuar
 ---
 <!-- markdownlint-disable MD025 -->
@@ -39,7 +39,7 @@ In this quickstart, use the following features to analyze and extract data and v
 >[!TIP]
 >
 > * Visual Studio Code offers a **Coding Pack for Java** for Windows and macOS. The coding pack is a bundle of VS Code, the Java Development Kit (JDK), and a collection of suggested extensions by Microsoft. The Coding Pack can also be used to fix an existing development environment.
-> * If you are using VS Code and the Coding Pack For Java, install the [**Gradle for Java**](https://marketplace.visualstudio.com/items?itemName=vscjava.vscode-gradle) extension.
+> * If you're using VS Code and the Coding Pack For Java, install the [**Gradle for Java**](https://marketplace.visualstudio.com/items?itemName=vscjava.vscode-gradle) extension.

 * If you aren't using Visual Studio Code, make sure you have the following installed in your development environment:
@@ -50,7 +50,7 @@ In this quickstart, use the following features to analyze and extract data and v
 * An Azure AI services or Document Intelligence resource. Once you have your Azure subscription, create a [single-service](https://portal.azure.com/#create/Microsoft.CognitiveServicesFormRecognizer) or [multi-service](https://portal.azure.com/#create/Microsoft.CognitiveServicesAIServices) Document Intelligence resource, in the Azure portal, to get your key and endpoint. You can use the free pricing tier (`F0`) to try the service, and upgrade later to a paid tier for production.

 > [!TIP]
-> Create an Azure AI services resource if you plan to access multiple Azure AI services under a single endpoint/key. For Document Intelligence access only, create a Document Intelligence resource. Please note that you'll need a single-service resource if you intend to use [Microsoft Entra authentication](/azure/active-directory/authentication/overview-authentication).
+> Create an Azure AI services resource if you plan to access multiple Azure AI services under a single endpoint/key. For Document Intelligence access only, create a Document Intelligence resource. You need a single-service resource if you intend to use [Microsoft Entra authentication](/azure/active-directory/authentication/overview-authentication).

 * After your resource deploys, select **Go to resource**. You need the key and endpoint from the resource you create to connect your application to the Document Intelligence API. Later, you paste your key and endpoint into the code:
articles/ai-services/document-intelligence/quickstarts/includes/javascript-sdk.md
Lines changed: 2 additions & 2 deletions
@@ -6,7 +6,7 @@ author: laujan
 manager: nitinme
 ms.service: azure-ai-document-intelligence
 ms.topic: include
-ms.date: 12/17/2024
+ms.date: 02/10/2025
 ms.author: lajanuar
 ---
 <!-- markdownlint-disable MD025 -->
@@ -41,7 +41,7 @@ In this quickstart, use the following features to analyze and extract data and v
 * An Azure AI services or Document Intelligence resource. Once you have your Azure subscription, create a [single-service](https://portal.azure.com/#create/Microsoft.CognitiveServicesFormRecognizer) or [multi-service](https://portal.azure.com/#create/Microsoft.CognitiveServicesAIServices) Document Intelligence resource, in the Azure portal, to get your key and endpoint. You can use the free pricing tier (`F0`) to try the service, and upgrade later to a paid tier for production.

 > [!TIP]
-> Create an Azure AI services resource if you plan to access multiple Azure AI services under a single endpoint/key. For Document Intelligence access only, create a Document Intelligence resource. Please note that you'll need a single-service resource if you intend to use [Microsoft Entra authentication](/azure/active-directory/authentication/overview-authentication).
+> Create an Azure AI services resource if you plan to access multiple Azure AI services under a single endpoint/key. For Document Intelligence access only, create a Document Intelligence resource. You need a single-service resource if you intend to use [Microsoft Entra authentication](/azure/active-directory/authentication/overview-authentication).

 * After your resource deploys, select **Go to resource**. You need the key and endpoint from the resource you create to connect your application to the Document Intelligence API. You paste your key and endpoint into the code later in the quickstart:
articles/ai-services/document-intelligence/train/custom-model.md
Lines changed: 12 additions & 12 deletions
@@ -85,19 +85,19 @@ If the language of your documents and extraction scenarios supports custom neura

 * For PDF and TIFF, up to 2,000 pages can be processed (with a free tier subscription, only the first two pages are processed).

-* The file size for analyzing documents is 500 MB for paid (S0) tier and 4 MB for free (F0) tier.
+* The file size for analyzing documents is `500 MB` for paid (S0) tier and `4 MB` for free (F0) tier.

-* Image dimensions must be between 50 x 50 pixels and 10,000 px x 10,000 pixels.
+* Image dimensions must be between 50 x 50 pixels and 10,000 pixels x 10,000 pixels.

 * If your PDFs are password-locked, you must remove the lock before submission.

-* The minimum height of the text to be extracted is 12 pixels for a 1024 x 768 pixel image. This dimension corresponds to about `8`-point text at 150 dots per inch.
+* The minimum height of the text to be extracted is 12 pixels for a 1024 x 768 pixel image. This dimension corresponds to about `8`-point text at `150` dots per inch.

 * For custom model training, the maximum number of pages for training data is 500 for the custom template model and 50,000 for the custom neural model.

-* For custom extraction model training, the total size of training data is 50 MB for template model and 1GB for the neural model.
+* For custom extraction model training, the total size of training data is `50 MB` for template model and `1` GB for the neural model.

-* For custom classification model training, the total size of training data is `1GB` with a maximum of 10,000 pages.
+* For custom classification model training, the total size of training data is `1 GB` with a maximum of 10,000 pages.
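
The analysis limits in the list above (file size per tier and image dimensions) can be checked locally before a document is submitted. The following is a minimal sketch, not part of the service or its SDKs: the function name `check_input`, the constant names, and the choice of 1 MB = 1,048,576 bytes with inclusive bounds are all assumptions made for illustration.

```python
MB = 1024 * 1024  # assumption: binary megabytes; the docs don't say which unit is meant

# Limits copied from the requirements list; the names are hypothetical.
MAX_FILE_BYTES = {"S0": 500 * MB, "F0": 4 * MB}
MIN_IMAGE_SIDE_PX = 50
MAX_IMAGE_SIDE_PX = 10_000


def check_input(tier, size_bytes, width_px=None, height_px=None):
    """Return a list of problems; an empty list means no listed limit is exceeded."""
    if tier not in MAX_FILE_BYTES:
        raise ValueError(f"unknown tier: {tier!r}")

    problems = []

    # File size limit depends on the pricing tier.
    limit = MAX_FILE_BYTES[tier]
    if size_bytes > limit:
        problems.append(f"file is {size_bytes} bytes; {tier} allows up to {limit}")

    # Image dimensions are only checked when both sides are supplied.
    if width_px is not None and height_px is not None:
        sides_ok = all(
            MIN_IMAGE_SIDE_PX <= side <= MAX_IMAGE_SIDE_PX
            for side in (width_px, height_px)
        )
        if not sides_ok:
            problems.append(
                f"image is {width_px} x {height_px} px; each side must be "
                f"between {MIN_IMAGE_SIDE_PX} and {MAX_IMAGE_SIDE_PX} px"
            )

    return problems
```

For example, `check_input("F0", 5 * MB)` reports one problem because the free tier limit is 4 MB, while the same file passes for `"S0"`. The service remains the authority on what it accepts; a check like this only catches the obvious cases early.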
 ### Optimal training data
@@ -252,7 +252,7 @@ This table compares the supported data extraction areas:
 *-Behaves differently depending upon model. With template models, synthetic data is generated at training time. With neural models, existing text recognized in the region is selected.

 > [!TIP]
-> When choosing between the two model types, start with a custom neural model if it meets your functional needs. See [custom neural](custom-neural.md) to learn more about custom neural models.
+> To choose between the two model types, start with a custom neural model if it meets your functional needs. See [custom neural](custom-neural.md) to learn more about custom neural models.

 :::moniker-end
@@ -266,22 +266,22 @@ The following table describes the features available with the associated tools a
 | Custom form v2.1 |[Document Intelligence 2.1 GA API](/rest/api/aiservices/analyzer?view=rest-aiservices-v2.1&preserve-view=true)|[Document Intelligence SDK](../quickstarts/get-started-sdks-rest-api.md?view=doc-intel-2.1.0&preserve-view=true?pivots=programming-language-python)|[Sample labeling tool](https://fott-2-1.azurewebsites.net/)|
 > [!NOTE]
-> Custom template models trained with the 3.0 API will have a few improvements over the 2.1 API stemming from improvements to the OCR engine. Datasets used to train a custom template model using the 2.1 API can still be used to train a new model using the 3.0 API.
+> Custom template models trained with the 3.0 API have a few improvements over the 2.1 API stemming from improvements to the OCR engine. Datasets used to train a custom template model using the 2.1 API can still be used to train a new model using the 3.0 API.

-*For best results, provide one clear photo or high-quality scan per document.
-*Supported file formats are JPEG/JPG, PNG, BMP, TIFF, and PDF (text-embedded or scanned). Text-embedded PDFs are best to eliminate the possibility of error in character extraction and location.
-*For PDF and TIFF files, up to 2,000 pages can be processed. With a free tier subscription, only the first two pages are processed.
+*One clear photo or high-quality scan per document should be provided for best results.
+*File formats supported are JPEG/JPG, PNG, BMP, TIFF, and PDF (text-embedded or scanned). Text-embedded PDFs are best to eliminate the possibility of error in character extraction and location.
+* PDF and TIFF files, up to 2,000 pages, can be processed. With a free tier subscription, only the first two pages are processed.
 * The file size must be less than 500 MB for paid (S0) tier and 4 MB for free (F0) tier.
 * Image dimensions must be between 50 x 50 pixels and 10,000 x 10,000 pixels.
 * PDF dimensions are up to 17 x 17 inches, corresponding to Legal or A3 paper size, or smaller.
 * The total size of the training data is 500 pages or less.
-*If your PDFs are password-locked, you must remove the lock before submission.
+* PDFs that are password-locked must have the password lock removed before submission.

 > [!TIP]
 > Training data:
 >
 > * If possible, use text-based PDF documents instead of image-based documents. Scanned PDFs are handled as images.
-> *Please supply only a single instance of the form per document.
+> *Supply only a single instance of the form per document.
 > * For filled-in forms, use examples that have all their fields filled in.
 > * Use forms with different values in each field.
 > * If your form images are of lower quality, use a larger dataset. For example, use 10 to 15 images.
-> Regional endpoints do not support Microsoft Entra authentication. Create a [custom subdomain](../../../ai-services/authentication.md?tabs=powershell#create-a-resource-with-a-custom-subdomain) for your resource in order to use this type of authentication.
+> Regional endpoints don't support Microsoft Entra authentication. Create a [custom subdomain](../../../ai-services/authentication.md?tabs=powershell#create-a-resource-with-a-custom-subdomain) for your resource in order to use this type of authentication.

 Authorization is easiest using the `DefaultAzureCredential`. It provides a default token credential, based upon the running environment, capable of handling most Azure authentication scenarios.
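
Because regional endpoints don't support Microsoft Entra authentication, it can help to check which kind of endpoint an application is configured with before it tries token-based authorization. The sketch below is illustrative only. It assumes that regional endpoints use hostnames ending in `.api.cognitive.microsoft.com` and that custom-subdomain endpoints end in `.cognitiveservices.azure.com`; neither suffix is stated in the text above, and `endpoint_kind` is a hypothetical helper, not an SDK function.

```python
from urllib.parse import urlparse

# Assumed hostname suffixes (not taken from the article); adjust for your cloud.
REGIONAL_SUFFIX = ".api.cognitive.microsoft.com"
CUSTOM_SUBDOMAIN_SUFFIX = ".cognitiveservices.azure.com"


def endpoint_kind(endpoint: str) -> str:
    """Classify an endpoint URL as 'regional', 'custom-subdomain', or 'unknown'."""
    # hostname is None when the URL has no scheme, so fall back to an empty string.
    host = (urlparse(endpoint).hostname or "").lower()
    if host.endswith(REGIONAL_SUFFIX):
        return "regional"
    if host.endswith(CUSTOM_SUBDOMAIN_SUFFIX):
        return "custom-subdomain"
    return "unknown"
```

Under these assumptions, a result of `"regional"` suggests falling back to key-based access or creating a custom subdomain first, and `"custom-subdomain"` suggests that `DefaultAzureCredential` can be used.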