|
1 | 1 | ---
|
2 | 2 | author: laujan
|
3 | 3 | ms.service: azure-ai-document-intelligence
|
4 |
| -ms.custom: |
5 |
| - - ignite-2023 |
6 | 4 | ms.topic: include
|
7 |
| -ms.date: 11/15/2023 |
| 5 | +ms.date: 08/13/2024 |
8 | 6 | ms.author: lajanuar
|
9 | 7 | ---
|
10 | 8 | <!-- markdownlint-disable MD041 -->
|
11 | 9 |
|
12 |
| -* For best results, provide one clear photo or high-quality scan per document. |
13 |
| - |
14 | 10 | * Supported file formats:
|
15 | 11 |
|
16 |
| - |Model | PDF |Image: </br>JPEG/JPG, PNG, BMP, TIFF, HEIF | Microsoft Office: </br> Word (DOCX), Excel (XLSX), PowerPoint (PPTX), and HTML| |
17 |
| - |--------|:----:|:-----:|:---------------: |
| 12 | + |Model | PDF |Image: </br>`JPEG/JPG`, `PNG`, `BMP`, `TIFF`, `HEIF` | Microsoft Office: </br> Word (`DOCX`), Excel (`XLSX`), PowerPoint (`PPTX`), HTML| |
| 13 | + |--------|:----:|:-----:|:---------------:| |
18 | 14 | |Read | ✔ | ✔ | ✔ |
|
19 |
| - |Layout | ✔ | ✔ | ✔ (2024-02-29-preview, 2023-10-31-preview) | |
| 15 | + |Layout | ✔ | ✔ | ✔ (2024-07-31-preview, 2024-02-29-preview, 2023-10-31-preview) | |
20 | 16 | |General Document| ✔ | ✔ | |
|
21 | 17 | |Prebuilt | ✔ | ✔ | |
|
22 | 18 | |Custom extraction | ✔ | ✔ | |
|
23 |
| - |Custom classification | ✔ | ✔ | ✔ (2024-02-29-preview) | |
| 19 | + |Custom classification | ✔ | ✔ | ✔ (2024-07-31-preview, 2024-02-29-preview) | |
| 20 | + |
| 21 | +* For best results, provide one clear photo or high-quality scan per document. |
24 | 22 |
|
25 |
| -* For PDF and TIFF, up to 2000 pages can be processed (with a free tier subscription, only the first two pages are processed). |
| 23 | +* For PDF and TIFF, up to 2,000 pages can be processed (with a free tier subscription, only the first two pages are processed). |
26 | 24 |
|
27 |
| -* The file size for analyzing documents is 500 MB for paid (S0) tier and 4 MB for free (F0) tier. |
| 25 | +* The file size for analyzing documents is 500 MB for paid (S0) tier and `4` MB for free (F0) tier. |
28 | 26 |
|
29 |
| -* Image dimensions must be between 50 x 50 pixels and 10,000 px x 10,000 pixels. |
| 27 | +* Image dimensions must be between 50 pixels x 50 pixels and 10,000 pixels x 10,000 pixels. |
30 | 28 |
|
31 | 29 | * If your PDFs are password-locked, you must remove the lock before submission.
|
32 | 30 |
|
33 |
| -* The minimum height of the text to be extracted is 12 pixels for a 1024 x 768 pixel image. This dimension corresponds to about `8`-point text at 150 dots per inch (DPI). |
| 31 | +* The minimum height of the text to be extracted is 12 pixels for a 1024 x 768 pixel image. This dimension corresponds to about `8` point text at 150 dots per inch (DPI). |
34 | 32 |
|
35 | 33 | * For custom model training, the maximum number of pages for training data is 500 for the custom template model and 50,000 for the custom neural model.
|
36 | 34 |
|
37 |
| - * For custom extraction model training, the total size of training data is 50 MB for template model and 1G-MB for the neural model. |
| 35 | + * For custom extraction model training, the total size of training data is 50 MB for template model and `1` GB for the neural model. |
38 | 36 |
|
39 |
| - * For custom classification model training, the total size of training data is `1GB` with a maximum of 10,000 pages. |
| 37 | + * For custom classification model training, the total size of training data is `1` GB with a maximum of 10,000 pages. For 2024-07-31-preview and later, the total size of training data is `2` GB with a maximum of 10,000 pages. |
0 commit comments