You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/applied-ai-services/form-recognizer/includes/input-requirements.md
+21-3Lines changed: 21 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -3,19 +3,37 @@ author: laujan
3
3
ms.service: applied-ai-services
4
4
ms.subservice: forms-recognizer
5
5
ms.topic: include
6
-
ms.date: 07/27/2022
6
+
ms.date: 08/25/2022
7
7
ms.author: lajanuar
8
-
ms.custom: ignite-fall-2021
9
8
---
10
9
<!-- markdownlint-disable MD041 -->
11
10
12
11
* For best results, provide one clear photo or high-quality scan per document.
13
-
* Supported file formats: JPEG/JPG, PNG, BMP, TIFF, and PDF (text-embedded or scanned). Text-embedded PDFs are best to eliminate the possibility of error in character extraction and location. Additionally, only API version`2022/06/30` supports Microsoft Word (DOCX), Excel (XLS), PowerPoint (PPT), and HTML files in Read model.
12
+
13
+
* Supported file formats:
14
+
15
+
|Model | PDF |Image: </br>JPEG/JPG, PNG, BMP, and TIFF | Microsoft Office: </br> Word (DOCX), Excel (XLS), PowerPoint (PPT), and HTML|
16
+
|--------|:----:|:-----:|:---------------:
17
+
|Read | ✔ | ✔ | ✱**REST API version**</br> **`2022/06/30-preview`**
18
+
|Layout | ✔ | ✔ ||
19
+
|General Document| ✔ | ✔ ||
20
+
|Prebuilt | ✔ | ✔ ||
21
+
|Custom | ✔ | ✔ ||
22
+
23
+
✱ Microsoft Office files are currently not supported for other models or versions.
24
+
14
25
* For PDF and TIFF, up to 2000 pages can be processed (with a free tier subscription, only the first two pages are processed).
26
+
15
27
* The file size for analyzing documents must be _less than_ 500 MB for paid (S0) tier and 4 MB for free (F0) tier.
28
+
16
29
* Image dimensions must be between 50 x 50 pixels and 10,000 px x 10,000 pixels.
30
+
17
31
* PDF dimensions are up to 17 x 17 inches, corresponding to Legal or A3 paper size, or smaller.
32
+
18
33
* If your PDFs are password-locked, you must remove the lock before submission.
34
+
19
35
* The minimum height of the text to be extracted is 12 pixels for a 1024 x 768 pixel image. This dimension corresponds to about 8-point text at 150 dots per inch (DPI).
36
+
20
37
* For custom model training, the maximum number of pages for training data is 500 for the custom template model and 50,000 for the custom neural model.
38
+
21
39
* For custom model training, the total size of training data is 50 MB for template model and 1G-MB for the neural model.
0 commit comments