Skip to content

Commit 030eb30

Browse files
authored
Merge pull request #209207 from laujan/1981984-update-input-requirements
update input requirements
2 parents 553d533 + e615de2 commit 030eb30

File tree

1 file changed

+21
-3
lines changed

1 file changed

+21
-3
lines changed

articles/applied-ai-services/form-recognizer/includes/input-requirements.md

Lines changed: 21 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -3,19 +3,37 @@ author: laujan
33
ms.service: applied-ai-services
44
ms.subservice: forms-recognizer
55
ms.topic: include
6-
ms.date: 07/27/2022
6+
ms.date: 08/25/2022
77
ms.author: lajanuar
8-
ms.custom: ignite-fall-2021
98
---
109
<!-- markdownlint-disable MD041 -->
1110

1211
* For best results, provide one clear photo or high-quality scan per document.
13-
* Supported file formats: JPEG/JPG, PNG, BMP, TIFF, and PDF (text-embedded or scanned). Text-embedded PDFs are best to eliminate the possibility of error in character extraction and location. Additionally, only API version`2022/06/30` supports Microsoft Word (DOCX), Excel (XLS), PowerPoint (PPT), and HTML files in Read model.
12+
13+
* Supported file formats:
14+
15+
|Model | PDF |Image: </br>JPEG/JPG, PNG, BMP, and TIFF | Microsoft Office: </br> Word (DOCX), Excel (XLS), PowerPoint (PPT), and HTML|
16+
|--------|:----:|:-----:|:---------------:
17+
|Read | ✔ | ✔ | &#x2731; **REST API version**</br> **`2022/06/30-preview`**
18+
|Layout ||| |
19+
|General&nbsp;Document||| |
20+
|Prebuilt ||| |
21+
|Custom ||| |
22+
23+
&#x2731; Microsoft Office files are currently not supported for other models or versions.
24+
1425
* For PDF and TIFF, up to 2000 pages can be processed (with a free tier subscription, only the first two pages are processed).
26+
1527
* The file size for analyzing documents must be _less than_ 500 MB for paid (S0) tier and 4 MB for free (F0) tier.
28+
1629
* Image dimensions must be between 50 x 50 pixels and 10,000 px x 10,000 pixels.
30+
1731
* PDF dimensions are up to 17 x 17 inches, corresponding to Legal or A3 paper size, or smaller.
32+
1833
* If your PDFs are password-locked, you must remove the lock before submission.
34+
1935
* The minimum height of the text to be extracted is 12 pixels for a 1024 x 768 pixel image. This dimension corresponds to about 8-point text at 150 dots per inch (DPI).
36+
2037
* For custom model training, the maximum number of pages for training data is 500 for the custom template model and 50,000 for the custom neural model.
38+
2139
* For custom model training, the total size of training data is 50 MB for template model and 1G-MB for the neural model.

0 commit comments

Comments
 (0)