Skip to content

Commit 10b3b0f

Browse files
committed
update version support
1 parent f4ecd08 commit 10b3b0f

File tree

2 files changed

+28
-2
lines changed

2 files changed

+28
-2
lines changed

articles/ai-services/document-intelligence/concept/retrieval-augmented-generation.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -119,7 +119,6 @@ If you're looking for a specific section in a document, you can use semantic chu
119119

120120
```python
121121

122-
# Using SDK targeting 2024-11-30 (GA), make sure your resource is in one of these regions: East US, West US2, West Europe
123122
# pip install azure-ai-documentintelligence==1.0.0b1
124123
# pip install langchain langchain-community azure-ai-documentintelligence
125124

articles/ai-services/document-intelligence/prebuilt/layout.md

Lines changed: 28 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -610,7 +610,34 @@ Document Intelligence v2.1 supports the following tools, applications, and libra
610610

611611
## Input guidance
612612

613-
[!INCLUDE [input requirements](./../includes/input-requirements.md)]
613+
Supported file formats:
614+
615+
|Model | PDF |Image: </br>`JPEG/JPG`, `PNG`, `BMP`, `TIFF`, `HEIF` | Microsoft Office: </br> Word (`DOCX`), Excel (`XLSX`), PowerPoint (`PPTX`), HTML|
616+
|--------|:----:|:-----:|:---------------:|
617+
|Read | ✔ | ✔ | ✔ |
618+
|Layout | ✔ | ✔ | |
619+
|General&nbsp;Document| ✔ | ✔ | |
620+
|Prebuilt | ✔ | ✔ | |
621+
|Custom extraction | ✔ | ✔ | |
622+
|Custom classification | ✔ | ✔ | ✔ |
623+
624+
* For best results, provide one clear photo or high-quality scan per document.
625+
626+
* For PDF and TIFF, up to 2,000 pages can be processed (with a free tier subscription, only the first two pages are processed).
627+
628+
* The file size for analyzing documents is 500 MB for paid (S0) tier and `4` MB for free (F0) tier.
629+
630+
* Image dimensions must be between 50 pixels x 50 pixels and 10,000 pixels x 10,000 pixels.
631+
632+
* If your PDFs are password-locked, you must remove the lock before submission.
633+
634+
* The minimum height of the text to be extracted is 12 pixels for a 1024 x 768 pixel image. This dimension corresponds to about `8` point text at 150 dots per inch (DPI).
635+
636+
* For custom model training, the maximum number of pages for training data is 500 for the custom template model and 50,000 for the custom neural model.
637+
638+
* For custom extraction model training, the total size of training data is 50 MB for template model and `1` GB for the neural model.
639+
640+
* For custom classification model training, the total size of training data is `1` GB with a maximum of 10,000 pages. For 2024-11-30 (GA), the total size of training data is `2` GB with a maximum of 10,000 pages.
614641

615642
:::moniker-end
616643

0 commit comments

Comments
 (0)