You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
|Model | PDF |Image: </br>`JPEG/JPG`, `PNG`, `BMP`, `TIFF`, `HEIF` | Microsoft Office: </br> Word (`DOCX`), Excel (`XLSX`), PowerPoint (`PPTX`), HTML|
616
+
|--------|:----:|:-----:|:---------------:|
617
+
|Read | ✔ | ✔ | ✔ |
618
+
|Layout | ✔ | ✔ | |
619
+
|General Document| ✔ | ✔ | |
620
+
|Prebuilt | ✔ | ✔ | |
621
+
|Custom extraction | ✔ | ✔ | |
622
+
|Custom classification | ✔ | ✔ | ✔ |
623
+
624
+
* For best results, provide one clear photo or high-quality scan per document.
625
+
626
+
* For PDF and TIFF, up to 2,000 pages can be processed (with a free tier subscription, only the first two pages are processed).
627
+
628
+
* The file size for analyzing documents is 500 MB for paid (S0) tier and `4` MB for free (F0) tier.
629
+
630
+
* Image dimensions must be between 50 pixels x 50 pixels and 10,000 pixels x 10,000 pixels.
631
+
632
+
* If your PDFs are password-locked, you must remove the lock before submission.
633
+
634
+
* The minimum height of the text to be extracted is 12 pixels for a 1024 x 768 pixel image. This dimension corresponds to about `8` point text at 150 dots per inch (DPI).
635
+
636
+
* For custom model training, the maximum number of pages for training data is 500 for the custom template model and 50,000 for the custom neural model.
637
+
638
+
* For custom extraction model training, the total size of training data is 50 MB for template model and `1` GB for the neural model.
639
+
640
+
* For custom classification model training, the total size of training data is `1` GB with a maximum of 10,000 pages. For 2024-11-30 (GA), the total size of training data is `2` GB with a maximum of 10,000 pages.
0 commit comments