Commit 2fdccd7

ahmadelmalah and mdrxy authored
docs: update Textract docs (#31992)
I am modifying two things:
1. Replacing "This sample demonstrates" with "The following samples demonstrate", as we're talking about at least 4 samples
2. Moving that sentence to after the definition of Textract to keep the document organized (Textract definition, then samples)

---------

Co-authored-by: Mason Daugherty <[email protected]>
1 parent 553ac18 commit 2fdccd7

1 file changed: +2 -15 lines changed

docs/docs/integrations/document_loaders/amazon_textract.ipynb

Lines changed: 2 additions & 15 deletions
@@ -11,11 +11,9 @@
 ">\n",
 ">It goes beyond simple optical character recognition (OCR) to identify, understand, and extract data from forms and tables. Today, many companies manually extract data from scanned documents such as PDFs, images, tables, and forms, or through simple OCR software that requires manual configuration (which often must be updated when the form changes). To overcome these manual and expensive processes, `Textract` uses ML to read and process any type of document, accurately extracting text, handwriting, tables, and other data with no manual effort. \n",
 "\n",
-"This sample demonstrates the use of `Amazon Textract` in combination with LangChain as a DocumentLoader.\n",
+"`Textract` supports `JPEG`, `PNG`, `PDF`, and `TIFF` file formats; more information is available in [the documentation](https://docs.aws.amazon.com/textract/latest/dg/limits-document.html).\n",
 "\n",
-"`Textract` supports`PDF`, `TIFF`, `PNG` and `JPEG` format.\n",
-"\n",
-"`Textract` supports these [document sizes, languages and characters](https://docs.aws.amazon.com/textract/latest/dg/limits-document.html)."
+"The following samples demonstrate the use of `Amazon Textract` in combination with LangChain as a DocumentLoader."
 ]
 },
 {
@@ -310,17 +308,6 @@
 "\n",
 "chain.run(input_documents=documents, question=query)"
 ]
-},
-{
-"cell_type": "markdown",
-"id": "bd97f1c90aff6a83",
-"metadata": {
-"collapsed": false,
-"jupyter": {
-"outputs_hidden": false
-}
-},
-"source": []
 }
 ],
 "metadata": {
