You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/applied-ai-services/form-recognizer/concept-read.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -26,11 +26,11 @@ recommendations: false
26
26
27
27
Optical Character Recognition (OCR) for documents is optimized for large text-heavy documents in multiple file formats and global languages. It should include features like higher-resolution scanning of document images for better handling of smaller and dense text, paragraphs detection, handling fillable forms, and advanced forms and document scenarios like single character boxes and accurate extraction of key fields commonly found in invoices, receipts, and other prebuilt scenarios.
28
28
29
-
## Form Recognizer Read model
29
+
## OCR in Form Recognizer - Read model
30
30
31
31
Form Recognizer v3.0’s Read Optical Character Recognition (OCR) model runs at a higher resolution than Computer Vision Read and extracts print and handwritten text from PDF documents and scanned images. It also includes preview support for extracting text from Microsoft Word, Excel, PowerPoint, and HTML documents. It detects paragraphs, text lines, words, locations, and languages, and is the underlying OCR engine for other Form Recognizer models like Layout, General Document, Invoice, Receipt, Identity (ID) document, and other prebuilt models, as well as custom models.
32
32
33
-
## Supported document types
33
+
## OCR supported document types
34
34
35
35
> [!NOTE]
36
36
>
@@ -91,7 +91,7 @@ Try extracting text from forms and documents using the Form Recognizer Studio. Y
91
91
92
92
## Supported languages and locales
93
93
94
-
Form Recognizer v3.0 version supports several languages for the read model. *See* our [Language Support](language-support.md) for a complete list of supported handwritten and printed languages.
94
+
Form Recognizer v3.0 version supports several languages for the read OCR model. *See* our [Language Support](language-support.md) for a complete list of supported handwritten and printed languages.
> | Images: General, in-the-wild images | labels, street signs, and posters | [Computer Vision v4.0 preview](../concept-ocr.md) | Optimized for general, non-document images with a performance-enhanced synchronous API that makes it easier to embed OCR in your user experience scenarios.
23
-
> | Documents: Digital and scanned, including images | books, articles, and reports | [Form Recognizer](../../../applied-ai-services/form-recognizer/concept-read.md) | Optimized for text-heavy scanned and digital documents with an asynchronous API to help automate intelligent document processing at scale.
22
+
> | **Images**: General, in-the-wild images | labels, street signs, and posters | [Computer Vision v4.0 preview](../concept-ocr.md) | Optimized for general, non-document images with a performance-enhanced synchronous API that makes it easier to embed OCR in your user experience scenarios.
23
+
> | **Documents**: Digital and scanned, including images | books, articles, and reports | [Form Recognizer](../../../applied-ai-services/form-recognizer/concept-read.md) | Optimized for text-heavy scanned and digital documents with an asynchronous API to help automate intelligent document processing at scale.
24
24
>
25
-
> **Computer Vision v3.2 GA Read**
25
+
> **About Computer Vision v3.2 GA Read**
26
26
>
27
-
> Follow the Computer Vision 3.2 GA Read[overview](../how-to/call-read-api.md) and [quickstart](../quickstarts-sdk/client-library.md), but note that all future Read OCR enhancements for image and document scenarios will be part of the two new services listed above. There will be no further updates to the Computer Vision 3.2 Read version.
27
+
> Looking for the most recent Computer Vision v3.2 GA Read? Note that all future Read OCR enhancements will be part of the two new services listed above. There will be no further updates to the Computer Vision v3.2. To continue, see the Computer Vision v3.2 GA Read [overview](../how-to/call-read-api.md) and [quickstart](../quickstarts-sdk/client-library.md).
title: What is Optical Character Recognition (OCR)?
2
+
title: OCR - Optical Character Recognition
3
3
titleSuffix: Azure Cognitive Services
4
-
description: The optical character recognition (OCR) service extracts print and handwritten text from images.
4
+
description: Learn how the optical character recognition (OCR) services extract print and handwritten text from images and documents in global languages.
OCR or Optical Character Recognition is also referred to as text recognition or text extraction. Machine-learning based OCR techniques allow you to extract printed or handwritten text from images, such as posters, street signs and product labels, as well as from documents like articles, reports, forms, and invoices. The text is typically extracted as words, text lines, and paragraphs or text blocks, enabling access to digital version of the scanned text. This eliminates or significantly reduces the need for manual data entry.
20
20
21
21
## How is OCR related to Intelligent Document Processing (IDP)?
22
22
23
23
Intelligent Document Processing (IDP) uses OCR as its foundational technology to additionally extract structure, relationships, key-values, entities, and other document-centric insights with an advanced machine-learning based AI service like [Form Recognizer](../../applied-ai-services/form-recognizer/overview.md). Form Recognizer includes a document-optimized version of **Read** as its OCR engine while delegating to other models for higher-end insights. If you are extracting text from scanned and digital documents, use [Form Recognizer Read OCR](../../applied-ai-services/form-recognizer/concept-read.md).
24
24
25
-
## Read OCR engine
25
+
## OCR engine
26
26
Microsoft's **Read** OCR engine is composed of multiple advanced machine-learning based models supporting [global languages](./language-support.md). This allows them to extract printed and handwritten text including mixed languages and writing styles. **Read** is available as cloud service and on-premises container for deployment flexibility. With the latest preview, it's also available as a synchronous API for single, non-document, image-only scenarios with performance enhancements that make it easier to implement OCR-assisted user experiences.
27
27
28
+
> [!WARNING]
29
+
> The Computer Vision legacy [ocr](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/56f91f2e778daf14a499f20d) and [RecognizeText](https://westus.dev.cognitive.microsoft.com/docs/services/5cd27ec07268f6c679a3e641/operations/587f2c6a1540550560080311) operations are no longer supported and should not be used.
@@ -36,13 +39,13 @@ Try out OCR by using Vision Studio. Then follow one of the links to the Read edi
36
39
37
40
:::image type="content" source="Images/vision-studio-ocr-demo.png" alt-text="Screenshot: Read OCR demo in Vision Studio.":::
38
41
39
-
## Supported languages
42
+
## OCR supported languages
40
43
41
44
Both **Read** versions available today in Computer Vision support several languages for printed and handwritten text. OCR for printed text includes support for English, French, German, Italian, Portuguese, Spanish, Chinese, Japanese, Korean, Russian, Arabic, Hindi, and other international languages that use Latin, Cyrillic, Arabic, and Devanagari scripts. OCR for handwritten text includes support for English, Chinese Simplified, French, German, Italian, Japanese, Korean, Portuguese, and Spanish languages.
42
45
43
46
Refer to the full list of [OCR-supported languages](./language-support.md#optical-character-recognition-ocr).
44
47
45
-
## Read OCR common features
48
+
## OCR common features
46
49
47
50
The Read OCR model is available in Computer Vision and Form Recognizer with common baseline capabilities while optimizing for respective scenarios. The following list summarizes the common features:
48
51
@@ -51,21 +54,18 @@ The Read OCR model is available in Computer Vision and Form Recognizer with comm
51
54
* Support for mixed languages, mixed mode (print and handwritten)
52
55
* Available as Distroless Docker container for on-premises deployment
53
56
54
-
## Use the cloud APIs or deploy on-premises
57
+
## Use the OCR cloud APIs or deploy on-premises
55
58
56
59
The cloud APIs are the preferred option for most customers because of their ease of integration and fast productivity out of the box. Azure and the Computer Vision service handle scale, performance, data security, and compliance needs while you focus on meeting your customers' needs.
57
60
58
61
For on-premises deployment, the [Read Docker container (preview)](./computer-vision-how-to-install-containers.md) enables you to deploy the Computer Vision v3.2 generally available OCR capabilities in your own local environment. Containers are great for specific security and data governance requirements.
59
62
60
-
> [!WARNING]
61
-
> The Computer Vision [ocr](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/56f91f2e778daf14a499f20d) and [RecognizeText](https://westus.dev.cognitive.microsoft.com/docs/services/5cd27ec07268f6c679a3e641/operations/587f2c6a1540550560080311) operations are no longer supported and should not be used.
62
-
63
-
## Data privacy and security
63
+
## OCR data privacy and security
64
64
65
65
As with all of the Cognitive Services, developers using the Computer Vision service should be aware of Microsoft's policies on customer data. See the [Cognitive Services page](https://www.microsoft.com/trustcenter/cloudservices/cognitiveservices) on the Microsoft Trust Center to learn more.
66
66
67
67
## Next steps
68
68
69
-
-For general (non-document) images, try the [Computer Vision 4.0 preview Image Analysis REST API quickstart](./concept-ocr.md).
70
-
-For PDF, Office and HTML documents and document images, start with [Form Recognizer Read](../../applied-ai-services/form-recognizer/concept-read.md).
69
+
-OCR for general (non-document) images - try the [Computer Vision 4.0 preview Image Analysis REST API quickstart](./concept-ocr.md).
70
+
-OCR for PDF, Office and HTML documents and document images, start with [Form Recognizer Read](../../applied-ai-services/form-recognizer/concept-read.md).
71
71
- Looking for the previous GA version? Refer to the [Computer Vision 3.2 GA SDK or REST API quickstarts](./quickstarts-sdk/client-library.md).
0 commit comments