Commit 6124250

Merge pull request #216492 from sanjeev3/main
[Cog Svcs] OCR SEO related edits
2 parents 943466b + d262977 commit 6124250

3 files changed: +23 -23 lines changed

articles/applied-ai-services/form-recognizer/concept-read.md

Lines changed: 3 additions & 3 deletions
@@ -26,11 +26,11 @@ recommendations: false
 
 Optical Character Recognition (OCR) for documents is optimized for large text-heavy documents in multiple file formats and global languages. It should include features like higher-resolution scanning of document images for better handling of smaller and dense text, paragraphs detection, handling fillable forms, and advanced forms and document scenarios like single character boxes and accurate extraction of key fields commonly found in invoices, receipts, and other prebuilt scenarios.
 
-## Form Recognizer Read model
+## OCR in Form Recognizer - Read model
 
 Form Recognizer v3.0’s Read Optical Character Recognition (OCR) model runs at a higher resolution than Computer Vision Read and extracts print and handwritten text from PDF documents and scanned images. It also includes preview support for extracting text from Microsoft Word, Excel, PowerPoint, and HTML documents. It detects paragraphs, text lines, words, locations, and languages, and is the underlying OCR engine for other Form Recognizer models like Layout, General Document, Invoice, Receipt, Identity (ID) document, and other prebuilt models, as well as custom models.
 
-## Supported document types
+## OCR supported document types
 
 > [!NOTE]
 >
@@ -91,7 +91,7 @@ Try extracting text from forms and documents using the Form Recognizer Studio. Y
 
 ## Supported languages and locales
 
-Form Recognizer v3.0 version supports several languages for the read model. *See* our [Language Support](language-support.md) for a complete list of supported handwritten and printed languages.
+Form Recognizer v3.0 version supports several languages for the read OCR model. *See* our [Language Support](language-support.md) for a complete list of supported handwritten and printed languages.
 
 ## Data detection and extraction
 
Lines changed: 7 additions & 7 deletions
@@ -1,5 +1,5 @@
 ---
-title: "Read OCR editions"
+title: "OCR (Read) editions"
 titleSuffix: "Azure Cognitive Services"
 services: cognitive-services
 author: PatrickFarley
@@ -12,16 +12,16 @@ ms.date: 09/23/2022
 ms.author: pafarley
 ---
 
-## Read OCR editions
+## OCR (Read) editions
 
 > [!IMPORTANT]
-> Select the Read model and quickstart that best fit your requirements.
+> Select the Read edition that best fits your requirements.
 >
 > | Input | Examples | Read edition | Benefit |
 > |----------|--------------|-------------------------|-------------------------|
-> | Images: General, in-the-wild images | labels, street signs, and posters | [Computer Vision v4.0 preview](../concept-ocr.md) | Optimized for general, non-document images with a performance-enhanced synchronous API that makes it easier to embed OCR in your user experience scenarios.
-> | Documents: Digital and scanned, including images | books, articles, and reports | [Form Recognizer](../../../applied-ai-services/form-recognizer/concept-read.md) | Optimized for text-heavy scanned and digital documents with an asynchronous API to help automate intelligent document processing at scale.
+> | **Images**: General, in-the-wild images | labels, street signs, and posters | [Computer Vision v4.0 preview](../concept-ocr.md) | Optimized for general, non-document images with a performance-enhanced synchronous API that makes it easier to embed OCR in your user experience scenarios.
+> | **Documents**: Digital and scanned, including images | books, articles, and reports | [Form Recognizer](../../../applied-ai-services/form-recognizer/concept-read.md) | Optimized for text-heavy scanned and digital documents with an asynchronous API to help automate intelligent document processing at scale.
 >
-> **Computer Vision v3.2 GA Read**
+> **About Computer Vision v3.2 GA Read**
 >
-> Follow the Computer Vision 3.2 GA Read [overview](../how-to/call-read-api.md) and [quickstart](../quickstarts-sdk/client-library.md), but note that all future Read OCR enhancements for image and document scenarios will be part of the two new services listed above. There will be no further updates to the Computer Vision 3.2 Read version.
+> Looking for the most recent Computer Vision v3.2 GA Read? Note that all future Read OCR enhancements will be part of the two new services listed above. There will be no further updates to the Computer Vision v3.2. To continue, see the Computer Vision v3.2 GA Read [overview](../how-to/call-read-api.md) and [quickstart](../quickstarts-sdk/client-library.md).
Lines changed: 13 additions & 13 deletions
@@ -1,7 +1,7 @@
 ---
-title: What is Optical Character Recognition (OCR)?
+title: OCR - Optical Character Recognition
 titleSuffix: Azure Cognitive Services
-description: The optical character recognition (OCR) service extracts print and handwritten text from images.
+description: Learn how the optical character recognition (OCR) services extract print and handwritten text from images and documents in global languages.
 services: cognitive-services
 author: PatrickFarley
 manager: nitinme
@@ -14,17 +14,20 @@ ms.author: pafarley
 ms.custom: seodec18, devx-track-csharp, ignite-2022
 ---
 
-# What is Optical Character Recognition (OCR)
+# OCR - Optical Character Recognition
 
 OCR or Optical Character Recognition is also referred to as text recognition or text extraction. Machine-learning based OCR techniques allow you to extract printed or handwritten text from images, such as posters, street signs and product labels, as well as from documents like articles, reports, forms, and invoices. The text is typically extracted as words, text lines, and paragraphs or text blocks, enabling access to digital version of the scanned text. This eliminates or significantly reduces the need for manual data entry.
 
 ## How is OCR related to Intelligent Document Processing (IDP)?
 
 Intelligent Document Processing (IDP) uses OCR as its foundational technology to additionally extract structure, relationships, key-values, entities, and other document-centric insights with an advanced machine-learning based AI service like [Form Recognizer](../../applied-ai-services/form-recognizer/overview.md). Form Recognizer includes a document-optimized version of **Read** as its OCR engine while delegating to other models for higher-end insights. If you are extracting text from scanned and digital documents, use [Form Recognizer Read OCR](../../applied-ai-services/form-recognizer/concept-read.md).
 
-## Read OCR engine
+## OCR engine
 Microsoft's **Read** OCR engine is composed of multiple advanced machine-learning based models supporting [global languages](./language-support.md). This allows them to extract printed and handwritten text including mixed languages and writing styles. **Read** is available as cloud service and on-premises container for deployment flexibility. With the latest preview, it's also available as a synchronous API for single, non-document, image-only scenarios with performance enhancements that make it easier to implement OCR-assisted user experiences.
 
+> [!WARNING]
+> The Computer Vision legacy [ocr](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/56f91f2e778daf14a499f20d) and [RecognizeText](https://westus.dev.cognitive.microsoft.com/docs/services/5cd27ec07268f6c679a3e641/operations/587f2c6a1540550560080311) operations are no longer supported and should not be used.
+
 [!INCLUDE [read-editions](includes/read-editions.md)]
 
 ## How to use OCR
@@ -36,13 +39,13 @@ Try out OCR by using Vision Studio. Then follow one of the links to the Read edi
 
 :::image type="content" source="Images/vision-studio-ocr-demo.png" alt-text="Screenshot: Read OCR demo in Vision Studio.":::
 
-## Supported languages
+## OCR supported languages
 
 Both **Read** versions available today in Computer Vision support several languages for printed and handwritten text. OCR for printed text includes support for English, French, German, Italian, Portuguese, Spanish, Chinese, Japanese, Korean, Russian, Arabic, Hindi, and other international languages that use Latin, Cyrillic, Arabic, and Devanagari scripts. OCR for handwritten text includes support for English, Chinese Simplified, French, German, Italian, Japanese, Korean, Portuguese, and Spanish languages.
 
 Refer to the full list of [OCR-supported languages](./language-support.md#optical-character-recognition-ocr).
 
-## Read OCR common features
+## OCR common features
 
 The Read OCR model is available in Computer Vision and Form Recognizer with common baseline capabilities while optimizing for respective scenarios. The following list summarizes the common features:
 
@@ -51,21 +54,18 @@ The Read OCR model is available in Computer Vision and Form Recognizer with comm
 * Support for mixed languages, mixed mode (print and handwritten)
 * Available as Distroless Docker container for on-premises deployment
 
-## Use the cloud APIs or deploy on-premises
+## Use the OCR cloud APIs or deploy on-premises
 
 The cloud APIs are the preferred option for most customers because of their ease of integration and fast productivity out of the box. Azure and the Computer Vision service handle scale, performance, data security, and compliance needs while you focus on meeting your customers' needs.
 
 For on-premises deployment, the [Read Docker container (preview)](./computer-vision-how-to-install-containers.md) enables you to deploy the Computer Vision v3.2 generally available OCR capabilities in your own local environment. Containers are great for specific security and data governance requirements.
 
-> [!WARNING]
-> The Computer Vision [ocr](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/56f91f2e778daf14a499f20d) and [RecognizeText](https://westus.dev.cognitive.microsoft.com/docs/services/5cd27ec07268f6c679a3e641/operations/587f2c6a1540550560080311) operations are no longer supported and should not be used.
-
-## Data privacy and security
+## OCR data privacy and security
 
 As with all of the Cognitive Services, developers using the Computer Vision service should be aware of Microsoft's policies on customer data. See the [Cognitive Services page](https://www.microsoft.com/trustcenter/cloudservices/cognitiveservices) on the Microsoft Trust Center to learn more.
 
 ## Next steps
 
-- For general (non-document) images, try the [Computer Vision 4.0 preview Image Analysis REST API quickstart](./concept-ocr.md).
-- For PDF, Office and HTML documents and document images, start with [Form Recognizer Read](../../applied-ai-services/form-recognizer/concept-read.md).
+- OCR for general (non-document) images - try the [Computer Vision 4.0 preview Image Analysis REST API quickstart](./concept-ocr.md).
+- OCR for PDF, Office and HTML documents and document images, start with [Form Recognizer Read](../../applied-ai-services/form-recognizer/concept-read.md).
 - Looking for the previous GA version? Refer to the [Computer Vision 3.2 GA SDK or REST API quickstarts](./quickstarts-sdk/client-library.md).
