Skip to content

Commit fc51436

Browse files
authored
Merge pull request #196800 from sanjeev3/main
Computer Vision Read 3.2 Model GA release
2 parents 6f11b6c + 3367e9c commit fc51436

File tree

13 files changed

+172
-182
lines changed

13 files changed

+172
-182
lines changed

articles/applied-ai-services/form-recognizer/language-support.md

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ ms.author: lajanuar
1313

1414
# Language support for Form Recognizer
1515

16-
This table lists the written languages supported by each Form Recognizer service.
16+
This article covers the supported languages for text and field **extraction (by feature)** and **[detection (Read only)](#detected-languages-read-api)**. Both groups are mutually exclusive.
1717

1818
<!-- markdownlint-disable MD001 -->
1919
<!-- markdownlint-disable MD024 -->
@@ -29,7 +29,7 @@ The following lists include the currently GA languages in for the v2.1 version a
2929
3030
To use the preview languages, refer to the [v3.0 REST API migration guide](/rest/api/media/#changes-to-the-rest-api-endpoints) to understand the differences from the v2.1 GA API and explore the [v3.0 preview SDK quickstarts](quickstarts/try-v3-python-sdk.md).
3131

32-
### Handwritten languages (preview and GA)
32+
### Handwritten text (preview and GA)
3333

3434
The following table lists the supported languages for extracting handwritten texts.
3535

@@ -41,7 +41,7 @@ The following table lists the supported languages for extracting handwritten tex
4141
|German (preview) |`de`|Spanish (preview) |`es`|
4242
|Italian (preview) |`it`|
4343

44-
### Print languages (preview)
44+
### Print text (preview)
4545

4646
This section lists the supported languages for extracting printed texts in the latest preview.
4747

@@ -94,7 +94,7 @@ This section lists the supported languages for extracting printed texts in the l
9494
|Kurukh (Devanagari) | `kru`|Welsh | `cy`
9595
|Kyrgyz (Cyrillic) | `ky`
9696

97-
### Print languages (GA)
97+
### Print text (GA)
9898

9999
This section lists the supported languages for extracting printed texts in the latest GA version.
100100

@@ -178,14 +178,14 @@ Language| Locale code |
178178

179179
## Detected languages: Read API
180180

181-
The [Read API](concept-read.md) supports language detection for the following languages:
181+
The [Read API](concept-read.md) supports detecting the following languages in your documents. This list may include languages not currently supported for text extraction.
182182

183183
> [!NOTE]
184184
> **Language detection**
185185
>
186-
> Form Recognizer read model can _detect_ a wide range of languages, variants, dialects, and some regional/cultural languages and return a language code.
187-
>
188-
> This section lists the languages that can be detected using the Read API. To determine if text can also be _extracted_ for a given language, see [handwritten](#handwritten-languages-preview-and-ga), [print preview](#print-languages-preview), and [print GA](#print-languages-ga) language extraction lists (above).
186+
> Form Recognizer read model can _detect_ possible presence of languages and returns language codes for detected languages. To determine if text can also be
187+
> extracted for a given language, see previous sections.
188+
189189

190190
| Language | Code |
191191
|---------------------|---------------|

articles/cognitive-services/Computer-vision/Vision-API-How-to-Topics/call-read-api.md

Lines changed: 7 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -23,17 +23,18 @@ This guide assumes you have already <a href="https://portal.azure.com/#create/Mi
2323

2424
### Specify the OCR model
2525

26-
By default, the service will use the latest GA model to extract text. Starting with Read 3.2, a `model-version` parameter allows choosing between the GA and preview models for a given API version. The model you specify will be used to extract text with the Read operation.
26+
By default, the service will use the latest generally available (GA) model to extract text. Starting with Read 3.2, a `model-version` parameter allows choosing between the GA and preview models for a given API version. The model you specify will be used to extract text with the Read operation.
2727

2828
When using the Read operation, use the following values for the optional `model-version` parameter.
2929

3030
|Value| Model used |
3131
|:-----|:----|
32-
| 2022-01-30-preview | Latest preview model with additonal Hindi, Arabic and other Devanagari and Arabic script languages and enhancements to the previous preview.
33-
| 2021-09-30-preview | Previous preview model with addiitonal Russian and Cyrillic languages and enhancements to the GA previous model.
34-
| 2021-04-12 | most recent GA model |
35-
| Not provided | most recent GA model |
36-
| latest | most recent GA model|
32+
| Not provided | Latest GA model |
33+
| latest | Latest GA model|
34+
| [2022-04-30](../whats-new.md#may-2022) | Latest GA model. 164 languages for print text and 9 languages for handwritten text along with several enhancements on quality and performance |
35+
| [2022-01-30-preview](../whats-new.md#february-2022) | Preview model adds print text support for Hindi, Arabic and related languages. For handwriitten text, adds support for Japanese and Korean. |
36+
| [2021-09-30-preview](../whats-new.md#september-2021) | Preview model adds print text support for Russian and other Cyrillic languages, For handwriitten text, adds support for Chinese Simplified, French, German, Italian, Portuguese, and Spanish. |
37+
| 2021-04-12 | 2021 GA model |
3738

3839
### Input language
3940

articles/cognitive-services/Computer-vision/computer-vision-how-to-install-containers.md

Lines changed: 16 additions & 35 deletions
Original file line numberDiff line numberDiff line change
@@ -23,14 +23,14 @@ Containers enable you to run the Computer Vision APIs in your own environment. C
2323
The *Read* OCR container allows you to extract printed and handwritten text from images and documents with support for JPEG, PNG, BMP, PDF, and TIFF file formats. For more information, see the [Read API how-to guide](Vision-API-How-to-Topics/call-read-api.md).
2424

2525
## What's new
26-
For existing users of the Read containers, a new `3.2-model-2021-09-30-preview` version of the Read container is available with support for 122 languages and general performance and AI enhancements. Please follow the [download instructions](#docker-pull-for-the-read-ocr-container) to get started.
26+
The `3.2-model-2022-04-30` GA version of the Read container is available with support for [164 languages and other enhancements](./whats-new.md#may-2022). If you are an existing customer, please follow the [download instructions](#docker-pull-for-the-read-ocr-container) to get started.
2727

2828
## Read 3.2 container
2929

30-
The Read 3.2 OCR container provides:
30+
The Read 3.2 OCR container latest GA model provides:
3131
* New models for enhanced accuracy.
3232
* Support for multiple languages within the same document.
33-
* Support for a total of 73 languages. See the full list of [OCR-supported languages](./language-support.md#optical-character-recognition-ocr).
33+
* Support for a total of 164 languages. See the full list of [OCR-supported languages](./language-support.md#optical-character-recognition-ocr).
3434
* A single operation for both documents and images.
3535
* Support for larger documents and images.
3636
* Confidence scores.
@@ -84,29 +84,22 @@ grep -q avx2 /proc/cpuinfo && echo AVX2 supported || echo No AVX2 support detect
8484

8585
Container images for Read are available.
8686

87-
| Container | Container Registry / Repository / Image Name |
88-
|-----------|------------|
89-
| Read 3.2 model-2021-09-30-preview | `mcr.microsoft.com/azure-cognitive-services/vision/read:3.2-model-2021-09-30-preview` |
90-
| Read 3.2 | `mcr.microsoft.com/azure-cognitive-services/vision/read:3.2` |
91-
| Read 2.0-preview | `mcr.microsoft.com/azure-cognitive-services/vision/read:2.0-preview` |
87+
| Container | Container Registry / Repository / Image Name | Tags |
88+
|-----------|------------|-----------------------------------------|
89+
| Read 3.2 GA | `mcr.microsoft.com/azure-cognitive-services/vision/read:3.2-model-2022-04-30` | latest, 3.2, 3.2-model-2022-04-30 |
90+
| Read 2.0-preview | `mcr.microsoft.com/azure-cognitive-services/vision/read:2.0-preview` |2.0.019300020-amd64-preview |
9291

9392
Use the [`docker pull`](https://docs.docker.com/engine/reference/commandline/pull/) command to download a container image.
9493

9594
### Docker pull for the Read OCR container
9695

97-
For the latest preview:
96+
# [Version 3.2 GA](#tab/version-3-2)
9897

9998
```bash
100-
docker pull mcr.microsoft.com/azure-cognitive-services/vision/read:3.2-model-2021-09-30-preview
99+
docker pull mcr.microsoft.com/azure-cognitive-services/vision/read:3.2-model-2022-04-30
101100
```
102101

103-
# [Version 3.2](#tab/version-3-2)
104-
105-
```bash
106-
docker pull mcr.microsoft.com/azure-cognitive-services/vision/read:3.2
107-
```
108-
109-
# [Version 2.0-preview](#tab/version-2)
102+
# [Version 2.0 preview](#tab/version-2)
110103

111104
```bash
112105
docker pull mcr.microsoft.com/azure-cognitive-services/vision/read:2.0-preview
@@ -129,37 +122,31 @@ Use the [docker run](https://docs.docker.com/engine/reference/commandline/run/)
129122

130123
[Examples](computer-vision-resource-container-config.md#example-docker-run-commands) of the `docker run` command are available.
131124

132-
For the latest preview, replace 3.2 path with:
133-
134-
```
135-
mcr.microsoft.com/azure-cognitive-services/vision/read:3.2-model-2021-09-30-preview
136-
```
137-
138125
# [Version 3.2](#tab/version-3-2)
139126

140127
```bash
141-
docker run --rm -it -p 5000:5000 --memory 18g --cpus 8 \
142-
mcr.microsoft.com/azure-cognitive-services/vision/read:3.2 \
128+
docker run --rm -it -p 5000:5000 --memory 16g --cpus 8 \
129+
mcr.microsoft.com/azure-cognitive-services/vision/read:3.2-model-2022-04-30 \
143130
Eula=accept \
144131
Billing={ENDPOINT_URI} \
145132
ApiKey={API_KEY}
146133
```
147134

148135
This command:
149136

150-
* Runs the Read OCR container from the container image.
151-
* Allocates 8 CPU core and 18 gigabytes (GB) of memory.
137+
* Runs the Read OCR latest GA container from the container image.
138+
* Allocates 8 CPU core and 16 gigabytes (GB) of memory.
152139
* Exposes TCP port 5000 and allocates a pseudo-TTY for the container.
153140
* Automatically removes the container after it exits. The container image is still available on the host computer.
154141

155142
You can alternatively run the container using environment variables:
156143

157144
```bash
158-
docker run --rm -it -p 5000:5000 --memory 18g --cpus 8 \
145+
docker run --rm -it -p 5000:5000 --memory 16g --cpus 8 \
159146
--env Eula=accept \
160147
--env Billing={ENDPOINT_URI} \
161148
--env ApiKey={API_KEY} \
162-
mcr.microsoft.com/azure-cognitive-services/vision/read:3.2
149+
mcr.microsoft.com/azure-cognitive-services/vision/read:3.2-model-2022-04-30
163150
```
164151

165152
# [Version 2.0-preview](#tab/version-2)
@@ -216,18 +203,12 @@ To find your connection string:
216203

217204
The container provides REST-based query prediction endpoint APIs.
218205

219-
For the latest preview:
220-
221-
Use the same Swagger path as 3.2 but a different port if you have already deployed 3.2 at the 5000 port.
222-
223206
Use the host, `http://localhost:5000`, for container APIs. You can view the Swagger path at: `http://localhost:5000/swagger/`.
224207

225208
---
226209

227210
### Asynchronous Read
228211

229-
For the latest preview, everything is the same as 3.2 except for the additional `"modelVersion": "2021-09-30-preview"`.
230-
231212
# [Version 3.2](#tab/version-3-2)
232213

233214
You can use the `POST /vision/v3.2/read/analyze` and `GET /vision/v3.2/read/operations/{operationId}` operations in concert to asynchronously read an image, similar to how the Computer Vision service uses those corresponding REST operations. The asynchronous POST method will return an `operationId` that is used as the identifer to the HTTP GET request.

articles/cognitive-services/Computer-vision/computer-vision-resource-container-config.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -126,8 +126,8 @@ The following Docker examples are for the Read OCR container.
126126
### Basic example
127127

128128
```bash
129-
docker run --rm -it -p 5000:5000 --memory 18g --cpus 8 \
130-
mcr.microsoft.com/azure-cognitive-services/vision/read:3.2 \
129+
docker run --rm -it -p 5000:5000 --memory 16g --cpus 8 \
130+
mcr.microsoft.com/azure-cognitive-services/vision/read:3.2-model-2022-04-30 \
131131
Eula=accept \
132132
Billing={ENDPOINT_URI} \
133133
ApiKey={API_KEY}
@@ -137,8 +137,8 @@ ApiKey={API_KEY}
137137
### Logging example
138138

139139
```bash
140-
docker run --rm -it -p 5000:5000 --memory 18g --cpus 8 \
141-
mcr.microsoft.com/azure-cognitive-services/vision/read:3.2 \
140+
docker run --rm -it -p 5000:5000 --memory 16g --cpus 8 \
141+
mcr.microsoft.com/azure-cognitive-services/vision/read:3.2-model-2022-04-30 \
142142
Eula=accept \
143143
Billing={ENDPOINT_URI} \
144144
ApiKey={API_KEY}
@@ -150,7 +150,7 @@ Logging:Console:LogLevel:Default=Information
150150
### Basic example
151151

152152
```bash
153-
docker run --rm -it -p 5000:5000 --memory 18g --cpus 8 \
153+
docker run --rm -it -p 5000:5000 --memory 16g --cpus 8 \
154154
mcr.microsoft.com/azure-cognitive-services/vision/read:2.0-preview \
155155
Eula=accept \
156156
Billing={ENDPOINT_URI} \
@@ -161,7 +161,7 @@ ApiKey={API_KEY}
161161
### Logging example
162162

163163
```bash
164-
docker run --rm -it -p 5000:5000 --memory 18g --cpus 8 \
164+
docker run --rm -it -p 5000:5000 --memory 16g --cpus 8 \
165165
mcr.microsoft.com/azure-cognitive-services/vision/read:2.0-preview \
166166
Eula=accept \
167167
Billing={ENDPOINT_URI} \

articles/cognitive-services/Computer-vision/includes/container-requirements-and-recommendations.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -17,8 +17,9 @@ The following table describes the minimum and recommended allocation of resource
1717

1818
| Container | Minimum | Recommended |
1919
|-----------|---------|-------------|
20-
| Read 2.0-preview | 1 core, 8-GB memory | 8 cores, 16-GB memory |
21-
| Read 3.2 | 4 cores, 16-GB memory | 8 cores, 24-GB memory |
20+
| Read 3.2 2022-04-30 | 4 cores, 8-GB memory | 8 cores, 16-GB memory |
21+
| Read 3.2 2021-04-12 | 4 cores, 16-GB memory | 8 cores, 24-GB memory |
22+
| Read 2.0-preview | 1 core, 8-GB memory | 8 cores, 16-GB memory |
2223

2324
* Each core must be at least 2.6 gigahertz (GHz) or faster.
2425

articles/cognitive-services/Computer-vision/includes/curl-quickstart.md

Lines changed: 2 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -53,11 +53,9 @@ curl -v -X POST "https://westcentralus.api.cognitive.microsoft.com/vision/v3.2/r
5353
The response will include an `Operation-Location` header, whose value is a unique URL. You use this URL to query the results of the Read operation. The URL expires in 48 hours.
5454

5555
### Optionally, specify the model version
56-
As an optional step, see [How to specify the model version](../Vision-API-How-to-Topics/call-read-api.md#determine-how-to-process-the-data-optional) for the model version parameter values you can use. The most recent model includes any enhancements to the previous GA and preview models.
57-
58-
For example, to use the model-version = `2022-01-30-preview` parameter, use the following command:
56+
As an optional step, see [How to specify the model version](../Vision-API-How-to-Topics/call-read-api.md#determine-how-to-process-the-data-optional). For example, to explicitly specify the latest GA model, use `model-version=2022-04-30` as the parameter. Skipping the parameter or using `model-version=latest` automatically uses the most recent GA model.
5957
```bash
60-
curl -v -X POST "https://westcentralus.api.cognitive.microsoft.com/vision/v3.2/read/analyze?model-version=2022-01-30-preview" -H "Content-Type: application/json" -H "Ocp-Apim-Subscription-Key: <subscription key>" --data-ascii "{\"url\":\"https://upload.wikimedia.org/wikipedia/commons/thumb/a/af/Atomist_quote_from_Democritus.png/338px-Atomist_quote_from_Democritus.png\"}"
58+
curl -v -X POST "https://westcentralus.api.cognitive.microsoft.com/vision/v3.2/read/analyze?model-version=2022-04-30" -H "Content-Type: application/json" -H "Ocp-Apim-Subscription-Key: <subscription key>" --data-ascii "{\"url\":\"https://upload.wikimedia.org/wikipedia/commons/thumb/a/af/Atomist_quote_from_Democritus.png/338px-Atomist_quote_from_Democritus.png\"}"
6159
```
6260

6361
### Get Read results

articles/cognitive-services/Computer-vision/includes/quickstarts-sdk/csharp-sdk.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -88,11 +88,11 @@ Use the OCR client library to read printed and handwritten text from a remote im
8888
> [!IMPORTANT]
8989
> Remember to remove the subscription key from your code when you're done, and never post it publicly. For production, consider using a secure way of storing and accessing your credentials. For example, [Azure key vault](../../../../key-vault/general/overview.md).
9090

91-
1. As an optional step, see [How to specify the model version](../../Vision-API-How-to-Topics/call-read-api.md#determine-how-to-process-the-data-optional) for the model version parameter values you can use. The most recent model includes any enhancements to the previous GA and preview models. For example, to use the model-version=`2022-01-30-preview` parameter, edit the ReadAsync call as shown:
91+
1. As an optional step, see [How to specify the model version](../../Vision-API-How-to-Topics/call-read-api.md#determine-how-to-process-the-data-optional). For example, to explicitly specify the latest GA model, edit the `ReadAsync` call as shown. Skipping the parameter or using `"latest"` automatically uses the most recent GA model.
9292

9393
```csharp
9494
// Read text from URL with a specific model version
95-
var textHeaders = await client.ReadAsync(urlFile,null,null,"2022-01-30-preview");
95+
var textHeaders = await client.ReadAsync(urlFile,null,null,"2022-04-30");
9696
```
9797

9898
1. Run the application.

articles/cognitive-services/Computer-vision/includes/quickstarts-sdk/node-sdk.md

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -78,11 +78,10 @@ Use the Optical character recognition client library to read printed and handwri
7878
> [!IMPORTANT]
7979
> Remember to remove the subscription key from your code when you're done, and never post it publicly. For production, consider using a secure way of storing and accessing your credentials. For example, [Azure key vault](../../../../key-vault/general/overview.md).
8080

81-
82-
1. As an optional step, see [How to specify the model version](../../Vision-API-How-to-Topics/call-read-api.md#determine-how-to-process-the-data-optional) for the model version parameter values you can use. The most recent model includes any enhancements to the previous GA and preview models. For example, to use the model version `2022-01-30-preview`, edit the `read` statement as shown:
81+
1. As an optional step, see [How to specify the model version](../../Vision-API-How-to-Topics/call-read-api.md#determine-how-to-process-the-data-optional). For example, to explicitly specify the latest GA model, edit the `read` statement as shown. Skipping the parameter or using `"latest"` automatically uses the most recent GA model.
8382

8483
```JS
85-
let result = await client.read(url,{modelVersion:"2022-01-30-preview"});
84+
let result = await client.read(url,{modelVersion:"2022-04-30"});
8685
```
8786

8887
1. Run the application with the `node` command on your quickstart file.

articles/cognitive-services/Computer-vision/includes/quickstarts-sdk/python-sdk.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -63,11 +63,11 @@ Use the OCR client library to read printed and handwritten text from a remote im
6363
> [!IMPORTANT]
6464
> Remember to remove the subscription key from your code when you're done, and never post it publicly. For production, consider using a secure way of storing and accessing your credentials. For example, [Azure key vault](../../../../key-vault/general/overview.md).
6565

66-
1. As an optional step, see [How to specify the model version](../../Vision-API-How-to-Topics/call-read-api.md#determine-how-to-process-the-data-optional) for the model version parameter values you can use. The most recent model includes any enhancements to the previous GA and preview models. For example, to use the model version `2022-01-30-preview`, edit the `read` statement as shown:
66+
1. As an optional step, see [How to specify the model version](../../Vision-API-How-to-Topics/call-read-api.md#determine-how-to-process-the-data-optional). For example, to explicitly specify the latest GA model, edit the `read` statement as shown. Skipping the parameter or using `"latest"` automatically uses the most recent GA model.
6767

6868
```python
6969
# Call API with URL and raw response (allows you to get the operation location)
70-
read_response = computervision_client.read(read_image_url, raw=True, model_version="2022-01-30-preview")
70+
read_response = computervision_client.read(read_image_url, raw=True, model_version="2022-04-30")
7171
```
7272

7373
1. Run the application with the `python` command on your quickstart file.

0 commit comments

Comments
 (0)