Skip to content

Commit 1bc2b62

Browse files
authored
Update whats-new.md
1 parent db32664 commit 1bc2b62

File tree

1 file changed

+37
-14
lines changed

1 file changed

+37
-14
lines changed

articles/cognitive-services/Computer-vision/whats-new.md

Lines changed: 37 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -8,18 +8,37 @@ manager: nitinme
88
ms.service: cognitive-services
99
ms.subservice: computer-vision
1010
ms.topic: overview
11-
ms.date: 05/24/2021
11+
ms.date: 02/05/2022
1212
ms.author: pafarley
1313
---
1414

1515
# What's new in Computer Vision
1616

1717
Learn what's new in the service. These items may be release notes, videos, blog posts, and other types of information. Bookmark this page to stay up to date with the service.
1818

19+
## May 2022
20+
21+
### OCR (Read) API Public Preview supports 164 languages
22+
23+
Computer Vision's [OCR (Read) API](overview-ocr.md) expands [supported languages](language-support.md) to 164 with its latest preview:
24+
25+
* OCR support for print text expands to 42 new languages including Arabic, Hindi and other languages using Arabic and Devanagari scripts.
26+
* OCR support for handwritten text expands to Japanese and Korean in addition to English, Chinese Simplified, French, German, Italian, Portuguese, and Spanish.
27+
* Enhancements including better support for extracting handwritten dates, amounts, names, and single character boxes.
28+
* General performance and AI quality improvements
29+
30+
See the [OCR how-to guide](Vision-API-How-to-Topics/call-read-api.md#determine-how-to-process-the-data-optional) to learn how to use the new preview features.
31+
32+
> [!div class="nextstepaction"]
33+
> [Get Started with the Read API](./quickstarts-sdk/client-library.md)
34+
35+
1936
## September 2021
2037

2138
### OCR (Read) API Public Preview supports 122 languages
39+
2240
Computer Vision's [OCR (Read) API](overview-ocr.md) expands [supported languages](language-support.md) to 122 with its latest preview:
41+
2342
* OCR support for print text in 49 new languages including Russian, Bulgarian, and other Cyrillic and more Latin languages.
2443
* OCR support for handwritten text in 6 new languages that include English, Chinese Simplified, French, German, Italian, Portuguese, and Spanish.
2544
* Enhancements for processing digital PDFs and Machine Readable Zone (MRZ) text in identity documents.
@@ -30,7 +49,6 @@ See the [OCR how-to guide](Vision-API-How-to-Topics/call-read-api.md#determine-h
3049
> [!div class="nextstepaction"]
3150
> [Get Started with the Read API](./quickstarts-sdk/client-library.md)
3251
33-
3452
## August 2021
3553

3654
### Image tagging language expansion
@@ -41,20 +59,20 @@ The [latest version (v3.2)](https://westus.dev.cognitive.microsoft.com/docs/serv
4159

4260
### Spatial Analysis container update
4361

44-
A new version of the [Spatial Analysis container](spatial-analysis-container.md) has been released with a new feature set. This Docker container lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments.
62+
A new version of the [Spatial Analysis container](spatial-analysis-container.md) has been released with a new feature set. This Docker container lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments.
4563

46-
* [Spatial Analysis operations](spatial-analysis-operations.md) can be now configured to detect the orientation that a person is facing.
47-
* An orientation classifier can be enabled for the `personcrossingline` and `personcrossingpolygon` operations by configuring the `enable_orientation` parameter. It is set to off by default.
64+
* [Spatial Analysis operations](spatial-analysis-operations.md) can be now configured to detect the orientation that a person is facing.
65+
* An orientation classifier can be enabled for the `personcrossingline` and `personcrossingpolygon` operations by configuring the `enable_orientation` parameter. It is set to off by default.
4866

4967
* [Spatial Analysis operations](spatial-analysis-operations.md) now also offers configuration to detect a person's speed while walking/running
50-
* Speed can be detected for the `personcrossingline` and `personcrossingpolygon` operations by turning on the `enable_speed` classifier, which is off by default. The output is reflected in the `speed`, `avgSpeed`, and `minSpeed` outputs.
51-
68+
* Speed can be detected for the `personcrossingline` and `personcrossingpolygon` operations by turning on the `enable_speed` classifier, which is off by default. The output is reflected in the `speed`, `avgSpeed`, and `minSpeed` outputs.
5269

5370
## April 2021
5471

5572
### Computer Vision v3.2 GA
5673

5774
The Computer Vision API v3.2 is now generally available with the following updates:
75+
5876
* Improved image tagging model: analyzes visual content and generates relevant tags based on objects, actions, and content displayed in the image. This model is available through the [Tag Image API](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/56f91f2e778daf14a499f200). See the Image Analysis [how-to guide](./vision-api-how-to-topics/howtocallvisionapi.md) and [overview](./overview-image-analysis.md) to learn more.
5977
* Updated content moderation model: detects presence of adult content and provides flags to filter images containing adult, racy, and gory visual content. This model is available through the [Analyze API](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/56f91f2e778daf14a499f21b). See the Image Analysis [how-to guide](./vision-api-how-to-topics/howtocallvisionapi.md) and [overview](./overview-image-analysis.md) to learn more.
6078
* [OCR (Read) available for 73 languages](./language-support.md#optical-character-recognition-ocr) including Simplified and Traditional Chinese, Japanese, Korean, and Latin languages.
@@ -75,7 +93,9 @@ The Computer Vision API v3.2 public preview has been updated. The preview releas
7593
## February 2021
7694

7795
### Read API v3.2 Public Preview with OCR support for 73 languages
96+
7897
The Computer Vision Read API v3.2 public preview, available as cloud service and Docker container, includes these updates:
98+
7999
* [OCR for 73 languages](./language-support.md#optical-character-recognition-ocr) including Simplified and Traditional Chinese, Japanese, Korean, and Latin languages.
80100
* Natural reading order for the text line output (Latin languages only)
81101
* Handwriting style classification for text lines along with a confidence score (Latin languages only).
@@ -87,24 +107,22 @@ See the [Read API how-to guide](Vision-API-How-to-Topics/call-read-api.md) to le
87107
> [!div class="nextstepaction"]
88108
> [Use the Read API v3.2 Public Preview](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/5d986960601faab4bf452005)
89109
90-
91110
## January 2021
92111

93112
### Spatial Analysis container update
94113

95-
A new version of the [Spatial Analysis container](spatial-analysis-container.md) has been released with a new feature set. This Docker container lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments.
114+
A new version of the [Spatial Analysis container](spatial-analysis-container.md) has been released with a new feature set. This Docker container lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments.
96115

97-
* [Spatial Analysis operations](spatial-analysis-operations.md) can be now configured to detect if a person is wearing a protective face covering such as a mask.
98-
* A mask classifier can be enabled for the `personcount`, `personcrossingline` and `personcrossingpolygon` operations by configuring the `ENABLE_FACE_MASK_CLASSIFIER` parameter.
99-
* The attributes `face_mask` and `face_noMask` will be returned as metadata with confidence score for each person detected in the video stream
116+
* [Spatial Analysis operations](spatial-analysis-operations.md) can be now configured to detect if a person is wearing a protective face covering such as a mask.
117+
* A mask classifier can be enabled for the `personcount`, `personcrossingline` and `personcrossingpolygon` operations by configuring the `ENABLE_FACE_MASK_CLASSIFIER` parameter.
118+
* The attributes `face_mask` and `face_noMask` will be returned as metadata with confidence score for each person detected in the video stream
100119
* The *personcrossingpolygon* operation has been extended to allow the calculation of the dwell time a person spends in a zone. You can set the `type` parameter in the Zone configuration for the operation to `zonedwelltime` and a new event of type *personZoneDwellTimeEvent* will include the `durationMs` field populated with the number of milliseconds that the person spent in the zone.
101120
* **Breaking change**: The *personZoneEvent* event has been renamed to *personZoneEnterExitEvent*. This event is raised by the *personcrossingpolygon* operation when a person enters or exits the zone and provides directional info with the numbered side of the zone that was crossed.
102121
* Video URL can be provided as "Private Parameter/obfuscated" in all operations. Obfuscation is optional now and it will only work if `KEY` and `IV` are provided as environment variables.
103122
* Calibration is enabled by default for all operations. Set the `do_calibration: false` to disable it.
104123
* Added support for auto recalibration (by default disabled) via the `enable_recalibration` parameter, please refer to [Spatial Analysis operations](./spatial-analysis-operations.md) for details
105124
* Camera calibration parameters to the `DETECTOR_NODE_CONFIG`. Refer to [Spatial Analysis operations](./spatial-analysis-operations.md) for details.
106125

107-
108126
## October 2020
109127

110128
### Computer Vision API v3.1 GA
@@ -115,10 +133,12 @@ The Computer Vision API in General Availability has been upgraded to v3.1.
115133

116134
### Spatial Analysis container preview
117135

118-
The [Spatial Analysis container](spatial-analysis-container.md) is now in preview. The Spatial Analysis feature of Computer Vision lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments. Spatial Analysis is a Docker container you can use on-premises.
136+
The [Spatial Analysis container](spatial-analysis-container.md) is now in preview. The Spatial Analysis feature of Computer Vision lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments. Spatial Analysis is a Docker container you can use on-premises.
119137

120138
### Read API v3.1 Public Preview adds OCR for Japanese
139+
121140
The Computer Vision Read API v3.1 public preview adds these capabilities:
141+
122142
* OCR for Japanese language
123143
* For each text line, indicate whether the appearance is Handwriting or Print style, along with a confidence score (Latin languages only).
124144
* For a multi-page document extract text only for selected pages or page range.
@@ -133,6 +153,7 @@ See the [Read API how-to guide](Vision-API-How-to-Topics/call-read-api.md) to le
133153
## July 2020
134154

135155
### Read API v3.1 Public Preview with OCR for Simplified Chinese
156+
136157
The Computer Vision Read API v3.1 public preview adds support for Simplified Chinese.
137158

138159
* This preview version of the Read API supports English, Dutch, French, German, Italian, Portuguese, Simplified Chinese, and Spanish languages.
@@ -143,6 +164,7 @@ See the [Read API how-to guide](Vision-API-How-to-Topics/call-read-api.md) to le
143164
> [Learn more about Read API v3.1 Public Preview 1](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-1-preview-1/operations/5d986960601faab4bf452005)
144165
145166
## May 2020
167+
146168
Computer Vision API v3.0 entered General Availability, with updates to the Read API:
147169

148170
* Support for English, Dutch, French, German, Italian, Portuguese, and Spanish
@@ -161,6 +183,7 @@ See the [OCR overview](overview-ocr.md) to learn more.
161183
### Read API 3.0 Public Preview
162184

163185
You now can use version 3.0 of the Read API to extract printed or handwritten text from images. Compared to earlier versions, 3.0 provides:
186+
164187
* Improved accuracy
165188
* New output format
166189
* Confidence score for each extracted word

0 commit comments

Comments
 (0)