You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cognitive-services/Computer-vision/whats-new.md
+37-14Lines changed: 37 additions & 14 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,18 +8,37 @@ manager: nitinme
8
8
ms.service: cognitive-services
9
9
ms.subservice: computer-vision
10
10
ms.topic: overview
11
-
ms.date: 05/24/2021
11
+
ms.date: 02/05/2022
12
12
ms.author: pafarley
13
13
---
14
14
15
15
# What's new in Computer Vision
16
16
17
17
Learn what's new in the service. These items may be release notes, videos, blog posts, and other types of information. Bookmark this page to stay up to date with the service.
18
18
19
+
## May 2022
20
+
21
+
### OCR (Read) API Public Preview supports 164 languages
22
+
23
+
Computer Vision's [OCR (Read) API](overview-ocr.md) expands [supported languages](language-support.md) to 164 with its latest preview:
24
+
25
+
* OCR support for print text expands to 42 new languages including Arabic, Hindi and other languages using Arabic and Devanagari scripts.
26
+
* OCR support for handwritten text expands to Japanese and Korean in addition to English, Chinese Simplified, French, German, Italian, Portuguese, and Spanish.
27
+
* Enhancements including better support for extracting handwritten dates, amounts, names, and single character boxes.
28
+
* General performance and AI quality improvements
29
+
30
+
See the [OCR how-to guide](Vision-API-How-to-Topics/call-read-api.md#determine-how-to-process-the-data-optional) to learn how to use the new preview features.
31
+
32
+
> [!div class="nextstepaction"]
33
+
> [Get Started with the Read API](./quickstarts-sdk/client-library.md)
34
+
35
+
19
36
## September 2021
20
37
21
38
### OCR (Read) API Public Preview supports 122 languages
39
+
22
40
Computer Vision's [OCR (Read) API](overview-ocr.md) expands [supported languages](language-support.md) to 122 with its latest preview:
41
+
23
42
* OCR support for print text in 49 new languages including Russian, Bulgarian, and other Cyrillic and more Latin languages.
24
43
* OCR support for handwritten text in 6 new languages that include English, Chinese Simplified, French, German, Italian, Portuguese, and Spanish.
25
44
* Enhancements for processing digital PDFs and Machine Readable Zone (MRZ) text in identity documents.
@@ -30,7 +49,6 @@ See the [OCR how-to guide](Vision-API-How-to-Topics/call-read-api.md#determine-h
30
49
> [!div class="nextstepaction"]
31
50
> [Get Started with the Read API](./quickstarts-sdk/client-library.md)
32
51
33
-
34
52
## August 2021
35
53
36
54
### Image tagging language expansion
@@ -41,20 +59,20 @@ The [latest version (v3.2)](https://westus.dev.cognitive.microsoft.com/docs/serv
41
59
42
60
### Spatial Analysis container update
43
61
44
-
A new version of the [Spatial Analysis container](spatial-analysis-container.md) has been released with a new feature set. This Docker container lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments.
62
+
A new version of the [Spatial Analysis container](spatial-analysis-container.md) has been released with a new feature set. This Docker container lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments.
45
63
46
-
*[Spatial Analysis operations](spatial-analysis-operations.md) can be now configured to detect the orientation that a person is facing.
47
-
* An orientation classifier can be enabled for the `personcrossingline` and `personcrossingpolygon` operations by configuring the `enable_orientation` parameter. It is set to off by default.
64
+
*[Spatial Analysis operations](spatial-analysis-operations.md) can be now configured to detect the orientation that a person is facing.
65
+
* An orientation classifier can be enabled for the `personcrossingline` and `personcrossingpolygon` operations by configuring the `enable_orientation` parameter. It is set to off by default.
48
66
49
67
*[Spatial Analysis operations](spatial-analysis-operations.md) now also offers configuration to detect a person's speed while walking/running
50
-
* Speed can be detected for the `personcrossingline` and `personcrossingpolygon` operations by turning on the `enable_speed` classifier, which is off by default. The output is reflected in the `speed`, `avgSpeed`, and `minSpeed` outputs.
51
-
68
+
* Speed can be detected for the `personcrossingline` and `personcrossingpolygon` operations by turning on the `enable_speed` classifier, which is off by default. The output is reflected in the `speed`, `avgSpeed`, and `minSpeed` outputs.
52
69
53
70
## April 2021
54
71
55
72
### Computer Vision v3.2 GA
56
73
57
74
The Computer Vision API v3.2 is now generally available with the following updates:
75
+
58
76
* Improved image tagging model: analyzes visual content and generates relevant tags based on objects, actions, and content displayed in the image. This model is available through the [Tag Image API](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/56f91f2e778daf14a499f200). See the Image Analysis [how-to guide](./vision-api-how-to-topics/howtocallvisionapi.md) and [overview](./overview-image-analysis.md) to learn more.
59
77
* Updated content moderation model: detects presence of adult content and provides flags to filter images containing adult, racy, and gory visual content. This model is available through the [Analyze API](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/56f91f2e778daf14a499f21b). See the Image Analysis [how-to guide](./vision-api-how-to-topics/howtocallvisionapi.md) and [overview](./overview-image-analysis.md) to learn more.
60
78
*[OCR (Read) available for 73 languages](./language-support.md#optical-character-recognition-ocr) including Simplified and Traditional Chinese, Japanese, Korean, and Latin languages.
@@ -75,7 +93,9 @@ The Computer Vision API v3.2 public preview has been updated. The preview releas
75
93
## February 2021
76
94
77
95
### Read API v3.2 Public Preview with OCR support for 73 languages
96
+
78
97
The Computer Vision Read API v3.2 public preview, available as cloud service and Docker container, includes these updates:
98
+
79
99
*[OCR for 73 languages](./language-support.md#optical-character-recognition-ocr) including Simplified and Traditional Chinese, Japanese, Korean, and Latin languages.
80
100
* Natural reading order for the text line output (Latin languages only)
81
101
* Handwriting style classification for text lines along with a confidence score (Latin languages only).
@@ -87,24 +107,22 @@ See the [Read API how-to guide](Vision-API-How-to-Topics/call-read-api.md) to le
87
107
> [!div class="nextstepaction"]
88
108
> [Use the Read API v3.2 Public Preview](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/5d986960601faab4bf452005)
89
109
90
-
91
110
## January 2021
92
111
93
112
### Spatial Analysis container update
94
113
95
-
A new version of the [Spatial Analysis container](spatial-analysis-container.md) has been released with a new feature set. This Docker container lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments.
114
+
A new version of the [Spatial Analysis container](spatial-analysis-container.md) has been released with a new feature set. This Docker container lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments.
96
115
97
-
*[Spatial Analysis operations](spatial-analysis-operations.md) can be now configured to detect if a person is wearing a protective face covering such as a mask.
98
-
* A mask classifier can be enabled for the `personcount`, `personcrossingline` and `personcrossingpolygon` operations by configuring the `ENABLE_FACE_MASK_CLASSIFIER` parameter.
99
-
* The attributes `face_mask` and `face_noMask` will be returned as metadata with confidence score for each person detected in the video stream
116
+
*[Spatial Analysis operations](spatial-analysis-operations.md) can be now configured to detect if a person is wearing a protective face covering such as a mask.
117
+
* A mask classifier can be enabled for the `personcount`, `personcrossingline` and `personcrossingpolygon` operations by configuring the `ENABLE_FACE_MASK_CLASSIFIER` parameter.
118
+
* The attributes `face_mask` and `face_noMask` will be returned as metadata with confidence score for each person detected in the video stream
100
119
* The *personcrossingpolygon* operation has been extended to allow the calculation of the dwell time a person spends in a zone. You can set the `type` parameter in the Zone configuration for the operation to `zonedwelltime` and a new event of type *personZoneDwellTimeEvent* will include the `durationMs` field populated with the number of milliseconds that the person spent in the zone.
101
120
***Breaking change**: The *personZoneEvent* event has been renamed to *personZoneEnterExitEvent*. This event is raised by the *personcrossingpolygon* operation when a person enters or exits the zone and provides directional info with the numbered side of the zone that was crossed.
102
121
* Video URL can be provided as "Private Parameter/obfuscated" in all operations. Obfuscation is optional now and it will only work if `KEY` and `IV` are provided as environment variables.
103
122
* Calibration is enabled by default for all operations. Set the `do_calibration: false` to disable it.
104
123
* Added support for auto recalibration (by default disabled) via the `enable_recalibration` parameter, please refer to [Spatial Analysis operations](./spatial-analysis-operations.md) for details
105
124
* Camera calibration parameters to the `DETECTOR_NODE_CONFIG`. Refer to [Spatial Analysis operations](./spatial-analysis-operations.md) for details.
106
125
107
-
108
126
## October 2020
109
127
110
128
### Computer Vision API v3.1 GA
@@ -115,10 +133,12 @@ The Computer Vision API in General Availability has been upgraded to v3.1.
115
133
116
134
### Spatial Analysis container preview
117
135
118
-
The [Spatial Analysis container](spatial-analysis-container.md) is now in preview. The Spatial Analysis feature of Computer Vision lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments. Spatial Analysis is a Docker container you can use on-premises.
136
+
The [Spatial Analysis container](spatial-analysis-container.md) is now in preview. The Spatial Analysis feature of Computer Vision lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments. Spatial Analysis is a Docker container you can use on-premises.
119
137
120
138
### Read API v3.1 Public Preview adds OCR for Japanese
139
+
121
140
The Computer Vision Read API v3.1 public preview adds these capabilities:
141
+
122
142
* OCR for Japanese language
123
143
* For each text line, indicate whether the appearance is Handwriting or Print style, along with a confidence score (Latin languages only).
124
144
* For a multi-page document extract text only for selected pages or page range.
@@ -133,6 +153,7 @@ See the [Read API how-to guide](Vision-API-How-to-Topics/call-read-api.md) to le
133
153
## July 2020
134
154
135
155
### Read API v3.1 Public Preview with OCR for Simplified Chinese
156
+
136
157
The Computer Vision Read API v3.1 public preview adds support for Simplified Chinese.
137
158
138
159
* This preview version of the Read API supports English, Dutch, French, German, Italian, Portuguese, Simplified Chinese, and Spanish languages.
@@ -143,6 +164,7 @@ See the [Read API how-to guide](Vision-API-How-to-Topics/call-read-api.md) to le
143
164
> [Learn more about Read API v3.1 Public Preview 1](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-1-preview-1/operations/5d986960601faab4bf452005)
144
165
145
166
## May 2020
167
+
146
168
Computer Vision API v3.0 entered General Availability, with updates to the Read API:
147
169
148
170
* Support for English, Dutch, French, German, Italian, Portuguese, and Spanish
@@ -161,6 +183,7 @@ See the [OCR overview](overview-ocr.md) to learn more.
161
183
### Read API 3.0 Public Preview
162
184
163
185
You now can use version 3.0 of the Read API to extract printed or handwritten text from images. Compared to earlier versions, 3.0 provides:
0 commit comments