Commit 4d9781c
Merge pull request #214172 from PatrickFarley/comvis-4-rest
[cog svcs] Comvis 4 rest
2 parents 9d1d9a1 + 738e6f5 commit 4d9781c

12 files changed: +70 −82 lines

articles/cognitive-services/Computer-vision/computer-vision-how-to-install-containers.md

Lines changed: 3 additions & 3 deletions
@@ -1,7 +1,7 @@
 ---
-title: Install Read OCR Docker containers from Computer Vision
+title: Computer Vision 3.2 GA Read OCR container
 titleSuffix: Azure Cognitive Services
-description: Use the Read OCR Docker containers from Computer Vision to extract text from images and documents, on-premises.
+description: Use the Read 3.2 OCR containers from Computer Vision to extract text from images and documents, on-premises.
 services: cognitive-services
 author: PatrickFarley
 manager: nitinme
@@ -14,7 +14,7 @@ ms.custom: seodec18, cog-serv-seo-aug-2020
 keywords: on-premises, OCR, Docker, container
 ---

-# Install Read OCR Docker containers
+# Install Computer Vision 3.2 GA Read OCR container

 [!INCLUDE [container hosting on the Microsoft Container Registry](../containers/includes/gated-container-hosting.md)]

articles/cognitive-services/Computer-vision/concept-generating-thumbnails.md

Lines changed: 3 additions & 0 deletions
@@ -49,6 +49,9 @@ The following table illustrates thumbnails defined by smart-cropping for the exa
 The Computer Vision smart-cropping utility takes a given aspect ratio (or several) and returns the bounding box coordinates (in pixels) of the region(s) identified. Your app can then crop and return the image using those coordinates.

+> [!IMPORTANT]
+> This feature uses face detection to help determine important regions in the image. The detection does not involve distinguishing one face from another face, predicting or classifying facial attributes, or creating a facial template (a unique set of numbers generated from an image that represents the distinctive features of a face).
+
 ---

 ## Use the API
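The smart-cropping utility described above returns the region in pixels; the client then crops with those coordinates. A minimal sketch of that conversion step follows. The field names `x`, `y`, `w`, `h` are illustrative assumptions about the response shape, not the exact API schema.

```python
def crop_box(x: int, y: int, w: int, h: int) -> tuple:
    """Convert a smart-crop region (origin plus width/height, in pixels)
    into the (left, top, right, bottom) tuple most imaging libraries
    (for example Pillow's Image.crop) expect."""
    return (x, y, x + w, y + h)

# Hypothetical region returned for a 16:9 aspect-ratio request.
print(crop_box(40, 10, 320, 180))  # (40, 10, 360, 190)
```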

articles/cognitive-services/Computer-vision/concept-ocr.md

Lines changed: 2 additions & 3 deletions
@@ -18,14 +18,13 @@ ms.author: pafarley
 Version 4.0 of Image Analysis offers the ability to extract text from images. Contextual information like line number and position is also returned. Text reading is also available through the [OCR service](overview-ocr.md), but the latest model version is available through Image Analysis. This version is optimized for image inputs as opposed to documents.

-> [!IMPORTANT]
-> you need Image Analysis version 4.0 to use this feature. Version 4.0 is currently available to resources in the following Azure regions: East US, France Central, Korea Central, North Europe, Southeast Asia, West Europe, West US.
+[!INCLUDE [read-editions](./includes/read-editions.md)]

 ## Reading text example

 The following JSON response illustrates what the Analyze API returns when reading text in the given image.

-![Photo of a sticky note with writing on it.](./Images/handwritten-note.jpg).
+![Photo of a sticky note with writing on it.](./Images/handwritten-note.jpg)

 ```json
 {

articles/cognitive-services/Computer-vision/concept-people-detection.md

Lines changed: 2 additions & 2 deletions
@@ -19,13 +19,13 @@ ms.author: pafarley
 Version 4.0 of Image Analysis offers the ability to detect people appearing in images. The bounding box coordinates of each detected person are returned, along with a confidence score.

 > [!IMPORTANT]
-> you need Image Analysis version 4.0 to use this feature. Version 4.0 is currently available to resources in the following Azure regions: East US, France Central, Korea Central, North Europe, Southeast Asia, West Europe, West US.
+> We built this model by enhancing our object detection model for person detection scenarios. People detection does not involve distinguishing one face from another face, predicting or classifying facial attributes, or creating a facial template (a unique set of numbers generated from an image that represents the distinctive features of a face).

 ## People detection example

 The following JSON response illustrates what the Analyze API returns when describing the example image based on its visual features.

-![Photo of a woman in a kitchen.](./Images/windows-kitchen.jpg).
+![Photo of a woman in a kitchen.](./Images/windows-kitchen.jpg)

 ```json
 {

articles/cognitive-services/Computer-vision/faq.yml

Lines changed: 4 additions & 17 deletions
@@ -20,22 +20,17 @@ summary: |
 sections:
-  - name: General Computer Vision questions
+  - name: Computer Vision API frequently asked questions
     questions:
       - question: |
           How can I increase the transactions-per-second (TPS) allowed by the service?
         answer: |
-          The free (S0) tier only allows 20 transactions per minute. Upgrade to the S1 tier to get up to 30 transactions per second. If you're seeing the error code 429 and the "Too many requests" error message, [submit an Azure support ticket](https://azure.microsoft.com/support/create-ticket/) to raise your TPS to 50 or higher with a brief business justification. [Computer Vision pricing](https://azure.microsoft.com/pricing/details/cognitive-services/computer-vision/#pricing).
+          The free (S0) tier only allows 20 transactions per minute. Upgrade to the S1 tier to get up to 30 transactions per second. If you're seeing the error code 429 and the "Too many requests" error message, [submit an Azure support ticket](https://azure.microsoft.com/support/create-ticket/) to raise your TPS to 50 or higher with a brief business justification. [Computer Vision pricing](https://azure.microsoft.com/pricing/details/cognitive-services/computer-vision/#pricing).

       - question: |
           The service is throwing an error because my image file is too large. How can I work around this?
         answer: |
-          The file size limit for most Computer Vision features is 4 MB, but the client library SDKs can handle files up to 6 MB. For Optical Character Recognition (OCR) that handles multi-page documents, the maximum file size is 50 MB. For more information, see the [Image Analysis input limits](overview-image-analysis.md#image-requirements) and [OCR input limits](how-to/call-read-api.md#input-requirements).
-
-      - question: |
-          How can I process multi-page documents with OCR in a single call?
-        answer: |
-          Optical Character Recognition, specifically the Read operation, supports multi-page documents as the API input. If you call the API with a 10-page document, you'll be billed for 10 pages, with each page counted as a billable transaction. If you have the free (S0) tier, it can only process two pages at a time.
+          The file size limit for most Computer Vision features is 4 MB for the 3.2 version of the API and 20 MB for the 4.0 preview version, and the client library SDKs can handle files up to 6 MB. For more information, see the [Image Analysis input limits](overview-image-analysis.md#image-requirements).

       - question: |
           Can I send multiple images in a single API call to the Computer Vision service?
@@ -46,19 +41,11 @@ sections:
         answer: |
           See the [Language support](language-support.md) page for the list of languages covered by Image Analysis and OCR.

-  - name: OCR service questions
-    questions:
-      - question: |
-          How can I process multi-page documents with OCR in a single call?
-        answer: |
-          Optical Character Recognition, specifically the Read operation, supports multi-page documents as the API input. If you call the API with a 10-page document, you'll be billed for 10 pages, with each page counted as a billable transaction. Note that if you have the free (S0) tier, it can only process two pages at a time.
-
       - question: |
           Can I deploy the OCR (Read) capability on-premises?
         answer: |
-          Yes, the OCR (Read) cloud API is also available as a Docker container for on-premises deployment. Learn [how to deploy the OCR containers](./computer-vision-how-to-install-containers.md).
+          Yes, the Computer Vision 3.2 OCR (Read) cloud API is also available as a Docker container for on-premises deployment. Learn [how to deploy the OCR containers](./computer-vision-how-to-install-containers.md).

-  - name: Image Analysis service questions
-    questions:
       - question: |
           Can I train Computer Vision API to use custom tags? For example, I would like to feed in pictures of cat breeds to 'train' the AI, then receive the breed value on an AI request.
         answer: |
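The 429 "Too many requests" behavior covered in the TPS question is typically handled client-side with retry and backoff while a quota increase is pending. A generic sketch, not specific to any Azure SDK; `call` stands in for whatever function issues the request:

```python
import time

def with_backoff(call, max_retries=4, base_delay=1.0):
    """Retry `call` with exponential backoff while it returns HTTP 429.
    `call` is any zero-argument function returning (status, body)."""
    for attempt in range(max_retries):
        status, body = call()
        if status != 429:
            return status, body
        time.sleep(base_delay * (2 ** attempt))  # 1s, 2s, 4s, ...
    return call()  # final attempt, returned as-is

# Simulated service that throttles the first two calls, then succeeds.
responses = iter([(429, ""), (429, ""), (200, "ok")])
status, body = with_backoff(lambda: next(responses), base_delay=0.01)
print(status, body)  # 200 ok
```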

articles/cognitive-services/Computer-vision/how-to/call-analyze-image.md

Lines changed: 39 additions & 48 deletions
@@ -76,25 +76,19 @@ The Analyze API gives you access to all of the service's image analysis features

 #### [REST](#tab/rest)

-You can specify which features you want to use by setting the URL query parameters of the [Analyze API](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/56f91f2e778daf14a499f21b). A parameter can have multiple values, separated by commas. Each feature you specify will require more computation time, so only specify what you need.
+You can specify which features you want to use by setting the URL query parameters of the [Analyze API](https://aka.ms/vision-4-0-ref). A parameter can have multiple values, separated by commas. Each feature you specify will require more computation time, so only specify what you need.

 |URL parameter | Value | Description|
 |---|---|--|
-|`visualFeatures`|`Adult` | detects if the image is pornographic in nature (depicts nudity or a sex act), or is gory (depicts extreme violence or blood). Sexually suggestive content ("racy" content) is also detected.|
-|`visualFeatures`|`Brands` | detects various brands within an image, including the approximate location. The Brands argument is only available in English.|
-|`visualFeatures`|`Categories` | categorizes image content according to a taxonomy defined in documentation. This value is the default value of `visualFeatures`.|
-|`visualFeatures`|`Color` | determines the accent color, dominant color, and whether an image is black&white.|
-|`visualFeatures`|`Description` | describes the image content with a complete sentence in supported languages.|
-|`visualFeatures`|`Faces` | detects if faces are present. If present, generate coordinates, gender and age.|
-|`visualFeatures`|`ImageType` | detects if image is clip art or a line drawing.|
-|`visualFeatures`|`Objects` | detects various objects within an image, including the approximate location. The Objects argument is only available in English.|
-|`visualFeatures`|`Tags` | tags the image with a detailed list of words related to the image content.|
-|`details`| `Celebrities` | identifies celebrities if detected in the image.|
-|`details`|`Landmarks` |identifies landmarks if detected in the image.|
+|`features`|`Read` | reads the visible text in the image and outputs it as structured JSON data.|
+|`features`|`Description` | describes the image content with a complete sentence in supported languages.|
+|`features`|`SmartCrops` | finds the rectangle coordinates that would crop the image to a desired aspect ratio while preserving the area of interest.|
+|`features`|`Objects` | detects various objects within an image, including the approximate location. The Objects argument is only available in English.|
+|`features`|`Tags` | tags the image with a detailed list of words related to the image content.|

 A populated URL might look like this:

-`https://{endpoint}/vision/v2.1/analyze?visualFeatures=Description,Tags&details=Celebrities`
+`https://{endpoint}/computervision/imageanalysis:analyze?api-version=2022-10-12-preview&features=Tags`

 #### [C#](#tab/csharp)
@@ -143,7 +137,7 @@ The following URL query parameter specifies the language. The default value is `

 A populated URL might look like this:

-`https://{endpoint}/vision/v2.1/analyze?visualFeatures=Description,Tags&details=Celebrities&language=en`
+`https://{endpoint}/computervision/imageanalysis:analyze?api-version=2022-10-12-preview&features=Tags&language=en`

 #### [C#](#tab/csharp)
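The populated 4.0 URLs shown in this file can be assembled programmatically; a minimal standard-library sketch follows. The endpoint value is a placeholder, and the surrounding request details (a POST with an `Ocp-Apim-Subscription-Key` header and a JSON body naming the image) follow the usual Azure Cognitive Services pattern and should be confirmed against the API reference.

```python
from urllib.parse import urlencode

def build_analyze_url(endpoint, features, language=None):
    """Build an Image Analysis 4.0 (preview) Analyze URL.
    `features` is a list joined with commas, matching the query format
    shown in the populated URL examples."""
    params = {"api-version": "2022-10-12-preview",
              "features": ",".join(features)}
    if language:
        params["language"] = language
    # safe="," keeps the comma-separated feature list unescaped
    query = urlencode(params, safe=",")
    return f"{endpoint}/computervision/imageanalysis:analyze?{query}"

print(build_analyze_url("https://{endpoint}", ["Tags"], language="en"))
```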
@@ -198,44 +192,41 @@ This section shows you how to parse the results of the API call. It includes the
 The service returns a `200` HTTP response, and the body contains the returned data in the form of a JSON string. The following text is an example of a JSON response.

 ```json
-{
-    "tags":[
-        {
-            "name":"outdoor",
-            "score":0.976
+{
+    "metadata":
+    {
+        "width": 300,
+        "height": 200
     },
-        {
-            "name":"bird",
-            "score":0.95
+    "tagsResult":
+    {
+        "values":
+        [
+            {
+                "name": "grass",
+                "confidence": 0.9960499405860901
+            },
+            {
+                "name": "outdoor",
+                "confidence": 0.9956876635551453
+            },
+            {
+                "name": "building",
+                "confidence": 0.9893627166748047
+            },
+            {
+                "name": "property",
+                "confidence": 0.9853052496910095
+            },
+            {
+                "name": "plant",
+                "confidence": 0.9791355729103088
+            }
+        ]
     }
-    ],
-    "description":{
-        "tags":[
-            "outdoor",
-            "bird"
-        ],
-        "captions":[
-            {
-                "text":"partridge in a pear tree",
-                "confidence":0.96
-            }
-        ]
-    }
 }
 ```

-See the following table for explanations of the fields in this example:
-
-Field | Type | Content
-------|------|------|
-Tags | `object` | The top-level object for an array of tags.
-tags[].Name | `string` | The keyword from the tags classifier.
-tags[].Score | `number` | The confidence score, between 0 and 1.
-description | `object` | The top-level object for an image description.
-description.tags[] | `string` | The list of tags. If there is insufficient confidence in the ability to produce a caption, the tags might be the only information available to the caller.
-description.captions[].text | `string` | A phrase describing the image.
-description.captions[].confidence | `number` | The confidence score for the phrase.

 ### Error codes

 See the following list of possible errors and their causes:
@@ -292,4 +283,4 @@ The following code calls the Image Analysis API and prints the results to the console

 ## Next steps

 * Explore the [concept articles](../concept-object-detection.md) to learn more about each feature.
-* See the [API reference](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/56f91f2e778daf14a499f21b) to learn more about the API functionality.
+* See the [API reference](https://aka.ms/vision-4-0-ref) to learn more about the API functionality.
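The 4.0 response in this file nests tags under `tagsResult.values`, each with a `confidence` score. A short sketch of pulling them out after deserializing the body; the sample values are abridged from the example response:

```python
import json

# Sample shaped like the 4.0 tagsResult payload shown in the diff above.
response_body = json.loads("""
{
  "metadata": {"width": 300, "height": 200},
  "tagsResult": {"values": [
    {"name": "grass", "confidence": 0.996},
    {"name": "outdoor", "confidence": 0.9957},
    {"name": "plant", "confidence": 0.9791}
  ]}
}
""")

# Keep only tags above a confidence threshold.
tags = [t["name"] for t in response_body["tagsResult"]["values"]
        if t["confidence"] >= 0.99]
print(tags)  # ['grass', 'outdoor']
```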

articles/cognitive-services/Computer-vision/how-to/call-read-api.md

Lines changed: 3 additions & 3 deletions
@@ -16,10 +16,10 @@ ms.author: pafarley

 # Call the Computer Vision 3.2 GA Read API

-[!INCLUDE [read-editions](../includes/read-editions.md)]
-
 In this guide, you'll learn how to call the v3.2 GA Read API to extract text from images. You'll learn the different ways you can configure the behavior of this API to meet your needs. This guide assumes you have already <a href="https://portal.azure.com/#create/Microsoft.CognitiveServicesComputerVision" title="created a Computer Vision resource" target="_blank">created a Computer Vision resource</a> and obtained a key and endpoint URL. If you haven't, follow a [quickstart](../quickstarts-sdk/client-library.md) to get started.

+[!INCLUDE [read-editions](../includes/read-editions.md)]
+
 ## Input requirements

 The **Read** call takes images and documents as its input. They have the following requirements:
@@ -43,7 +43,7 @@ When using the Read operation, use the following values for the optional `model-version` parameter.
 | latest | Latest GA model|
 | [2022-04-30](../whats-new.md#may-2022) | Latest GA model. 164 languages for print text and 9 languages for handwritten text along with several enhancements on quality and performance |
 | [2022-01-30-preview](../whats-new.md#february-2022) | Preview model adds print text support for Hindi, Arabic and related languages. For handwritten text, adds support for Japanese and Korean. |
-| [2021-09-30-preview](../whats-new.md#september-2021) | Preview model adds print text support for Russian and other Cyrillic languages, For handwritten text, adds support for Chinese Simplified, French, German, Italian, Portuguese, and Spanish. |
+| [2021-09-30-preview](../whats-new.md#september-2021) | Preview model adds print text support for Russian and other Cyrillic languages. For handwritten text, adds support for Chinese Simplified, French, German, Italian, Portuguese, and Spanish. |
 | 2021-04-12 | 2021 GA model |

 ### Input language
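The optional `model-version` values tabulated above are passed as a query parameter on the Read call. A minimal sketch of pinning a specific model; the resource endpoint is a placeholder, and the `/vision/v3.2/read/analyze` path should be checked against the 3.2 API reference.

```python
from urllib.parse import urlencode

def read_analyze_url(endpoint, model_version="latest"):
    """Build a Read 3.2 request URL pinned to a model version
    from the table above."""
    query = urlencode({"model-version": model_version})
    return f"{endpoint}/vision/v3.2/read/analyze?{query}"

print(read_analyze_url("https://my-resource.cognitiveservices.azure.com",
                       "2022-04-30"))
# https://my-resource.cognitiveservices.azure.com/vision/v3.2/read/analyze?model-version=2022-04-30
```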

articles/cognitive-services/Computer-vision/includes/read-editions.md

Lines changed: 1 addition & 1 deletion
@@ -19,7 +19,7 @@ ms.author: pafarley
 >
 > | Input | Examples | Suggested API | Benefits |
 > |----------|--------------|-------------------------|-------------------------|
-> | General in-the-wild images with single image at a time | labels, street signs, and posters | [Image&nbsp;Analysis Read (preview)](/azure/cognitive-services/computer-vision/how-to/concept-ocr) | Optimized for general, non-document images with a performance-enhanced synchronous API that makes it easier to embed OCR powered experiences in your workflows.
+> | General in-the-wild images with single image at a time | labels, street signs, and posters | [Image&nbsp;Analysis Read&nbsp;(preview)](/azure/cognitive-services/computer-vision/concept-ocr) | Optimized for general, non-document images with a performance-enhanced synchronous API that makes it easier to embed OCR powered experiences in your workflows.
 > | Scanned document images, digital and scanned documents including embedded images| books, reports, and forms | [Form&nbsp;Recognizer Read](/azure/applied-ai-services/form-recognizer/concept-read) | Optimized for text-heavy scanned and digital document scenarios with asynchronous API to allow processing large documents in your workflows.
 >
 > **Computer Vision 3.2 GA Read**

articles/cognitive-services/Computer-vision/index.yml

Lines changed: 1 addition & 1 deletion
@@ -80,7 +80,7 @@ conceptualContent:
         url: /training/modules/analyze-images-computer-vision/
       - itemType: reference
         text: Image Analysis API reference
-        url: https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/56f91f2e778daf14a499f21b
+        url: https://aka.ms/vision-4-0-ref
       footerLink:
         text: More
         url: index-image-analysis.yml

articles/cognitive-services/Computer-vision/overview-image-analysis.md

Lines changed: 2 additions & 2 deletions
@@ -19,9 +19,9 @@ keywords: computer vision, computer vision applications, computer vision service

 The Computer Vision Image Analysis service can extract a wide variety of visual features from your images. For example, it can determine whether an image contains adult content, find specific brands or objects, or find human faces.

-The latest version of Image Analysis, 4.0, has new features like OCR and people detection, and it uses updated models that have achieved human parity in certain recognition tasks. If your resource belongs to one of the regions enabled for 4.0 (East US, France Central, Korea Central, North Europe, Southeast Asia, West Europe, West US), we recommend you use this version going forward.
+The latest version of Image Analysis, 4.0, which is now in public preview, has new features like synchronous OCR and people detection. We recommend you use this version going forward.

-You can use Image Analysis through a client library SDK or by calling the [REST API](https://westcentralus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-ga/operations/5d986960601faab4bf452005) directly. Follow the [quickstart](quickstarts-sdk/image-analysis-client-library.md) to get started.
+You can use Image Analysis through a client library SDK or by calling the [REST API](https://aka.ms/vision-4-0-ref) directly. Follow the [quickstart](quickstarts-sdk/image-analysis-client-library.md) to get started.

 > [!div class="nextstepaction"]
 > [Quickstart](quickstarts-sdk/image-analysis-client-library.md)
