
Commit eee3200

Merge pull request #224596 from PatrickFarley/comvis-4
[cog svcs] Comvis 4
2 parents 0fb7704 + 16bd5ec commit eee3200

88 files changed (+3713, -653 lines)


.openpublishing.publish.config.json

Lines changed: 6 additions & 0 deletions
````diff
@@ -914,6 +914,12 @@
       "branch": "main",
       "branch_mapping": {}
     },
+    {
+      "path_to_root": "azure-ai-vision-sdk",
+      "url": "https://github.com/Azure-Samples/azure-ai-vision-sdk",
+      "branch": "main",
+      "branch_mapping": {}
+    },
     {
       "path_to_root": "azure-cache-redis-samples",
       "url": "https://github.com/Azure-Samples/azure-cache-redis-samples",
````
Lines changed: 61 additions & 0 deletions
---
title: Background removal - Image Analysis
titleSuffix: Azure Cognitive Services
description: Learn about background removal, an operation of Image Analysis
services: cognitive-services
author: PatrickFarley
manager: nitinme

ms.service: cognitive-services
ms.subservice: computer-vision
ms.topic: conceptual
ms.date: 03/02/2023
ms.author: pafarley
ms.custom: references_regions
---

# Background removal (version 4.0 preview)

The Image Analysis service can divide images into multiple segments or regions to help the user identify different objects or parts of the image. Background removal creates an alpha matte that separates the foreground object from the background in an image.

> [!div class="nextstepaction"]
> [Call the Background removal API](./how-to/background-removal.md)

This feature provides two possible outputs based on the customer's needs:

- The foreground object of the image without the background. This edited image shows the foreground object and makes the background transparent, allowing the foreground to be placed on a new background.
- An alpha matte that shows the opacity of the detected foreground object. This matte can be used to separate the foreground object from the background for further processing.
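The compositing step implied by the second output can be sketched in plain Python. The alpha-weighted blending formula is standard; the flat pixel-list representation here is purely for illustration.

```python
def composite(foreground, matte, background):
    """Blend a foreground onto a new background using an alpha matte.

    foreground, background: lists of (R, G, B) pixel tuples.
    matte: list of alpha values in [0.0, 1.0], one per pixel
    (1.0 means fully foreground).
    """
    out = []
    for (fr, fg, fb), a, (br, bg, bb) in zip(foreground, matte, background):
        out.append((
            round(a * fr + (1 - a) * br),
            round(a * fg + (1 - a) * bg),
            round(a * fb + (1 - a) * bb),
        ))
    return out

# A fully opaque matte value keeps the foreground pixel unchanged.
pixels = composite([(200, 10, 10)], [1.0], [(0, 0, 255)])
```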

This service is currently in preview, and the API may change in the future.

## Background removal examples

The following example images illustrate what the Image Analysis service returns when removing the background of an image and creating an alpha matte.

|Original image |With background removed |Alpha matte |
|---------|---------|---------|
| :::image type="content" source="media/background-removal/building-1.png" alt-text="Photo of a city near water."::: | :::image type="content" source="media/background-removal/building-1-result.png" alt-text="Photo of a city near water; sky is transparent."::: | :::image type="content" source="media/background-removal/building-1-matte.png" alt-text="Alpha matte of a city skyline."::: |
| :::image type="content" source="media/background-removal/person-5.png" alt-text="Photo of a group of people using a tablet."::: | :::image type="content" source="media/background-removal/person-5-result.png" alt-text="Photo of a group of people using a tablet; background is transparent."::: | :::image type="content" source="media/background-removal/person-5-matte.png" alt-text="Alpha matte of a group of people."::: |
| :::image type="content" source="media/background-removal/bears.png" alt-text="Photo of a group of bears in the woods."::: | :::image type="content" source="media/background-removal/bears-result.png" alt-text="Photo of a group of bears; background is transparent."::: | :::image type="content" source="media/background-removal/bears-alpha.png" alt-text="Alpha matte of a group of bears."::: |

## Limitations

It's important to note the limitations of background removal:

* Background removal works best for categories such as people and animals, buildings and environmental structures, furniture, vehicles, food, text and graphics, and personal belongings.
* Objects that aren't prominent in the foreground may not be identified as part of the foreground.
* Images with thin and detailed structures, like hair or fur, may show some artifacts when overlaid on backgrounds with strong contrast to the original background.
* The latency of the background removal operation is higher for large images and can reach several seconds. We suggest you experiment with integrating both modes into your workflow to find the best usage for your needs (for instance, calling background removal on the original image, versus calling foreground matting on a downsampled version of the image, then resizing the alpha matte to the original size and applying it to the original image).
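The downsampled-matting workflow in the last bullet depends on resizing the alpha matte back to the original resolution. A minimal nearest-neighbor resize, assuming the matte is stored as a flat row-major list, might look like this:

```python
def resize_matte(matte, w, h, new_w, new_h):
    """Nearest-neighbor resize of a flat row-major alpha matte.

    matte: list of alpha values for a w x h matte.
    Returns a new flat list of size new_w x new_h.
    """
    return [matte[(y * h // new_h) * w + (x * w // new_w)]
            for y in range(new_h) for x in range(new_w)]

# Upscale a 2x2 matte to 4x4; each source value covers a 2x2 block.
upscaled = resize_matte([0, 1, 2, 3], 2, 2, 4, 4)
```
In practice you would then pass the upscaled matte and the original-resolution image to a compositing step; a real application would likely use an image library's resampling instead of this sketch.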

## Use the API

The background removal feature is available through the [Image Analysis - Segment](https://aka.ms/vision-4-0-ref) API (`imageanalysis:segment`). You can call this API through REST. See the [Background removal how-to guide](./how-to/background-removal.md) for more information.
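A sketch of assembling such a REST call follows. The `imageanalysis:segment` operation name comes from this article; the api-version value, the `mode` parameter, and the request body shape are illustrative assumptions to check against the API reference.

```python
from urllib.parse import urlencode

def build_segment_request(endpoint, key, mode="backgroundRemoval"):
    # api-version and mode are assumed values; verify against the API reference.
    query = urlencode({"api-version": "2023-02-01-preview", "mode": mode})
    url = f"{endpoint}/computervision/imageanalysis:segment?{query}"
    headers = {
        "Ocp-Apim-Subscription-Key": key,  # standard Cognitive Services key header
        "Content-Type": "application/json",
    }
    body = {"url": "https://example.com/photo.jpg"}  # image to process (placeholder)
    return url, headers, body

url, headers, body = build_segment_request(
    "https://myresource.cognitiveservices.azure.com", "<your-key>")
```

You would POST `body` as JSON to `url` with `headers` using any HTTP client; the response body contains the edited image or the alpha matte.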

## Next steps

* [Call the background removal API](./how-to/background-removal.md)
Lines changed: 153 additions & 0 deletions
---
title: Image captions - Image Analysis 4.0
titleSuffix: Azure Cognitive Services
description: Concepts related to the image captioning feature of the Image Analysis 4.0 API.
services: cognitive-services
author: PatrickFarley
manager: nitinme

ms.service: cognitive-services
ms.subservice: computer-vision
ms.topic: conceptual
ms.date: 01/24/2023
ms.author: pafarley
ms.custom: seodec18, ignite-2022, references_regions
---

# Image captions (version 4.0 preview)

Image captions in Image Analysis 4.0 (preview) are available through the Caption and Dense Captions features.

Caption generates a one-sentence description of the entire image's contents. Dense Captions provides more detail by generating one-sentence descriptions of up to 10 regions of the image in addition to describing the whole image, and it returns the bounding box coordinates of each described region. Both features use the latest Florence-based AI models.

At this time, image captioning is available in English only.

### Gender-neutral captions

By default, captions contain the gender terms "man", "woman", "boy", and "girl". You can replace these terms with "person" in your results to receive gender-neutral captions: set the optional **gender-neutral-caption** request parameter to `true` in the request URL.
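For illustration, the parameter can be combined with the other query parameters like this. The **gender-neutral-caption** name comes from this article; the api-version value and the `features` value are assumptions to verify against the API reference.

```python
from urllib.parse import urlencode

# Illustrative query string for a gender-neutral caption request.
params = {
    "api-version": "2023-02-01-preview",  # assumed preview version
    "features": "caption",                # request the Caption feature
    "gender-neutral-caption": "true",     # replace gendered terms with "person"
}
query = urlencode(params)
```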

> [!IMPORTANT]
> Image captioning in Image Analysis 4.0 is only available in the following Azure data center regions at this time: East US, France Central, Korea Central, North Europe, Southeast Asia, West Europe, West US. You must use a Computer Vision resource located in one of these regions to get results from the Caption and Dense Captions features.
>
> If you must use a Computer Vision resource outside these regions to generate image captions, use [Image Analysis 3.2](concept-describing-images.md), which is available in all Computer Vision regions.

Try out the image captioning features quickly and easily in your browser using Vision Studio.

> [!div class="nextstepaction"]
> [Try Vision Studio](https://portal.vision.cognitive.azure.com/)

## Caption example

#### [Caption](#tab/image)

The following JSON response illustrates what the Analysis 4.0 API returns when describing the example image based on its visual features.

![Photo of a man pointing at a screen](./Media/quickstarts/presentation.png)

```json
"captions": [
    {
        "text": "a man pointing at a screen",
        "confidence": 0.4891590476036072
    }
]
```

#### [Dense Captions](#tab/dense)

The following JSON response illustrates what the Analysis 4.0 API returns when generating dense captions for the example image.

![Photo of a tractor on a farm](./Images/farm.png)

```json
{
  "denseCaptionsResult": {
    "values": [
      {
        "text": "a man driving a tractor in a farm",
        "confidence": 0.535620927810669,
        "boundingBox": { "x": 0, "y": 0, "w": 850, "h": 567 }
      },
      {
        "text": "a man driving a tractor in a field",
        "confidence": 0.5428450107574463,
        "boundingBox": { "x": 132, "y": 266, "w": 209, "h": 219 }
      },
      {
        "text": "a blurry image of a tree",
        "confidence": 0.5139822363853455,
        "boundingBox": { "x": 147, "y": 126, "w": 76, "h": 131 }
      },
      {
        "text": "a man riding a tractor",
        "confidence": 0.4799223840236664,
        "boundingBox": { "x": 206, "y": 264, "w": 64, "h": 97 }
      },
      {
        "text": "a blue sky above a hill",
        "confidence": 0.35495415329933167,
        "boundingBox": { "x": 0, "y": 0, "w": 837, "h": 166 }
      },
      {
        "text": "a tractor in a field",
        "confidence": 0.47338250279426575,
        "boundingBox": { "x": 0, "y": 243, "w": 838, "h": 311 }
      }
    ]
  },
  "modelVersion": "2023-02-01-preview",
  "metadata": {
    "width": 850,
    "height": 567
  }
}
```

---

## Use the API

#### [Image captions](#tab/image)

The image captioning feature is part of the [Analyze Image](https://aka.ms/vision-4-0-ref) API. Include `Caption` in the **features** query parameter. Then, when you get the full JSON response, parse the string for the contents of the `"captionResult"` section.
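Extracting the caption from that section might look like the following sketch. The `"captionResult"` key is named in this article; the inner `text` and `confidence` fields mirror the Caption example above and should be verified against the API reference.

```python
import json

# Assumed response shape based on the "captionResult" section described here.
response_text = '''
{
  "captionResult": {
    "text": "a man pointing at a screen",
    "confidence": 0.4891590476036072
  }
}
'''
result = json.loads(response_text)
caption = result["captionResult"]
print(f"{caption['text']} (confidence {caption['confidence']:.2f})")
```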

#### [Dense captions](#tab/dense)

The dense captioning feature is part of the [Analyze Image](https://aka.ms/vision-4-0-ref) API. You can call this API using REST. Include `denseCaptions` in the **features** query parameter. Then, when you get the full JSON response, parse the string for the contents of the `"denseCaptionsResult"` section.
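A sketch of walking the `"denseCaptionsResult"` section, using a trimmed version of the example response shown earlier:

```python
# Two entries copied from the Dense Captions example response in this article.
response = {
    "denseCaptionsResult": {
        "values": [
            {"text": "a man driving a tractor in a farm",
             "confidence": 0.535620927810669,
             "boundingBox": {"x": 0, "y": 0, "w": 850, "h": 567}},
            {"text": "a blurry image of a tree",
             "confidence": 0.5139822363853455,
             "boundingBox": {"x": 147, "y": 126, "w": 76, "h": 131}},
        ]
    }
}

# List each region caption with its bounding box origin and size.
for item in response["denseCaptionsResult"]["values"]:
    box = item["boundingBox"]
    print(f"{item['text']}: ({box['x']}, {box['y']}) {box['w']}x{box['h']}")
```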

---

## Next steps

* Learn the related concept of [object detection](concept-object-detection-40.md).
* [Quickstart: Image Analysis REST API or client libraries](./quickstarts-sdk/image-analysis-client-library-40.md?pivots=programming-language-csharp)
* [Call the Analyze Image API](./how-to/call-analyze-image-40.md)

articles/cognitive-services/Computer-vision/concept-describing-images.md

Lines changed: 1 addition & 32 deletions
````diff
@@ -14,7 +14,7 @@ ms.author: pafarley
 ms.custom: seodec18, ignite-2022
 ---
 
-# Image description generation
+# Image descriptions
 
 Computer Vision can analyze an image and generate a human-readable phrase that describes its contents. The algorithm returns several descriptions based on different visual features, and each description is given a confidence score. The final output is a list of descriptions ordered from highest to lowest confidence.
 
````
````diff
@@ -31,8 +29,6 @@ The following JSON response illustrates what the Analyze API returns when descri
 
 ![A black and white picture of buildings in Manhattan](./Images/bw_buildings.png)
 
-#### [Version 3.2](#tab/3-2)
-
 ```json
 {
   "description":{
@@ -57,41 +55,12 @@ The following JSON response illustrates what the Analyze API returns when descri
   "modelVersion":"2021-05-01"
 }
 ```
-#### [Version 4.0](#tab/4-0)
-
-```json
-{
-  "metadata":
-  {
-    "width": 239,
-    "height": 300
-  },
-  "descriptionResult":
-  {
-    "values":
-    [
-      {
-        "text": "a city with tall buildings",
-        "confidence": 0.3551448881626129
-      }
-    ]
-  }
-}
-```
----
 
 ## Use the API
 
-#### [Version 3.2](#tab/3-2)
 
 The image description feature is part of the [Analyze Image](https://westcentralus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/56f91f2e778daf14a499f21b) API. You can call this API through a native SDK or through REST calls. Include `Description` in the **visualFeatures** query parameter. Then, when you get the full JSON response, parse the string for the contents of the `"description"` section.
 
-#### [Version 4.0](#tab/4-0)
-
-The image description feature is part of the [Analyze Image](https://aka.ms/vision-4-0-ref) API. You can call this API using REST. Include `Description` in the **features** query parameter. Then, when you get the full JSON response, parse the string for the contents of the `"description"` section.
-
----
-
 * [Quickstart: Image Analysis REST API or client libraries](./quickstarts-sdk/image-analysis-client-library.md?pivots=programming-language-csharp)
 
 ## Next steps
````
Lines changed: 44 additions & 0 deletions
---
title: Smart-cropped thumbnails - Image Analysis 4.0
titleSuffix: Azure Cognitive Services
description: Concepts related to generating thumbnails for images using the Image Analysis 4.0 API.
services: cognitive-services
author: PatrickFarley
manager: nitinme

ms.service: cognitive-services
ms.subservice: computer-vision
ms.topic: conceptual
ms.date: 01/24/2023
ms.author: pafarley
ms.custom: seodec18, ignite-2022
---

# Smart-cropped thumbnails (version 4.0 preview)

A thumbnail is a reduced-size representation of an image. Thumbnails are used to represent images and other data in a more economical, layout-friendly way. The Computer Vision API uses smart cropping to create intuitive image thumbnails that include the most important regions of an image, with priority given to any detected faces.

The Computer Vision smart-cropping utility takes one or more aspect ratios in the range [0.75, 1.80] and returns the bounding box coordinates (in pixels) of the region(s) identified. Your app can then crop and return the image using those coordinates.

> [!IMPORTANT]
> This feature uses face detection to help determine important regions in the image. The detection does not involve distinguishing one face from another face, predicting or classifying facial attributes, or creating a facial template (a unique set of numbers generated from an image that represents the distinctive features of a face).

## Examples

The generated bounding box can vary widely depending on the aspect ratio you specify, as shown in the following images.

| Aspect ratio | Bounding box |
|-------|-----------|
| original | :::image type="content" source="Images/cropped-original.png" alt-text="Photo of a man with a dog at a table."::: |
| 0.75 | :::image type="content" source="Images/cropped-075-bb.png" alt-text="Photo of a man with a dog at a table. A 0.75 ratio bounding box is drawn."::: |
| 1.00 | :::image type="content" source="Images/cropped-1-0-bb.png" alt-text="Photo of a man with a dog at a table. A 1.00 ratio bounding box is drawn."::: |
| 1.50 | :::image type="content" source="Images/cropped-150-bb.png" alt-text="Photo of a man with a dog at a table. A 1.50 ratio bounding box is drawn."::: |

## Use the API

The smart cropping feature is available through the [Analyze Image API](https://aka.ms/vision-4-0-ref). Include `SmartCrops` in the **features** query parameter. Also include a **smartcrops-aspect-ratios** query parameter, and set it to one or more decimal aspect ratio values (defined as width / height) in the range [0.75, 1.80]; separate multiple values with commas. If no aspect ratio value is provided, the API returns a crop with an aspect ratio that best preserves the image's most important region.
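The query construction and the local crop using a returned bounding box can be wired together as in this sketch. The `features` and `smartcrops-aspect-ratios` parameter names come from this article; the api-version value and the flat pixel-list crop helper are illustrative assumptions.

```python
from urllib.parse import urlencode

def build_query(ratios):
    """Build the query string for a smart-crop request."""
    return urlencode({
        "api-version": "2023-02-01-preview",  # assumed preview version
        "features": "smartCrops",
        "smartcrops-aspect-ratios": ",".join(str(r) for r in ratios),
    })

def crop(pixels, width, box):
    """Crop a flat row-major pixel list using a bounding box in pixels."""
    x, y, w, h = box["x"], box["y"], box["w"], box["h"]
    return [pixels[(y + row) * width + x : (y + row) * width + x + w]
            for row in range(h)]
```

A real application would crop with an image library; the helper just shows how the returned `x`, `y`, `w`, `h` coordinates map onto pixel data.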

## Next steps

* [Call the Analyze Image API](./how-to/call-analyze-image-40.md)

articles/cognitive-services/Computer-vision/concept-generating-thumbnails.md

Lines changed: 0 additions & 27 deletions
````diff
@@ -18,7 +18,6 @@ ms.custom: seodec18, ignite-2022
 
 A thumbnail is a reduced-size representation of an image. Thumbnails are used to represent images and other data in a more economical, layout-friendly way. The Computer Vision API uses smart cropping to create intuitive image thumbnails that include the most important regions of an image with priority given to any detected faces.
 
-#### [Version 3.2](#tab/3-2)
 The Computer Vision thumbnail generation algorithm works as follows:
 
 1. Remove distracting elements from the image and identify the _area of interest_—the area of the image in which the main object(s) appears.
@@ -45,37 +44,11 @@ The following table illustrates thumbnails defined by smart-cropping for the exa
 |![A white flower with a green background](./Images/flower.png) | ![Vision Analyze Flower thumbnail](./Images/flower_thumbnail.png) |
 |![A woman on the roof of an apartment building](./Images/woman_roof.png) | ![thumbnail of a woman on the roof of an apartment building](./Images/woman_roof_thumbnail.png) |
 
-#### [Version 4.0](#tab/4-0)
-
-The Computer Vision smart-cropping utility takes one or more aspect ratios in the range [0.75, 1.80] and returns the bounding box coordinates (in pixels) of the region(s) identified. Your app can then crop and return the image using those coordinates.
-
-> [!IMPORTANT]
-> This feature uses face detection to help determine important regions in the image. The detection does not involve distinguishing one face from another face, predicting or classifying facial attributes, or creating a facial template (a unique set of numbers generated from an image that represents the distinctive features of a face).
-
-## Examples
-
-The generated bounding box can vary widely depending on what you specify for aspect ratio, as shown in the following images.
-
-| Aspect ratio | Bounding box |
-|-------|-----------|
-| original | :::image type="content" source="Images/cropped-original.png" alt-text="Photo of a man with a dog at a table."::: |
-| 0.75 | :::image type="content" source="Images/cropped-075-bb.png" alt-text="Photo of a man with a dog at a table. A 0.75 ratio bounding box is drawn."::: |
-| 1.00 | :::image type="content" source="Images/cropped-1-0-bb.png" alt-text="Photo of a man with a dog at a table. A 1.00 ratio bounding box is drawn."::: |
-| 1.50 | :::image type="content" source="Images/cropped-150-bb.png" alt-text="Photo of a man with a dog at a table. A 1.50 ratio bounding box is drawn."::: |
-
-
----
 
 ## Use the API
 
-#### [Version 3.2](#tab/3-2)
 
 The generate thumbnail feature is available through the [Get Thumbnail](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/56f91f2e778daf14a499f20c) and [Get Area of Interest](https://westus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-2/operations/b156d0f5e11e492d9f64418d) APIs. You can call this API through a native SDK or through REST calls.
 
-#### [Version 4.0](#tab/4-0)
-
-The smart cropping feature is available through the [Analyze](https://aka.ms/vision-4-0-ref) API. You can call this API using REST. Include `SmartCrops` in the **visualFeatures** query parameter. Also include a **smartcrops-aspect-ratios** query parameter, and set it to a decimal value for the aspect ratio you want (defined as width / height). Multiple aspect ratio values should be comma-separated.
-
----
 
 * [Generate a thumbnail (how-to)](./how-to/generate-thumbnail.md)
````
