Skip to content

Commit cc976da

Browse files
committed
Incorped CFR & email feedback, updated links to prepare for quickstart rework
1 parent 5c0c997 commit cc976da

File tree

1 file changed

+40
-25
lines changed
  • articles/cognitive-services/Computer-vision

1 file changed

+40
-25
lines changed

articles/cognitive-services/Computer-vision/Home.md

Lines changed: 40 additions & 25 deletions
Original file line numberDiff line numberDiff line change
@@ -7,15 +7,48 @@ manager: nolachar
77
ms.service: cognitive-services
88
ms.component: computer-vision
99
ms.topic: overview
10-
ms.date: 05/25/2018
11-
ms.author: nolachar
12-
#Customer intent: As a developer, I want to learn more about Computer Vision so that I can add image processing functionality to my application.
10+
ms.date: 08/22/2018
11+
ms.author: v-deken
12+
#Customer intent: As a developer, I want to evaluate image processing functionality, so that I can determine if it will work for my information extraction or object detection scenarios.
1313
---
1414
# What is Computer Vision?
1515

1616
The cloud-based Computer Vision service provides developers with access to advanced algorithms for processing images and returning information. Computer Vision works with popular image formats, such as JPEG and PNG. To analyze an image, you can either upload an image or specify an image URL. Computer Vision algorithms can analyze the content of an image in different ways based on your inputs and choices, depending on the visual features you're interested in. For example, Computer Vision can determine if an image contains adult or racy content, or find all the faces in an image.
1717

18-
## What can Computer Vision analyze?
18+
You can use Computer Vision in your application, by either using our [client libraries](quickstarts-sdk/csharp-analyze-sdk.md) to invoke the service, or invoking the [REST API](vision-api-how-to-topics/howtocallvisionapi.md) directly, to:
19+
20+
- [Analyze images for insight](#analyzing-images-for-insight)
21+
- [Extract text from images](#extracting-text-from-images)
22+
- [Moderate content in images](#moderating-content-in-images)
23+
24+
## Analyzing images for insight
25+
26+
You can analyze images using Computer Vision to detect and provide insight about the visual features and characteristics of your images. You can analyze either a local image, by uploading the contents of the image at the time of analysis, or a remote image, by specifying the URL to an image to be analyzed.
27+
28+
Computer Vision can perform the following actions when analyzing an image:
29+
30+
| Action | Description |
31+
| ------ | ----------- |
32+
|**[Tag visual features](quickstarts/csharp-analyze.md)**|Identify and tag visual features in an image, based on more than 2,000 recognizable objects, living beings, scenery, and actions. When tags are ambiguous or not common knowledge, the response provides 'hints' to clarify the meaning of the tag in the context of a known setting. Tagging isn't limited to the main subject, such as a person in the foreground, but also includes the setting (indoor or outdoor), furniture, tools, plants, animals, accessories, gadgets, and so on.|
33+
|**[Categorize an image](quickstarts/csharp-analyze.md)**|Identify and categorize an entire image, using on a [category taxonomy](Category-Taxonomy.md) with parent/child hereditary hierarchies. Categories can be used alone, or with our new tagging models.<br/>Currently, English is the only supported language for tagging and categorizing images.|
34+
|**[Describe an image](quickstarts/csharp-analyze.md)**|Generate a description of an entire image in human-readable language, using complete sentences. Computer Vision's algorithms generate various descriptions based on the objects identified in the image. The descriptions are each evaluated and a confidence score generated. A list is then returned ordered from highest confidence score to lowest. An example of a bot that uses this technology to generate image captions can be found [here](https://github.com/Microsoft/BotBuilder-Samples/tree/master/CSharp/intelligence-ImageCaption).|
35+
|**[Detect faces](quickstarts/csharp-analyze.md)** |Detect and analyze faces in an image, providing information about each detected face. Computer Vision returns the face coordinates, the rectangle for the face, gender, and age for each detected face. Computer Vision provides a subset of the functionality that can be found in [Face](/azure/cognitive-services/face/), and you can use the Face service for more detailed analysis, such as facial identification and pose detection.|
36+
|**[Detect image types](quickstarts/csharp-analyze.md)**|Detect characteristics about an image, such as whether an image is a line drawing or the likelihood of whether an image is clip art.|
37+
|**[Detect domain-specific content](quickstarts/python-domain.md)**|Use domain models to detect and identify domain-specific content, such as celebrities and landmarks, depending on the categorization of an image's content. For example, if an image contains people, Computer Vision can use a domain model for celebrities included with the service to determine if the people detected in the image match known celebrities.|
38+
|**[Detect the color scheme](quickstarts/csharp-analyze.md)**|Analyze color usage within an image. Computer Vision can determine whether an image is black & white or color and, for color images, identify the dominant and accent colors.|
39+
|**[Generate a thumbnail](quickstarts/csharp-thumb.md)**|Analyze the contents of an image to provide a useful, contextual thumbnail for an image. Computer Vision first generates a high-quality thumbnail and then analyzes the objects within the image to determine the `region of interest` (ROI). Computer Vision then crops the image to fit the requirements of the region of interest. The generated thumbnail can be presented using an aspect ratio that is different from the aspect ratio of the original image, depending on your needs.|
40+
41+
## Extracting text from images
42+
43+
You can use Computer Vision to [extract text using OCR](quickstarts/csharp-print-text.md) from an image into a machine-readable character stream. If needed, OCR corrects the rotation of the recognized text, in degrees, around the horizontal image axis, and provides the frame coordinates of each word. OCR supports 25 languages, and automatically detects the language of extracted text.
44+
45+
You can also [recognize printed and handwritten text](quickstarts/csharp-hand-text.md) from an image. Computer Vision can detect and extract both printed and handwritten text from images of various objects with different surfaces and backgrounds, such as receipts, posters, business cards, letters, and whiteboards. Currently, recognizing printed and handwritten text is in preview, and English is the only supported language.
46+
47+
## Moderating content in images
48+
49+
You can use Computer Vision to [detect adult and racy content](quickstarts/csharp-analyze.md) in an image, rating the likelihood that the image contains either adult or racy content and generating a confidence score for both. The filter for adult and racy content detection can be set on a sliding scale to accommodate your preferences.
50+
51+
## Image requirements
1952

2053
Computer Vision can analyze images that meet the following requirements:
2154

@@ -24,28 +57,10 @@ Computer Vision can analyze images that meet the following requirements:
2457
- The dimensions of the image must be greater than 50 x 50 pixels
2558
For OCR, the dimensions of the image must be between 40 x 40 and 3200 x 3200 pixels, and the image cannot be bigger than 10 megapixels.
2659

27-
## How do I use Computer Vision?
28-
29-
You can use Computer Vision in your application, by either using our [client libraries](quickstarts-sdk/csharp-analyze-sdk.md) to invoke the service, or invoking the [REST API](vision-api-how-to-topics/howtocallvisionapi.md) directly, to:
30-
31-
- **[Tag visual features](quickstarts/csharp-analyze.md)** in an image, based on more than 2,000 recognizable objects, living beings, scenery, and actions. When tags are ambiguous or not common knowledge, the response provides 'hints' to clarify the meaning of the tag in the context of a known setting. Tagging isn't limited to the main subject, such as a person in the foreground, but also includes the setting (indoor or outdoor), furniture, tools, plants, animals, accessories, gadgets etc.
32-
- **[Categorize an image](quickstarts/csharp-analyze.md)** based on a [category taxonomy](Category-Taxonomy.md) with parent/child hereditary hierarchies, as defined in previous versions of the service. Categories can be used alone, or with our new tagging models.
33-
Currently, English is the only supported language for tagging and categorizing images.
34-
- **[Describe an image](quickstarts/csharp-analyze.md)** in human-readable language, using complete sentences. Computer Vision's algorithms generate various descriptions based on the objects identified in the image. The descriptions are each evaluated and a confidence score generated. A list is then returned ordered from highest confidence score to lowest. An example of a bot that uses this technology to generate image captions can be found [here](https://github.com/Microsoft/BotBuilder-Samples/tree/master/CSharp/intelligence-ImageCaption).
35-
- **[Detect faces](quickstarts/csharp-analyze.md)** in an image and provide information about each detected face. Computer Vision returns the face coordinates, the rectangle for the face, gender, and age for each detected face. Computer Vision provides a subset of the functionality that can be found in [Face](/azure/cognitive-services/face/), and you can use the Face service for more detailed analysis, such as facial identification and pose detection.
36-
- **[Detect image types](quickstarts/csharp-analyze.md)**, such as whether an image is a line drawing, or rate the likelihood of whether an image is clip art.
37-
- **[Detect domain-specific content](quickstarts/python-domain.md)**, such as celebrities and landmarks, depending on the categorization of an image's content. For example, if an image contains people, Computer Vision can use a domain-specific model included with the service to determine if the people match known celebrities.
38-
- **[Detect the color scheme](quickstarts/csharp-analyze.md)**, including the dominant and accent colors, of an image. Computer Vision can determine whether an image is black & white or color and, for color images, identify the dominant and accent colors.
39-
- **[Detect adult and racy content](quickstarts/csharp-analyze.md)** in an image, rating the likelihood that the image contains either adult or racy content and generating a confidence score for both. The filter for adult and racy content detection can be set on a sliding scale to accommodate your preferences.
40-
- **[Extract text using OCR](quickstarts/csharp-print-text.md)** from an image into a machine-readable character stream. If needed, OCR corrects the rotation of the recognized text, in degrees, around the horizontal image axis, and provides the frame coordinates of each word. OCR supports 25 languages, and automatically detects the language of extracted text.
41-
- **[Recognize printed and handwritten text](quickstarts/csharp-hand-text.md)** from an image. Computer Vision can detect and extract both printed and handwritten text from images of various objects with different surfaces and backgrounds, such as receipts, posters, business cards, letters, and whiteboards.
42-
Currently, recognizing printed and handwritten text is in preview, and English is the only supported language.
43-
- **[Generate a thumbnail](quickstarts/csharp-thumb.md)** from an image, first generating a high-quality thumbnail and then analyzing the objects within the image. Computer Vision then crops the image to fit the requirements of the 'region of interest' (ROI). The generated thumbnail can be presented using an aspect ratio that is different from the aspect ratio of the original image, depending on your needs.
44-
4560
## Next steps
4661

4762
Get started with Computer Vision with one of our quickstarts:
4863

49-
- [Analyze an image](/quickstarts/csharp-analyze.md)
50-
- [Extract handwritten text](/quickstarts/csharp-hand-text.md)
51-
- [Generate a thumbnail](/quickstarts/csharp-thumb.md)
64+
- [Analyze an image](/quickstarts-sdk/csharp-analyze-sdk.md)
65+
- [Extract handwritten text](/quickstarts-sdk/csharp-hand-text-sdk.md)
66+
- [Generate a thumbnail](/quickstarts-sdk/csharp-thumb-sdk.md)

0 commit comments

Comments
 (0)