Skip to content

Commit 6b1e09c

Browse files
committed
add input reqs
1 parent fa66423 commit 6b1e09c

File tree

2 files changed

+14
-1
lines changed

2 files changed

+14
-1
lines changed

articles/ai-services/computer-vision/concept-image-retrieval.md

Lines changed: 12 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,9 @@ Multi-modal embedding has a variety of applications in different fields, includi
3939
4040
## What are vector embeddings?
4141

42-
Vector embeddings are a way of representing content—text or images—as vectors of real numbers in a high-dimensional space. Vector embeddings are often learned from large amounts of textual and visual data using machine learning algorithms, such as neural networks. Each dimension of the vector corresponds to a different feature or attribute of the content, such as its semantic meaning, syntactic role, or context in which it commonly appears.
42+
Vector embeddings are a way of representing content—text or images—as vectors of real numbers in a high-dimensional space. Vector embeddings are often learned from large amounts of textual and visual data using machine learning algorithms, such as neural networks.
43+
44+
Each dimension of the vector corresponds to a different feature or attribute of the content, such as its semantic meaning, syntactic role, or context in which it commonly appears. In Azure AI Vision, image and text vector embeddings have 1024 dimensions.
4345

4446
> [!NOTE]
4547
> Vector embeddings can only be meaningfully compared if they are from the same model type.
@@ -66,6 +68,15 @@ The image and video retrieval services return a field called "relevance." The te
6668
> [!IMPORTANT]
6769
> The relevance score is a good measure to rank results such as images or video frames with respect to a single query. However, the relevance score cannot be accurately compared across queries. Therefore, it's not possible to easily map the relevance score to a confidence level. It's also not possible to trivially create a threshold algorithm to eliminate irrelevant results based solely on the relevance score.
6870
71+
## Input requirements
72+
73+
**Image input**
74+
- The file size of the image must be less than 20 megabytes (MB)
75+
- The dimensions of the image must be greater than 10 x 10 pixels and less than 16,000 x 16,000 pixels
76+
77+
**Text input**
78+
- The text string must be between (inclusive) one word and 75 words.
79+
6980
## Next steps
7081

7182
Enable Multi-modal embeddings for your search service and follow the steps to generate vector embeddings for text and images.

articles/ai-services/computer-vision/overview-image-analysis.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -96,6 +96,8 @@ Image Analysis works on images that meet the following requirements:
9696
- The file size of the image must be less than 20 megabytes (MB)
9797
- The dimensions of the image must be greater than 50 x 50 pixels and less than 16,000 x 16,000 pixels
9898

99+
> [!TIP]
100+
> Input requirements for multi-modal embeddings are different and are listed in [Multi-modal embeddings](/azure/ai-services/computer-vision/concept-image-retrieval#input-requirements)
99101
100102
#### [Version 3.2](#tab/3-2)
101103

0 commit comments

Comments
 (0)