add input reqs

PatrickFarley · PatrickFarley · commit 6b1e09c03e89 · 2024-01-31T16:37:29.000-05:00
diff --git a/articles/ai-services/computer-vision/concept-image-retrieval.md b/articles/ai-services/computer-vision/concept-image-retrieval.md
@@ -39,7 +39,9 @@ Multi-modal embedding has a variety of applications in different fields, includi
 
 ## What are vector embeddings? 
 
-Vector embeddings are a way of representing content&mdash;text or images&mdash;as vectors of real numbers in a high-dimensional space. Vector embeddings are often learned from large amounts of textual and visual data using machine learning algorithms, such as neural networks. Each dimension of the vector corresponds to a different feature or attribute of the content, such as its semantic meaning, syntactic role, or context in which it commonly appears. 
+Vector embeddings are a way of representing content&mdash;text or images&mdash;as vectors of real numbers in a high-dimensional space. Vector embeddings are often learned from large amounts of textual and visual data using machine learning algorithms, such as neural networks. 
+
+Each dimension of the vector corresponds to a different feature or attribute of the content, such as its semantic meaning, syntactic role, or context in which it commonly appears. In Azure AI Vision, image and text vector embeddings have 1024 dimensions.
 
 > [!NOTE]
 > Vector embeddings can only be meaningfully compared if they are from the same model type.
@@ -66,6 +68,15 @@ The image and video retrieval services return a field called "relevance." The te
 > [!IMPORTANT]
 > The relevance score is a good measure to rank results such as images or video frames with respect to a single query. However, the relevance score cannot be accurately compared across queries. Therefore, it's not possible to easily map the relevance score to a confidence level. It's also not possible to trivially create a threshold algorithm to eliminate irrelevant results based solely on the relevance score. 
 
+## Input requirements
+
+**Image input**
+- The file size of the image must be less than 20 megabytes (MB)
+- The dimensions of the image must be greater than 10 x 10 pixels and less than 16,000 x 16,000 pixels
+
+**Text input**
+- The text string must be between (inclusive) one word and 75 words.
+
 ## Next steps
 
 Enable Multi-modal embeddings for your search service and follow the steps to generate vector embeddings for text and images.  
diff --git a/articles/ai-services/computer-vision/overview-image-analysis.md b/articles/ai-services/computer-vision/overview-image-analysis.md
@@ -96,6 +96,8 @@ Image Analysis works on images that meet the following requirements:
 - The file size of the image must be less than 20 megabytes (MB)
 - The dimensions of the image must be greater than 50 x 50 pixels and less than 16,000 x 16,000 pixels
 
+> [!TIP]
+> Input requirements for multi-modal embeddings are different and are listed in [Multi-modal embeddings](/azure/ai-services/computer-vision/concept-image-retrieval#input-requirements)
 
 #### [Version 3.2](#tab/3-2)