MicrosoftDocs
diff --git a/‎articles/ai-services/computer-vision/concept-image-retrieval.md
Lines changed: 14 additions & 14 deletions b/‎articles/ai-services/computer-vision/concept-image-retrieval.md
Lines changed: 14 additions & 14 deletions
diff --git a/‎articles/ai-services/computer-vision/how-to/image-retrieval.md
Lines changed: 10 additions & 8 deletions b/‎articles/ai-services/computer-vision/how-to/image-retrieval.md
Lines changed: 10 additions & 8 deletions
diff --git a/‎articles/ai-services/computer-vision/how-to/video-retrieval.md
Lines changed: 1 addition & 1 deletion b/‎articles/ai-services/computer-vision/how-to/video-retrieval.md
Lines changed: 1 addition & 1 deletion
diff --git a/‎articles/ai-services/computer-vision/language-support.md
Lines changed: 110 additions & 1 deletion b/‎articles/ai-services/computer-vision/language-support.md
Lines changed: 110 additions & 1 deletion
@@ -1,5 +1,5 @@
 ---
-title: Multi-modal embeddings concepts - Image Analysis 4.0
+title: Multimodal embeddings concepts - Image Analysis 4.0
 titleSuffix: Azure AI services
 description: Concepts related to image vectorization using the Image Analysis 4.0 API.
 #services: cognitive-services
@@ -8,13 +8,13 @@ manager: nitinme
 
 ms.service: azure-ai-vision
 ms.topic: conceptual
-ms.date: 01/19/2024
+ms.date: 02/20/2024
 ms.author: pafarley
 ---
 
-# Multi-modal embeddings (version 4.0 preview)
+# Multimodal embeddings (version 4.0)
 
-Multi-modal embedding is the process of generating a numerical representation of an image that captures its features and characteristics in a vector format. These vectors encode the content and context of an image in a way that is compatible with text search over the same vector space.
+Multimodal embedding is the process of generating a numerical representation of an image that captures its features and characteristics in a vector format. These vectors encode the content and context of an image in a way that is compatible with text search over the same vector space.
 
 Image retrieval systems have traditionally used features extracted from the images, such as content labels, tags, and image descriptors, to compare images and rank them by similarity. However, vector similarity search is gaining more popularity due to a number of benefits over traditional keyword-based search and is becoming a vital component in popular content search services.
 
@@ -26,35 +26,35 @@ Vector search searches large collections of vectors in high-dimensional space to
 
 ## Business applications
 
-Multi-modal embedding has a variety of applications in different fields, including: 
+Multimodal embedding has a variety of applications in different fields, including: 
 
-- **Digital asset management**: Multi-modal embedding can be used to manage large collections of digital images, such as in museums, archives, or online galleries. Users can search for images based on visual features and retrieve the images that match their criteria.
+- **Digital asset management**: Multimodal embedding can be used to manage large collections of digital images, such as in museums, archives, or online galleries. Users can search for images based on visual features and retrieve the images that match their criteria.
 - **Security and surveillance**: Vectorization can be used in security and surveillance systems to search for images based on specific features or patterns, such as in, people & object tracking, or threat detection. 
 - **Forensic image retrieval**: Vectorization can be used in forensic investigations to search for images based on their visual content or metadata, such as in cases of cyber-crime.
 - **E-commerce**: Vectorization can be used in online shopping applications to search for similar products based on their features or descriptions or provide recommendations based on previous purchases.
 - **Fashion and design**: Vectorization can be used in fashion and design to search for images based on their visual features, such as color, pattern, or texture. This can help designers or retailers to identify similar products or trends.
 
 > [!CAUTION]
-> Multi-modal embedding is not designed analyze medical images for diagnostic features or disease patterns. Please do not use Multi-modal embedding for medical purposes.
+> Multimodal embedding is not designed analyze medical images for diagnostic features or disease patterns. Please do not use Multimodal embedding for medical purposes.
 
 ## What are vector embeddings? 
 
 Vector embeddings are a way of representing content&mdash;text or images&mdash;as vectors of real numbers in a high-dimensional space. Vector embeddings are often learned from large amounts of textual and visual data using machine learning algorithms, such as neural networks. 
 
 Each dimension of the vector corresponds to a different feature or attribute of the content, such as its semantic meaning, syntactic role, or context in which it commonly appears. In Azure AI Vision, image and text vector embeddings have 1024 dimensions.
 
-> [!NOTE]
-> Vector embeddings can only be meaningfully compared if they are from the same model type.
+> [!IMPORTANT]
+> Vector embeddings can only be compared and matched if they're from the same model type. Images vectorized by one model won't be searchable through a different model. The latest Image Analysis API offers two models, version `2023-04-15` which supports text search in many languages, and the legacy `2022-04-11` model which supports only English.
 
 ## How does it work? 
 
-The following are the main steps of the image retrieval process using Multi-modal embeddings.
+The following are the main steps of the image retrieval process using Multimodal embeddings.
 
 :::image type="content" source="media/image-retrieval.png" alt-text="Diagram of image retrieval process.":::
 
-1. Vectorize Images and Text: the Multi-modal embeddings APIs, **VectorizeImage** and **VectorizeText**, can be used to extract feature vectors out of an image or text respectively. The APIs return a single feature vector representing the entire input.
+1. Vectorize Images and Text: the Multimodal embeddings APIs, **VectorizeImage** and **VectorizeText**, can be used to extract feature vectors out of an image or text respectively. The APIs return a single feature vector representing the entire input.
    > [!NOTE]
-   > Multi-modal embedding does not do any biometric processing of human faces. For face detection and identification, see the [Azure AI Face service](./overview-identity.md).
+   > Multimodal embedding does not do any biometric processing of human faces. For face detection and identification, see the [Azure AI Face service](./overview-identity.md).
 
 1. Measure similarity: Vector search systems typically use distance metrics, such as cosine distance or Euclidean distance, to compare vectors and rank them by similarity. The [Vision studio](https://portal.vision.cognitive.azure.com/) demo uses [cosine distance](./how-to/image-retrieval.md#calculate-vector-similarity) to measure similarity.  
 1. Retrieve Images: Use the top _N_ vectors similar to the search query and retrieve images corresponding to those vectors from your photo library to  provide as the final result.
@@ -79,6 +79,6 @@ The image and video retrieval services return a field called "relevance." The te
 
 ## Next steps
 
-Enable Multi-modal embeddings for your search service and follow the steps to generate vector embeddings for text and images.  
-* [Call the Multi-modal embeddings APIs](./how-to/image-retrieval.md)
+Enable Multimodal embeddings for your search service and follow the steps to generate vector embeddings for text and images.  
+* [Call the Multimodal embeddings APIs](./how-to/image-retrieval.md)
 
@@ -1,5 +1,5 @@
 ---
-title: Do image retrieval using multi-modal embeddings - Image Analysis 4.0
+title: Do image retrieval using multimodal embeddings - Image Analysis 4.0
 titleSuffix: Azure AI services
 description: Learn how to call the image retrieval API to vectorize image and search terms.
 #services: cognitive-services
@@ -8,14 +8,14 @@ manager: nitinme
 
 ms.service: azure-ai-vision
 ms.topic: how-to
-ms.date: 01/30/2024
+ms.date: 02/20/2024
 ms.author: pafarley
 ms.custom: references_regions
 ---
 
-# Do image retrieval using multi-modal embeddings (version 4.0 preview)
+# Do image retrieval using multimodal embeddings (version 4.0)
 
-The Multi-modal embeddings APIs enable the _vectorization_ of images and text queries. They convert images to coordinates in a multi-dimensional vector space. Then, incoming text queries can also be converted to vectors, and images can be matched to the text based on semantic closeness. This allows the user to search a set of images using text, without the need to use image tags or other metadata. Semantic closeness often produces better results in search.
+The Multimodal embeddings APIs enable the _vectorization_ of images and text queries. They convert images to coordinates in a multi-dimensional vector space. Then, incoming text queries can also be converted to vectors, and images can be matched to the text based on semantic closeness. This allows the user to search a set of images using text, without the need to use image tags or other metadata. Semantic closeness often produces better results in search.
 
 > [!IMPORTANT]
 > These APIs are only available in the following geographic regions: East US, France Central, Korea Central, North Europe, Southeast Asia, West Europe, West US.
@@ -26,9 +26,9 @@ The Multi-modal embeddings APIs enable the _vectorization_ of images and text qu
 * Once you have your Azure subscription, <a href="https://portal.azure.com/#create/Microsoft.CognitiveServicesComputerVision"  title="Create a Computer Vision resource"  target="_blank">create a Computer Vision resource </a> in the Azure portal to get your key and endpoint. Be sure to create it in one of the permitted geographic regions: East US, France Central, Korea Central, North Europe, Southeast Asia, West Europe, West US. 
    * After it deploys, select **Go to resource**. Copy the key and endpoint to a temporary location to use later on.
 
-## Try out Multi-modal embeddings
+## Try out Multimodal embeddings
 
-You can try out the Multi-modal embeddings feature quickly and easily in your browser using Vision Studio.
+You can try out the Multimodal embeddings feature quickly and easily in your browser using Vision Studio.
 
 > [!IMPORTANT]
 > The Vision Studio experience is limited to 500 images. To use a larger image set, create your own search application using the APIs in this guide.
@@ -43,9 +43,10 @@ The `retrieval:vectorizeImage` API lets you convert an image's data to a vector.
 1. Replace `<endpoint>` with your Azure AI Vision endpoint.
 1. Replace `<subscription-key>` with your Azure AI Vision key.
 1. In the request body, set `"url"` to the URL of a remote image you want to use.
+1. Optionally, change the `model-version` parameter to an older version. `2022-04-11` is the legacy model that supports only English text. Images and text that are vectorized with a certain model aren't compatible with other models, so be sure to use the same model for both. 
 
 ```bash
-curl.exe -v -X POST "https://<endpoint>/computervision/retrieval:vectorizeImage?api-version=2023-02-01-preview&modelVersion=latest" -H "Content-Type: application/json" -H "Ocp-Apim-Subscription-Key: <subscription-key>" --data-ascii "
+curl.exe -v -X POST "https://<endpoint>/computervision/retrieval:vectorizeImage?api-version=2024-02-01-preview&model-version=2023-04-15" -H "Content-Type: application/json" -H "Ocp-Apim-Subscription-Key: <subscription-key>" --data-ascii "
 {
 'url':'https://learn.microsoft.com/azure/ai-services/computer-vision/media/quickstarts/presentation.png'
 }"
@@ -69,9 +70,10 @@ The `retrieval:vectorizeText` API lets you convert a text string to a vector. To
 1. Replace `<endpoint>` with your Azure AI Vision endpoint.
 1. Replace `<subscription-key>` with your Azure AI Vision key.
 1. In the request body, set `"text"` to the example search term you want to use.
+1. Optionally, change the `model-version` parameter to an older version. `2022-04-11` is the legacy model that supports only English text. Images and text that are vectorized with a certain model aren't compatible with other models, so be sure to use the same model for both. 
 
 ```bash
-curl.exe -v -X POST "https://<endpoint>/computervision/retrieval:vectorizeText?api-version=2023-02-01-preview&modelVersion=latest" -H "Content-Type: application/json" -H "Ocp-Apim-Subscription-Key: <subscription-key>" --data-ascii "
+curl.exe -v -X POST "https://<endpoint>/computervision/retrieval:vectorizeText?api-version=2023-02-01-preview&model-version=2023-04-15" -H "Content-Type: application/json" -H "Ocp-Apim-Subscription-Key: <subscription-key>" --data-ascii "
 {
 'text':'cat jumping'
 }"
 
@@ -338,4 +338,4 @@ Connection: close
 
 ## Next steps
 
-[Multi-modal embeddings concepts](../concept-image-retrieval.md)
+[Multimodal embeddings concepts](../concept-image-retrieval.md)
@@ -127,7 +127,7 @@ The following table lists the OCR supported languages for print text by the most
 |Kazakh (Latin) | `kk-latn`|Zhuang | `za` |
 |Khaling | `klr`|Zulu  | `zu` |
 
-## Image analysis
+## Analyze image
 
 Some features of the [Analyze - Image](https://westcentralus.dev.cognitive.microsoft.com/docs/services/computer-vision-v3-1-ga/operations/56f91f2e778daf14a499f21b) API can return results in other languages, specified with the `language` query parameter. Other actions return results in English regardless of what language is specified, and others throw an exception for unsupported languages. Actions are specified with the `visualFeatures` and `details` query parameters; see the [Overview](overview-image-analysis.md) for a list of all the actions you can do with image analysis. Languages for tagging are only available in API version 3.2 or later.
 
@@ -185,3 +185,112 @@ Some features of the [Analyze - Image](https://westcentralus.dev.cognitive.micro
 |Chinese Simplified |`zh`|✅ | ✅| ✅|||||| |✅|✅||
 |Chinese Simplified |`zh-Hans`| | ✅| |||||| ||||
 |Chinese Traditional |`zh-Hant`| | ✅| |||||| ||||
+
+## Multimodal embeddings
+
+The latest [Multimodal embeddings](./concept-image-retrieval.md) model supports vector search in many languages. The original model supports English only. Images that are vectorized in the English-only model are not compatible with text searches in the multi-lingual model.
+
+| Language  | Language code | `2023-04-15` model | `2022-04-11` model|
+|-----------------------|---------------| -- |--  |
+| Akrikaans             | `af`          | ✅ |  |
+| Amharic               | `am`          | ✅ |  |
+| Arabic                | `ar`          | ✅ |  |
+| Armenian              | `hy`          | ✅ |  |
+| Assamese              | `as`          | ✅ |  |
+| Asturian              | `ast`         | ✅ |  |
+| Azerbaijani           | `az`          | ✅ |  |
+| Belarusian            | `be`          | ✅ |  |
+| Bengali               | `bn`          | ✅ |  |
+| Bosnian               | `bs`          | ✅ |  |
+| Bulgarian             | `bg`          | ✅ |  |
+| Burmese               | `my`          | ✅ |  |
+| Catalan               | `ca`          | ✅ |  |
+| Cebuano               | `ceb`         | ✅ |  |
+| Chinese Simpl         | `zho`         | ✅ |  |
+| Chinese Trad          | `zho`         | ✅ |  |
+| Croatian              | `hr`          | ✅ |  |
+| Czech                 | `cs`          | ✅ |  |
+| Danish                | `da`          | ✅ |  |
+| Dutch                 | `nl`          | ✅ |  |
+| English               | `en`          | ✅ | ✅ |
+| Estonian              | `et`          | ✅ |  |
+| Filipino (Tagalog)    | `tl`          | ✅ |  |
+| Finnish               | `fi`          | ✅ |  |
+| French                | `fr`          | ✅ |  |
+| Fulah                 | `ff`          | ✅ |  |
+| Galician              | `gl`          | ✅ |  |
+| Ganda                 | `lg`          | ✅ |  |
+| Georgian              | `ka`          | ✅ |  |
+| German                | `de`          | ✅ |  |
+| Greek                 | `el`          | ✅ |  |
+| Gujarati              | `gu`          | ✅ |  |
+| Hausa                 | `ha`          | ✅ |  |
+| Hebrew                | `he`          | ✅ |  |
+| Hindi                 | `hi`          | ✅ |  |
+| Hungarian             | `hu`          | ✅ |  |
+| Icelandic             | `is`          | ✅ |  |
+| Igbo                  | `ig`          | ✅ |  |
+| Indonesian            | `id`          | ✅ |  |
+| Irish                 | `ga`          | ✅ |  |
+| Italian               | `it`          | ✅ |  |
+| Japanese              | `ja`          | ✅ |  |
+| Javanese              | `jv`          | ✅ |  |
+| Kabuverdianu          | `kea`         | ✅ |  |
+| Kamba                 | `kam`         | ✅ |  |
+| Kannada               | `kn`          | ✅ |  |
+| Kazakh                | `kk`          | ✅ |  |
+| Khmer                 | `km`          | ✅ |  |
+| Korean                | `ko`          | ✅ |  |
+| Kyrgyz                | `ky`          | ✅ |  |
+| Lao                   | `lo`          | ✅ |  |
+| Latvian               | `lv`          | ✅ |  |
+| Lingala               | `ln`          | ✅ |  |
+| Lithuanian            | `lt`          | ✅ |  |
+| Luo                   | `luo`         | ✅ |  |
+| Luxembourgish         | `lb`          | ✅ |  |
+| Macedonian            | `mk`          | ✅ |  |
+| Malay                 | `ms`          | ✅ |  |
+| Malayalam             | `ml`          | ✅ |  |
+| Maltese               | `mt`          | ✅ |  |
+| Maori                 | `mi`          | ✅ |  |
+| Marathi               | `mr`          | ✅ |  |
+| Mongolian             | `mn`          | ✅ |  |
+| Nepali                | `ne`          | ✅ |  |
+| Northern Sotho        | `ns`          | ✅ |  |
+| Norwegian             | `no`          | ✅ |  |
+| Nyanja                | `ny`          | ✅ |  |
+| Occitan               | `oc`          | ✅ |  |
+| Oriya                 | `or`          | ✅ |  |
+| Oromo                 | `om`          | ✅ |  |
+| Pashto                | `ps`          | ✅ |  |
+| Persian               | `fa`          | ✅ |  |
+| Polish                | `pl`          | ✅ |  |
+| Portuguese (Brazil)   | `pt`          | ✅ |  |
+| Punjabi               | `pa`          | ✅ |  |
+| Romanian              | `ro`          | ✅ |  |
+| Russian               | `ru`          | ✅ |  |
+| Serbian               | `sr`          | ✅ |  |
+| Shona                 | `sn`          | ✅ |  |
+| Sindhi                | `sd`          | ✅ |  |
+| Slovak                | `sk`          | ✅ |  |
+| Slovenian             | `sl`          | ✅ |  |
+| Somali                | `so`          | ✅ |  |
+| Sorani Kurdish        | `ku`          | ✅ |  |
+| Spanish (Latin American) | `es`       | ✅ |  |
+| Swahili               | `sw`          | ✅ |  |
+| Swedish               | `sv`          | ✅ |  |
+| Tajik                 | `tg`          | ✅ |  |
+| Tamil                 | `ta`          | ✅ |  |
+| Telugu                | `te`          | ✅ |  |
+| Thai                  | `th`          | ✅ |  |
+| Turkish               | `tr`          | ✅ |  |
+| Ukrainian             | `uk`          | ✅ |  |
+| Umbundu               | `umb`         | ✅ |  |
+| Urdu                  | `ur`          | ✅ |  |
+| Uzbek                 | `uz`          | ✅ |  |
+| Vietnamese            | `vi`          | ✅ |  |
+| Welsh                 | `cy`          | ✅ |  |
+| Wolof                 | `wo`          | ✅ |  |
+| Xhosa                 | `xh`          | ✅ |  |
+| Yoruba                | `yo`          | ✅ |  |
+| Zulu                  | `zu`          | ✅ |  |
Original file line number	Diff line number	Diff line change
`@@ -338,4 +338,4 @@ Connection: close`
`338`	`338`
`339`	`339`	`## Next steps`
`340`	`340`
`341`		`-[Multi-modal embeddings concepts](../concept-image-retrieval.md)`
	`341`	`+[Multimodal embeddings concepts](../concept-image-retrieval.md)`