
Commit 3e3fb04

Merge branch 'main' into release-public-preview-translator-v4

2 parents a14cb5e + 81c7e68

File tree

21 files changed: +1137 -254 lines changed

articles/ai-foundry/concepts/models-featured.md

Lines changed: 1 addition & 1 deletion
@@ -262,7 +262,7 @@ Mistral AI offers two categories of models, namely:
  | [Mistral-Large-2411](https://ai.azure.com/explore/models/Mistral-Large-2411/version/2/registry/azureml-mistral) | [chat-completion](../model-inference/how-to/use-chat-completions.md?context=/azure/ai-foundry/context/context) | - **Input:** text (128,000 tokens) <br /> - **Output:** text (4,096 tokens) <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON |
  | [Mistral-large-2407](https://ai.azure.com/explore/models/Mistral-large-2407/version/1/registry/azureml-mistral) <br /> (deprecated) | [chat-completion](../model-inference/how-to/use-chat-completions.md?context=/azure/ai-foundry/context/context) | - **Input:** text (131,072 tokens) <br /> - **Output:** text (4,096 tokens) <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON |
  | [Mistral-large](https://ai.azure.com/explore/models/Mistral-large/version/1/registry/azureml-mistral) <br /> (deprecated) | [chat-completion](../model-inference/how-to/use-chat-completions.md?context=/azure/ai-foundry/context/context) | - **Input:** text (32,768 tokens) <br /> - **Output:** text (4,096 tokens) <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON |
- | [Mistral-OCR-2503](https://aka.ms/aistudio/landing/mistral-ocr-2503) | image to text | - **Input:** image or PDF pages (1,000 pages, max 50MB PDF file) <br> - **Output:** text <br /> - **Tool calling:** No <br /> - **Response formats:** Text, JSON, Markdown |
+ | [Mistral-OCR-2503](https://aka.ms/aistudio/landing/mistral-ocr-2503) | [image to text](../how-to/use-image-models.md) | - **Input:** image or PDF pages (1,000 pages, max 50MB PDF file) <br> - **Output:** text <br /> - **Tool calling:** No <br /> - **Response formats:** Text, JSON, Markdown |
  | [Mistral-small-2503](https://aka.ms/aistudio/landing/mistral-small-2503) | [chat-completion (with images)](../model-inference/how-to/use-chat-multi-modal.md?context=/azure/ai-foundry/context/context) | - **Input:** text and images (131,072 tokens), <br> image-based tokens are 16px x 16px <br> blocks of the original images <br /> - **Output:** text (4,096 tokens) <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON |
  | [Mistral-small](https://ai.azure.com/explore/models/Mistral-small/version/1/registry/azureml-mistral) | [chat-completion](../model-inference/how-to/use-chat-completions.md?context=/azure/ai-foundry/context/context) | - **Input:** text (32,768 tokens) <br /> - **Output:** text (4,096 tokens) <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON |

articles/ai-foundry/how-to/use-image-models.md (new file)

Lines changed: 149 additions & 0 deletions

@@ -0,0 +1,149 @@
---
title: How to use image-to-text models in the model catalog
titleSuffix: Azure AI Foundry
description: Learn how to use image-to-text models from the AI Foundry model catalog.
manager: scottpolly
author: msakande
reviewer: frogglew
ms.service: azure-ai-model-inference
ms.topic: how-to
ms.date: 05/02/2025
ms.author: mopeakande
ms.reviewer: frogglew
ms.custom: references_regions, tool_generated
---

# How to use image-to-text models in the model catalog

This article explains how to use _image-to-text_ models in the AI Foundry model catalog.

Image-to-text models analyze images and generate descriptive text based on what they see. Think of them as a combination of a camera and a writer: you provide an image as input, the model identifies the elements within it, such as objects, people, scenes, and even text, and it then generates a written description summarizing what it sees.

Image-to-text models excel at use cases such as accessibility features, content organization (tagging), product and educational visual descriptions, and digitizing content via Optical Character Recognition (OCR). In short, image-to-text models bridge the gap between visual content and written language, making information more accessible and easier to process in various contexts.

## Prerequisites

To use image models in your application, you need:

- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
- An [Azure AI Foundry project](create-projects.md).
- An image model deployment on Azure AI Foundry.
  - This article uses a __Mistral OCR__ model deployment.
- The endpoint URL and key.

## Use an image-to-text model

1. Authenticate using an API key. First, deploy the model to generate the endpoint URL and an API key to authenticate against the service. In this example, the endpoint and key are strings holding the endpoint URL and the API key. You can find both on the **Deployments + Endpoint** page after the model is deployed.

    If you're using Bash:

    ```bash
    export AZURE_API_KEY="<your-api-key>"
    ```

    If you're in PowerShell:

    ```powershell
    $Env:AZURE_API_KEY = "<your-api-key>"
    ```

    If you're using Windows command prompt:

    ```cmd
    set AZURE_API_KEY=<your-api-key>
    ```

1. Run a basic code sample. Different image models accept different data formats. In this example, _Mistral OCR 25.03_ supports only base64-encoded data; document URLs and image URLs aren't supported. Paste the following code into a shell:

    ```bash
    curl --request POST \
      --url https://<your_serverless_endpoint>/v1/ocr \
      --header 'Authorization: Bearer <api_key>' \
      --header 'Content-Type: application/json' \
      --data '{
        "model": "mistral-ocr-2503",
        "document": {
            "type": "document_url",
            "document_name": "test",
            "document_url": "data:application/pdf;base64,JVBER... <replace with your base64 encoded image data>"
        }
    }'
    ```
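Building the base64 data URL by hand is error-prone, so a small helper can standardize it. The following is a sketch of my own (the `to_data_url` function and its extension-to-MIME mapping aren't part of the official samples); it assumes a GNU or BSD `base64` utility is on the path:

```shell
#!/usr/bin/env bash
# Wrap a local file in a base64 data URL, picking the MIME type from
# the file extension. Mistral OCR expects application/pdf for PDFs and
# an image MIME type such as image/png for images.
to_data_url() {
  local path="$1" mime b64
  case "${path##*.}" in
    pdf)      mime="application/pdf" ;;
    png)      mime="image/png" ;;
    jpg|jpeg) mime="image/jpeg" ;;
    *) echo "unsupported file type: $path" >&2; return 1 ;;
  esac
  # GNU base64 wraps output at 76 columns by default; -w 0 keeps it on
  # one line so the resulting JSON stays valid. Fall back for BSD base64.
  b64=$(base64 -w 0 < "$path" 2>/dev/null) || b64=$(base64 < "$path" | tr -d '\n')
  printf 'data:%s;base64,%s' "$mime" "$b64"
}
```

With this helper, the `document_url` value in the request above could be written as `"$(to_data_url mydoc.pdf)"`.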

## More code samples for Mistral OCR 25.03

To process PDF files:

```bash
# Read the PDF file; strip newlines so the data URL stays valid JSON
input_file_path="assets/2201.04234v3.pdf"
base64_value=$(base64 "$input_file_path" | tr -d '\n')
input_base64_value="data:application/pdf;base64,${base64_value}"
# echo $input_base64_value

# Prepare JSON data
payload_body=$(cat <<EOF
{
    "model": "mistral-ocr-2503",
    "document": {
        "type": "document_url",
        "document_url": "$input_base64_value"
    },
    "include_image_base64": true
}
EOF
)

# Process the base64 data with the OCR endpoint
echo "$payload_body" | curl ${AZURE_AI_CHAT_ENDPOINT}/v1/ocr \
    -H "Content-Type: application/json" \
    -H "Authorization: Bearer ${AZURE_AI_CHAT_KEY}" \
    -d @- -o ocr_pdf_output.json
```
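The result lands in `ocr_pdf_output.json`. Assuming the response carries one `markdown` field per recognized page (this response shape is an assumption, so verify it against your own output), a quick `grep` can serve as a sanity check that the OCR ran:

```shell
# Count per-page "markdown" fields in a saved OCR response file as a
# quick sanity check that the service returned recognized text.
count_ocr_pages() {
  grep -o '"markdown"' "$1" | wc -l | tr -d '[:space:]'
}
```

For example, `count_ocr_pages ocr_pdf_output.json` should print the number of pages processed.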

To process an image file:

```bash
# Read the image file; strip newlines so the data URL stays valid JSON
input_file_path="assets/receipt.png"
base64_value=$(base64 "$input_file_path" | tr -d '\n')
input_base64_value="data:image/png;base64,${base64_value}"
# echo $input_base64_value

# Prepare JSON data
payload_body=$(cat <<EOF
{
    "model": "mistral-ocr-2503",
    "document": {
        "type": "image_url",
        "image_url": "$input_base64_value"
    },
    "include_image_base64": true
}
EOF
)

# Process the base64 data with the OCR endpoint
echo "$payload_body" | curl ${AZURE_AI_CHAT_ENDPOINT}/v1/ocr \
    -H "Content-Type: application/json" \
    -H "Authorization: Bearer ${AZURE_AI_CHAT_KEY}" \
    -d @- -o ocr_png_output.json
```

## Model-specific parameters

Some image-to-text models support only specific data formats. Mistral OCR 25.03, for example, requires base64-encoded image data for its `document_url` parameter. The following table lists the supported and unsupported data formats for image models in the model catalog.

| Model | Supported | Not supported |
| :---- | ----- | ----- |
| Mistral OCR 25.03 | Base64-encoded image data | Document URL, image URL |
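The distinction in the table matters when scripting: the request body's `document.type` must be `document_url` for PDF input and `image_url` for image input, with the matching field name. The helper below sketches that branching (the `build_ocr_payload` name is mine, not an SDK function):

```shell
# Build a Mistral OCR request body, choosing "document_url" for PDF
# input and "image_url" for image input, per the table above.
build_ocr_payload() {
  local data_url="$1" kind="$2" field
  case "$kind" in
    pdf)          field="document_url" ;;
    png|jpg|jpeg) field="image_url" ;;
    *) echo "unsupported input kind: $kind" >&2; return 1 ;;
  esac
  printf '{"model":"mistral-ocr-2503","document":{"type":"%s","%s":"%s"},"include_image_base64":true}' \
    "$field" "$field" "$data_url"
}
```

The emitted JSON can be piped straight into the curl commands shown earlier.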

## Related content

- [How to use image generation models on Azure OpenAI](../../ai-services/openai/how-to/dall-e.md)

articles/ai-foundry/model-inference/includes/use-embeddings/python.md

Lines changed: 1 addition & 1 deletion
@@ -52,7 +52,7 @@ If you have configured the resource to with **Microsoft Entra ID** support, you
  ```python
  import os
  from azure.ai.inference import EmbeddingsClient
- from azure.core.credentials import AzureKeyCredential
+ from azure.identity import DefaultAzureCredential

  model = EmbeddingsClient(
      endpoint=os.environ["AZURE_INFERENCE_ENDPOINT"],

articles/ai-foundry/toc.yml

Lines changed: 2 additions & 0 deletions
@@ -130,6 +130,8 @@ items:
      href: ../ai-foundry/model-inference/how-to/use-chat-reasoning.md?context=/azure/ai-foundry/context/context
  - name: Work with multimodal models
    href: ../ai-foundry/model-inference/how-to/use-chat-multi-modal.md?context=/azure/ai-foundry/context/context
+ - name: Work with image models
+   href: how-to/use-image-models.md
  - name: Azure OpenAI and AI services
    items:
    - name: Use Azure OpenAI Service in Azure AI Foundry portal
Lines changed: 18 additions & 15 deletions
@@ -1,36 +1,39 @@
  ---
- title: "Azure AI Foundry docs: What's new for March 2025"
- description: "What's new in the Azure AI Foundry docs for March 2025."
+ title: "Azure AI Foundry docs: What's new for April 2025"
+ description: "What's new in the Azure AI Foundry docs for April 2025."
  ms.author: smcdowell
  author: skpmcdowell
  ms.topic: whats-new
  ms.subject: ai-studio
- ms.custom: March-2025
- ms.date: 04/02/2025
+ ms.custom: April-2025
+ ms.date: 05/03/2025
  ---

- # Azure AI Foundry docs: What's new for March 2025
+ # Azure AI Foundry docs: What's new for April 2025

- Welcome to what's new in the Azure AI Foundry docs for March 2025. This article lists some of the major changes to docs during this period.
+ Welcome to what's new in the Azure AI Foundry docs for April 2025. This article lists some of the major changes to docs during this period.


  ## Azure AI Foundry

  ### New articles

- - [Featured models of Azure AI Foundry](../ai-foundry/concepts/models-featured.md)
- - [How to deploy NVIDIA Inference Microservices](../ai-foundry/how-to/deploy-nvidia-inference-microservice.md)
- - [How to use image and audio in chat completions with Azure AI model inference](../ai-foundry/model-inference/how-to/use-chat-multi-modal.md)
- - [Tutorial: Get started with DeepSeek-R1 reasoning model in Azure AI model inference](../ai-foundry/model-inference/tutorials/get-started-deepseek-r1.md)
+ - [AI Red Teaming Agent (preview)](../ai-foundry/concepts/ai-red-teaming-agent.md)
+ - [Evaluate your AI agents locally with Azure AI Evaluation SDK (preview)](../ai-foundry/how-to/develop/agent-evaluate-sdk.md)
+ - [How to use structured outputs for chat models](../ai-foundry/model-inference/how-to/use-structured-outputs.md)
+ - [Run automated safety scans with AI Red Teaming Agent (preview)](../ai-foundry/how-to/develop/run-scans-ai-red-teaming-agent.md)
+ - [Work with Azure AI Agent Service in Visual Studio Code (Preview)](../ai-foundry/how-to/develop/vs-code-agents.md)
+ - [Work with the Azure AI Foundry for Visual Studio Code extension (Preview)](../ai-foundry/how-to/develop/get-started-projects-vs-code.md)


  ### Updated articles

- - [Deploy a flow for real-time inference](../ai-foundry/how-to/flow-deploy.md)
- - [Fine-tune models using serverless APIs in Azure AI Foundry](../ai-foundry/how-to/fine-tune-serverless.md)
- - [How to deploy and inference a managed compute deployment with code](../ai-foundry/how-to/deploy-models-managed.md)
- - [How to trace your application with Azure AI Foundry project library](../ai-foundry/how-to/develop/trace-local-sdk.md)
- - [Monitor quality and token usage of deployed prompt flow applications](../ai-foundry/how-to/monitor-quality-safety.md)
+ - [Evaluate your AI agents locally with Azure AI Evaluation SDK (preview)](../ai-foundry/how-to/develop/agent-evaluate-sdk.md)
+ - [Evaluate your Generative AI application locally with the Azure AI Evaluation SDK](../ai-foundry/how-to/develop/evaluate-sdk.md)
+ - [Evaluation and monitoring metrics for generative AI](../ai-foundry/concepts/evaluation-metrics-built-in.md)
+ - [Fine-tune models using serverless APIs in Azure AI Foundry](../ai-foundry/how-to/fine-tune-serverless.md)
+ - [How to configure a private link for Azure AI Foundry hubs](../ai-foundry/how-to/configure-private-link.md)
+ - [How to use MedImageParse healthcare AI models for segmentation of medical images](../ai-foundry/how-to/healthcare-ai/deploy-medimageparse.md)

articles/ai-services/computer-vision/index.yml

Lines changed: 21 additions & 21 deletions
@@ -32,27 +32,6 @@ highlightedContent:

  conceptualContent:
    items:
-   - title: Optical character recognition
-     links:
-     - itemType: overview
-       text: About OCR
-       url: overview-ocr.md
-     - itemType: quickstart
-       text: Get started with OCR
-       url: quickstarts-sdk/client-library.md
-     - itemType: how-to-guide
-       text: Call the Read API
-       url: how-to/call-read-api.md
-     - itemType: deploy
-       text: Use the Read OCR container
-       url: computer-vision-how-to-install-containers.md
-     - itemType: learn
-       text: Microsoft Learn training
-       url: /training/modules/read-text-images-documents-with-computer-vision-service/
-     - itemType: reference
-       text: OCR API reference
-       url: /rest/api/computervision/recognize-printed-text?view=rest-computervision-v3.2
    - title: Image Analysis
      links:
      - itemType: overview
@@ -119,6 +98,27 @@ conceptualContent:
    - itemType: how-to-guide
      text: Call the Video Retrieval APIs
      url: how-to/video-retrieval.md
+   - title: Optical character recognition (legacy)
+     links:
+     - itemType: overview
+       text: About OCR
+       url: overview-ocr.md
+     - itemType: quickstart
+       text: Get started with OCR
+       url: quickstarts-sdk/client-library.md
+     - itemType: how-to-guide
+       text: Call the Read API
+       url: how-to/call-read-api.md
+     - itemType: deploy
+       text: Use the Read OCR container
+       url: computer-vision-how-to-install-containers.md
+     - itemType: learn
+       text: Microsoft Learn training
+       url: /training/modules/read-text-images-documents-with-computer-vision-service/
+     - itemType: reference
+       text: OCR API reference
+       url: /rest/api/computervision/recognize-printed-text?view=rest-computervision-v3.2

  tools:
    title: Software development kits (SDKs)

articles/ai-services/computer-vision/overview-ocr.md

Lines changed: 7 additions & 3 deletions
@@ -16,6 +16,12 @@ ms.custom: devx-track-csharp

  # OCR - Optical Character Recognition

+ > [!WARNING]
+ > This service, including the Azure AI Vision legacy [OCR API in v3.2](/rest/api/computervision/recognize-printed-text?view=rest-computervision-v3.2) and [RecognizeText API in v2.1](/rest/api/computervision/recognize-printed-text/recognize-printed-text?view=rest-computervision-v2.1), is not recommended for use.
+
+ [!INCLUDE [read-editions](includes/read-editions.md)]
+
  OCR or Optical Character Recognition is also referred to as text recognition or text extraction. Machine-learning-based OCR techniques allow you to extract printed or handwritten text from images such as posters, street signs and product labels, as well as from documents like articles, reports, forms, and invoices. The text is typically extracted as words, text lines, and paragraphs or text blocks, enabling access to digital version of the scanned text. This eliminates or significantly reduces the need for manual data entry.

@@ -24,10 +30,8 @@ OCR or Optical Character Recognition is also referred to as text recognition or

  Microsoft's **Read** OCR engine is composed of multiple advanced machine-learning based models supporting [global languages](./language-support.md). It can extract printed and handwritten text including mixed languages and writing styles. **Read** is available as cloud service and on-premises container for deployment flexibility. It's also available as a synchronous API for single, non-document, image-only scenarios with performance enhancements that make it easier to implement OCR-assisted user experiences.

- > [!WARNING]
- > The Azure AI Vision legacy [OCR API in v3.2](/rest/api/computervision/recognize-printed-text?view=rest-computervision-v3.2) and [RecognizeText API in v2.1](/rest/api/computervision/recognize-printed-text/recognize-printed-text?view=rest-computervision-v2.1) operations are not recommended for use.

- [!INCLUDE [read-editions](includes/read-editions.md)]
+

  ## How is OCR related to Intelligent Document Processing (IDP)?