Skip to content

Commit 5e61387

Browse files
committed
review
1 parent 7e5f61e commit 5e61387

File tree

1 file changed

+11
-8
lines changed

1 file changed

+11
-8
lines changed

articles/ai-foundry/how-to/use-image-models.md

Lines changed: 11 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,13 +1,13 @@
11
---
2-
title: How to use image models in the model catalog
2+
title: How to use image-to-text models in the model catalog
33
titleSuffix: Azure AI Foundry
44
description: Learn how to use image-to-text models from the AI Foundry model catalog.
55
manager: scottpolly
66
author: msakande
77
reviewer: frogglew
88
ms.service: azure-ai-model-inference
99
ms.topic: how-to
10-
ms.date: 04/30/2025
10+
ms.date: 05/02/2025
1111
ms.author: mopeakande
1212
ms.reviewer: frogglew
1313
ms.custom: references_regions, tool_generated
@@ -17,9 +17,9 @@ ms.custom: references_regions, tool_generated
1717

1818
This article explains how to use _image-to-text_ models in the AI Foundry model catalog.
1919

20-
Image-to-text models are designed to analyze images and generate descriptive text based on what they see. Think of them as a combination of a camera and a writer. You will provide an image as an input to the model, and the model will look at the image and identifies different elements within it, like objects, people, scenes, and even text. Based on its analysis, the model then generates a written description of the image, summarizing what it sees.
20+
Image-to-text models are designed to analyze images and generate descriptive text based on what they see. Think of them as a combination of a camera and a writer. You provide an image as an input to the model, and the model looks at the image and identifies different elements within it, like objects, people, scenes, and even text. Based on its analysis, the model then generates a written description of the image, summarizing what it sees.
2121

22-
Image-to-text models excel at various use cases such as accessibility features, content organization (tagging), creating product and educational visual description, and digitizing content (via Optical Character Recognition). One might say image-to-text models bridge the gap between visual content and written language, making information more accessible and easier to process in various contexts.
22+
Image-to-text models excel at various use cases such as accessibility features, content organization (tagging), creating product and educational visual descriptions, and digitizing content via Optical Character Recognition (OCR). One might say image-to-text models bridge the gap between visual content and written language, making information more accessible and easier to process in various contexts.
2323

2424
## Prerequisites
2525

@@ -74,9 +74,10 @@ To use image models in your application, you need:
7474
}'
7575
```
7676
77-
**More code samples for Mistral OCR 25.03**
77+
## More code samples for Mistral OCR 25.03
78+
79+
To process PDF files:
7880
79-
**Processing PDF files**
8081
```bash
8182
# Read the pdf file
8283
input_file_path="assets/2201.04234v3.pdf"
@@ -102,8 +103,10 @@ echo "$payload_body" | curl ${AZURE_AI_CHAT_ENDPOINT}/v1/ocr \
102103
-H "Authorization: Bearer ${AZURE_AI_CHAT_KEY}" \
103104
-d @- -o ocr_pdf_output.json
104105
```
105-
**Processing an image file**
106-
```
106+
107+
To process an image file:
108+
109+
```bash
107110
# Read the image file
108111
input_file_path="assets/receipt.png"
109112
base64_value=$(base64 "$input_file_path")

0 commit comments

Comments
 (0)