File: articles/ai-foundry/how-to/use-image-models.md
Lines changed: 40 additions & 9 deletions
@@ -1,7 +1,7 @@
 ---
 title: How to use image models in the model catalog
 titleSuffix: Azure AI Foundry
-description: Learn how to use image models from the AI Foundry model catalog.
+description: Learn how to use image-to-text models from the AI Foundry model catalog.
 manager: scottpolly
 author: msakande
 reviewer: frogglew
@@ -13,9 +13,13 @@ ms.reviewer: frogglew
 ms.custom: references_regions, tool_generated
 ---
 
-# How to use image models in the model catalog
+# How to use image-to-text models in the model catalog
 
-This article explains how to use _image_ models in the AI Foundry model catalog. Some models have unique parameters or data format requirements.
+This article explains how to use _image-to-text_ models in the AI Foundry model catalog.
+
+Image-to-text models are designed to analyze images and generate descriptive text based on what they see. Think of them as a combination of a camera and a writer. You provide an image as input, and the model looks at the image and identifies different elements within it, like objects, people, scenes, and even text. Based on its analysis, the model then generates a written description of the image, summarizing what it sees.
+
+Image-to-text models excel at use cases such as accessibility features, content organization (tagging), creating product and educational visual descriptions, and digitizing content (via optical character recognition). Image-to-text models bridge the gap between visual content and written language, making information more accessible and easier to process in various contexts.
 
 ## Prerequisites
 
@@ -31,7 +35,7 @@ To use image models in your application, you need:
 
 - The endpoint URL and key.
 
-## Use image model
+## Use image-to-text model
 
 1. Authenticate using an API key. First, deploy the model to generate the endpoint URL and an API key to authenticate against the service. In this example, the endpoint and key are strings holding the endpoint URL and the API key. The API endpoint URL and API key can be found on the **Deployments + Endpoint** page once the model is deployed.
 
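Note on step 1 of this hunk: the article says the endpoint and key are plain strings. A minimal Python sketch of that setup follows. The environment variable names, the placeholder values, and the `Bearer` authorization scheme are all assumptions made for illustration and are not taken from the article; check the **Deployments + Endpoint** page for the header your deployment actually expects.

```python
import os


def build_auth_headers(api_key: str) -> dict[str, str]:
    """Return HTTP headers that carry the API key.

    The Bearer scheme is an assumption; some deployments may expect a
    different header for the key.
    """
    if not api_key:
        raise ValueError("API key is empty; copy it from the deployment page")
    return {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {api_key}",
    }


# Hypothetical variable names; the placeholder defaults keep the sketch runnable.
endpoint = os.environ.get("AZURE_AI_ENDPOINT", "https://<your-endpoint>")
key = os.environ.get("AZURE_AI_KEY", "<your-api-key>")
headers = build_auth_headers(key)
```

Reading the key from the environment rather than hard-coding it keeps the secret out of source control.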
@@ -65,14 +69,16 @@ To use image models in your application, you need:
-Some image models only support specific data formats. Mistral OCR 25.03, for example, requires `base64 encoded image data` for their `document_url` parameter. The following table lists the supported and unsupported data formats for image models in the model catalog.
+Some image-to-text models only support specific data formats. Mistral OCR 25.03, for example, requires `base64 encoded image data` for their `document_url` parameter. The following table lists the supported and unsupported data formats for image models in the model catalog.
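Note on the data format requirement in this line: the sketch below shows one way to produce base64-encoded image data for a `document_url` field in Python. The `data:` URL wrapper, the payload layout, and the helper names `to_data_url` and `build_ocr_payload` are assumptions made for illustration; only the `document_url` parameter name and the base64 requirement come from the article.

```python
import base64


def to_data_url(image_bytes: bytes, mime_type: str = "image/png") -> str:
    """Base64-encode raw image bytes and wrap them in a data URL (assumed format)."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"


def build_ocr_payload(image_bytes: bytes, model: str, mime_type: str = "image/png") -> dict:
    """Build a request body; the field layout here is hypothetical."""
    return {
        "model": model,
        "document": {
            "type": "document_url",
            "document_url": to_data_url(image_bytes, mime_type),
        },
    }


# Usage: in practice the bytes would come from a file, e.g. Path("scan.png").read_bytes().
payload = build_ocr_payload(b"hi", model="<deployment-name>")
```

Encoding the image inline avoids hosting it at a public URL, at the cost of a request body roughly one third larger than the raw bytes.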