You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/gpt4v.md
+6-11Lines changed: 6 additions & 11 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,38 +1,33 @@
1
-
# Using GPT-4 Turbo with Vision
1
+
# Using GPT vision model with RAG approach
2
2
3
-
This repository now includes an example of integrating GPT-4 Turbo with Vision with Azure AI Search. This feature enables indexing and searching images and graphs, such as financial documents, in addition to text-based content.
3
+
This repository now includes an example of integrating a GPTVision model with Azure AI Search. This feature enables indexing and searching images and graphs, such as financial documents, in addition to text-based content, and then sending the retrieved content to the GPT model for response generation.
4
4
5
5
## Feature Overview
6
6
7
7
-**Document Handling:** Source documents are split into pages and saved as PNG files in blob storage. Each file's name and page number are embedded for reference.
8
8
-**Data Extraction:** Text data is extracted using OCR.
9
9
-**Data Indexing:** Text and image embeddings, generated using Azure AI Vision ([Azure AI Vision Embeddings](https://learn.microsoft.com/azure/ai-services/computer-vision/how-to/image-retrieval)), are indexed in Azure AI Search along with the raw text.
10
-
-**Search and Response:** Searches can be conducted using vectors or hybrid methods. Responses are generated by GPT-4 Turbo with Vision based on the retrieved content.
10
+
-**Search and Response:** Searches can be conducted using vectors or hybrid methods. Responses are generated by GPT vision model based on the retrieved content.
11
11
12
12
## Getting Started
13
13
14
14
### Prerequisites
15
15
16
16
- Create a [Computer Vision account in Azure Portal first](https://ms.portal.azure.com/#create/Microsoft.CognitiveServicesComputerVision), so that you can agree to the Responsible AI terms for that resource. You can delete that account after agreeing.
17
-
- The ability to deploy a GPT-4 Turbo with Vision model in the [supported regions](https://learn.microsoft.com/azure/ai-services/openai/concepts/models#gpt-4-and-gpt-4-turbo-preview-model-availability). If you're not sure, try to create a deployment from your Azure OpenAI deployments page. You should be able to select:
18
-
19
-
| Model | Version |
20
-
|--|--|
21
-
|`gpt-4`|`vision-preview`|
22
-
17
+
- The ability to deploy a gpt-4o model in the [supported regions](https://learn.microsoft.com/azure/ai-services/openai/concepts/models#standard-deployment-model-availability). If you're not sure, try to create a gpt-4o deployment from your Azure OpenAI deployments page.
23
18
- Ensure that you can deploy the Azure OpenAI resource group in [a region where all required components are available](https://learn.microsoft.com/azure/cognitive-services/openai/concepts/models#model-summary-table-and-region-availability):
24
19
- Azure OpenAI models
25
20
- gpt-35-turbo
26
21
- text-embedding-ada-002
27
-
- gpt-4v
22
+
- gpt-4o
28
23
-[Azure AI Vision](https://learn.microsoft.com/azure/ai-services/computer-vision/)
29
24
30
25
### Setup and Usage
31
26
32
27
1.**Update repository:**
33
28
Pull the latest changes.
34
29
35
-
2.**Enable GPT-4 Turbo with Vision:**
30
+
2.**Enable GPT vision approach:**
36
31
37
32
First, make sure you do *not* have integrated vectorization enabled, since that is currently incompatible:
0 commit comments