You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/openai/concepts/gpt-with-vision.md
+20-1Lines changed: 20 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -62,13 +62,32 @@ Base Pricing for GPT-4 Turbo with Vision is:
62
62
63
63
See the [Tokens section of the overview](/azure/ai-services/openai/overview#tokens) for information on how text and images translate to tokens.
64
64
65
+
If you turn on Enhancements to employ Azure Computer Vision foundational models to enhance the capabilities with GPT-4 Turbo with Vision, this does incur additional costs.
66
+
- Any image with text will incur usage for the **Enhanced add on features for Optical Character Recognition**: $1.50 per 1000 transactions
67
+
- Any image with objects detected will incur usage for the **Enhanced add-on features for Object Grounding**: $1.50 per 1000 transactions
68
+
65
69
Additionally, if you use video prompt integration with the Video Retrieval add-on, it accrues other costs:
66
70
- Ingestion: $0.05 per minute of video
67
71
- Transactions: $0.25 per 1000 queries of the Video Retrieval index
68
72
69
73
Processing videos involves the use of extra tokens to identify key frames for analysis. The number of these additional tokens will be roughly equivalent to the sum of the tokens in the text input, plus 700 tokens.
70
74
71
-
### Example price calculation
75
+
### Example image price calculation
76
+
> [!IMPORTANT]
77
+
> The following content is an example only, and prices are subject to change in the future.
78
+
79
+
For a typical use case, take an image with both visible objects and text and a 100-token prompt input. When the service processes the prompt, it generates 100 tokens of output. In the image, both text and objects can be detected. The price of this transaction would be:
0 commit comments