Skip to content

Commit cc3b7bc

Browse files
committed
update prices, columns and add new models
1 parent 4b86f72 commit cc3b7bc

File tree

1 file changed

+29
-28
lines changed
  • pages/public_cloud/ai_machine_learning/endpoints_guide_04_billing_concept

1 file changed

+29
-28
lines changed

pages/public_cloud/ai_machine_learning/endpoints_guide_04_billing_concept/guide.en-gb.md

Lines changed: 29 additions & 28 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
title: AI Endpoints - Billing and lifecycle
33
excerpt: Learn how we bill AI Endpoints
4-
updated: 2025-04-28
4+
updated: 2025-07-29
55
---
66

77
> [!primary]
@@ -38,33 +38,34 @@ Here is the model billing overview for AI Endpoints.
3838
> In appreciation of their continued support, our **Beta testers will have the possibility to keep using their existing API access keys and create new ones and won't be billed until 31th May**. After this date, the pricing will be implemented for them and clearly outlined in the table below, which details the categories, models, and their respective pricing information:
3939
>
4040
41-
| Category | Model | Price ($) | Price (€) | Unit Price |
42-
| -------------- | --------------- | ------ | ------ | ---------- |
43-
| Large Language Model (LLM) | Llama 3.3 70B Instruct | 0.70 | 0.67 | per 1M tokens |
44-
| Large Language Model (LLM) | Llama 3.1 70B Instruct | 0.70 | 0.67 | per 1M tokens |
45-
| Large Language Model (LLM) | Mixtral 8x7B Instruct v0.1 | 0.65 | 0.63 | per 1M tokens |
46-
| Large Language Model (LLM) | Mistral-Nemo-Instruct-2407 | 0.14 | 0.13 | per 1M tokens |
47-
| Large Language Model (LLM) | Llama 3.1 8B Instruct | 0.10 | 0.10 | per 1M tokens |
48-
| Large Language Model (LLM) | Mistral 7B Instruct v0.3 | 0.10 | 0.10 | per 1M tokens |
49-
| Reasoning LLM | DeepSeek R1 | Free | Free | per 1M tokens |
50-
| Reasoning LLM | DeepSeek R1 Distill Llama 70B | 0.70 | 0.67 | per 1M tokens |
51-
| Code LLM | Qwen2.5 Coder 32B Instruct | 0.90 | 0.87 | per 1M tokens |
52-
| Code LLM | Mamba Codestral 7B v0.1 | 0.20 | 0.19 | per 1M tokens |
53-
| Visual LLM | Qwen2.5 VL 72B Instruct | 0.95 | 0.91 | per 1M tokens |
54-
| Visual LLM | Llava Next Mistral 7B | 0.30 | 0.29 | per 1M tokens |
55-
| Embeddings | BGE Multilingual Gemma2 | 0.01 | 0.01 | per 1M tokens |
56-
| Embeddings | BGE-M3 | 0.01 | 0.01 | per 1M tokens |
57-
| Embeddings | BGE Base EN v1.5 | 0.01 | 0.005 | per 1M tokens |
58-
| Natural Language Processing (NLP) | Roberta Base Go Emotions | Free | Free | per 1M characters |
59-
| Natural Language Processing (NLP) | Bert Base Multilingual uncased sentiment | Free | Free | per 1M characters |
60-
| Natural Language Processing (NLP) | Bert Base NER | Free | Free | per 1M characters |
61-
| Natural Language Processing (NLP) | Bart Large CNN | Free | Free | per 1M characters |
62-
| Image generation| Stable Diffusion XL | Free | Free | per image |
63-
| Speech to Text | RIVA Automatic Speech Recognition | Free | Free | per hour |
64-
| Text to Speech | RIVA Text-to-Speech | Free | Free | per hour |
65-
| Translation | T5-Large | Free | Free | per 1M characters |
66-
| Computer vision | YOLOv11 Object Detection | Free | Free | per image |
67-
| Computer vision | YOLOv11 Image Segmentation | Free | Free | per image |
41+
| Category | Model | Input Price (\$) | Output Price (\$) | Input Price (€) | Output Price (€) | Unit Price |
42+
| -------------------------- | -------------------------- | ---------------- | ----------------- | --------------- | ---------------- | --------------------------------- |
43+
| Large Language Model (LLM) | Llama 3.3 70B Instruct | 0.74 | 0.74 | 0.67 | 0.67 | per 1 million tokens |
44+
| Large Language Model (LLM) | Llama 3.1 70B Instruct | 0.74 | 0.74 | 0.67 | 0.67 | per 1 million tokens |
45+
| Large Language Model (LLM) | Mixtral 8x7B Instruct v0.1 | 0.70 | 0.70 | 0.63 | 0.63 | per 1 million tokens |
46+
| Large Language Model (LLM) | Mistral Nemo Instruct 2407 | 0.14 | 0.14 | 0.13 | 0.13 | per 1 million tokens |
47+
| Large Language Model (LLM) | Llama 3.1 8B Instruct | 0.11 | 0.11 | 0.10 | 0.10 | per 1 million tokens |
48+
| Large Language Model (LLM) | Mistral 7B Instruct v0.3 | 0.11 | 0.11 | 0.10 | 0.10 | per 1 million tokens |
49+
| Reasoning LLM | Qwen 3 32B | 0.09 | 0.25 | 0.08 | 0.23 | per 1 million tokens |
50+
| Reasoning LLM | DeepSeek R1 Distill Llama 70B | 0.74 | 0.74 | 0.67 | 0.67 | per 1 million tokens |
51+
| Code LLM | Qwen 2.5 Coder 32B Instruct | 0.96 | 0.96 | 0.87 | 0.87 | per 1 million tokens |
52+
| Code LLM | Mamba Codestral 7B v0.1 | 0.21 | 0.21 | 0.19 | 0.19 | per 1 million tokens |
53+
| Visual LLM | Mistral Small 3.2 24B Instruct 2506 | 0.10 | 0.31 | 0.09 | 0.28 | per 1 million tokens |
54+
| Visual LLM | Qwen 2.5 VL 72B Instruct | 1.01 | 1.01 | 0.91 | 0.91 | per 1 million tokens |
55+
| Visual LLM | Llava Next Mistral 7B | 0.32 | 0.32 | 0.29 | 0.29 | per 1 million tokens |
56+
| Embeddings | BGE Multilingual Gemma2 | 0.01 | 0.01 | Free | Free | per 1 million tokens |
57+
| Embeddings | BGE M3 | 0.01 | 0.01 | Free | Free | per 1 million tokens |
58+
| Embeddings | BGE Base EN v1.5 | 0.01 | 0.01 | Free | Free | per 1 million tokens |
59+
| Natural Language Processing (NLP) | Roberta Base Go Emotions | Free | Free | Free | Free | per 1 million characters |
60+
| Natural Language Processing (NLP) | Bert Base Multilingual uncased sentiment | Free | Free | Free | Free | per 1 million characters |
61+
| Natural Language Processing (NLP) | Bert Base NER | Free | Free | Free | Free | per 1 million characters |
62+
| Natural Language Processing (NLP) | Bart Large CNN | Free | Free | Free | Free | per 1 million characters |
63+
| Image generation | Stable Diffusion XL | Free | Free | Free | Free | per image |
64+
| Audio Analysis | RIVA Automatic Speech Recognition | Free | Free | Free | Free | per hour |
65+
| Audio Analysis | RIVA Text-to-Speech | Free | Free | Free | Free | per hour |
66+
| Translation | T5-Large | Free | Free | Free | Free | per 1 million characters |
67+
| Computer vision | YOLOv11 Object Detection | Free | Free | Free | Free | per image |
68+
| Computer vision | YOLOv11 Image Segmentation | Free | Free | Free | Free | per image |
6869

6970
## Feedback
7071

0 commit comments

Comments
 (0)