|
1 | 1 | ---
|
2 | 2 | title: AI Endpoints - Billing and lifecycle
|
3 | 3 | excerpt: Learn how we bill AI Endpoints
|
4 |
| -updated: 2025-04-28 |
| 4 | +updated: 2025-07-29 |
5 | 5 | ---
|
6 | 6 |
|
7 | 7 | > [!primary]
|
@@ -38,33 +38,34 @@ Here is the model billing overview for AI Endpoints.
|
38 | 38 | > In appreciation of their continued support, our **Beta testers can keep using their existing API access keys, create new ones, and won't be billed until 31st May**. After this date, pricing will apply to them as outlined in the table below, which details the categories, models, and their respective pricing information:
|
39 | 39 | >
|
40 | 40 |
|
41 |
| -| Category | Model | Price ($) | Price (€) | Unit Price | |
42 |
| -| -------------- | --------------- | ------ | ------ | ---------- | |
43 |
| -| Large Language Model (LLM) | Llama 3.3 70B Instruct | 0.70 | 0.67 | per 1M tokens | |
44 |
| -| Large Language Model (LLM) | Llama 3.1 70B Instruct | 0.70 | 0.67 | per 1M tokens | |
45 |
| -| Large Language Model (LLM) | Mixtral 8x7B Instruct v0.1 | 0.65 | 0.63 | per 1M tokens | |
46 |
| -| Large Language Model (LLM) | Mistral-Nemo-Instruct-2407 | 0.14 | 0.13 | per 1M tokens | |
47 |
| -| Large Language Model (LLM) | Llama 3.1 8B Instruct | 0.10 | 0.10 | per 1M tokens | |
48 |
| -| Large Language Model (LLM) | Mistral 7B Instruct v0.3 | 0.10 | 0.10 | per 1M tokens | |
49 |
| -| Reasoning LLM | DeepSeek R1 | Free | Free | per 1M tokens | |
50 |
| -| Reasoning LLM | DeepSeek R1 Distill Llama 70B | 0.70 | 0.67 | per 1M tokens | |
51 |
| -| Code LLM | Qwen2.5 Coder 32B Instruct | 0.90 | 0.87 | per 1M tokens | |
52 |
| -| Code LLM | Mamba Codestral 7B v0.1 | 0.20 | 0.19 | per 1M tokens | |
53 |
| -| Visual LLM | Qwen2.5 VL 72B Instruct | 0.95 | 0.91 | per 1M tokens | |
54 |
| -| Visual LLM | Llava Next Mistral 7B | 0.30 | 0.29 | per 1M tokens | |
55 |
| -| Embeddings | BGE Multilingual Gemma2 | 0.01 | 0.01 | per 1M tokens | |
56 |
| -| Embeddings | BGE-M3 | 0.01 | 0.01 | per 1M tokens | |
57 |
| -| Embeddings | BGE Base EN v1.5 | 0.01 | 0.005 | per 1M tokens | |
58 |
| -| Natural Language Processing (NLP) | Roberta Base Go Emotions | Free | Free | per 1M characters | |
59 |
| -| Natural Language Processing (NLP) | Bert Base Multilingual uncased sentiment | Free | Free | per 1M characters | |
60 |
| -| Natural Language Processing (NLP) | Bert Base NER | Free | Free | per 1M characters | |
61 |
| -| Natural Language Processing (NLP) | Bart Large CNN | Free | Free | per 1M characters | |
62 |
| -| Image generation| Stable Diffusion XL | Free | Free | per image | |
63 |
| -| Speech to Text | RIVA Automatic Speech Recognition | Free | Free | per hour | |
64 |
| -| Text to Speech | RIVA Text-to-Speech | Free | Free | per hour | |
65 |
| -| Translation | T5-Large | Free | Free | per 1M characters | |
66 |
| -| Computer vision | YOLOv11 Object Detection | Free | Free | per image | |
67 |
| -| Computer vision | YOLOv11 Image Segmentation | Free | Free | per image | |
| 41 | +| Category | Model | Input Price (\$) | Output Price (\$) | Input Price (€) | Output Price (€) | Unit Price | |
| 42 | +| -------------------------- | -------------------------- | ---------------- | ----------------- | --------------- | ---------------- | --------------------------------- | |
| 43 | +| Large Language Model (LLM) | Llama 3.3 70B Instruct | 0.74 | 0.74 | 0.67 | 0.67 | per 1 million tokens | |
| 44 | +| Large Language Model (LLM) | Llama 3.1 70B Instruct | 0.74 | 0.74 | 0.67 | 0.67 | per 1 million tokens | |
| 45 | +| Large Language Model (LLM) | Mixtral 8x7B Instruct v0.1 | 0.70 | 0.70 | 0.63 | 0.63 | per 1 million tokens | |
| 46 | +| Large Language Model (LLM) | Mistral Nemo Instruct 2407 | 0.14 | 0.14 | 0.13 | 0.13 | per 1 million tokens | |
| 47 | +| Large Language Model (LLM) | Llama 3.1 8B Instruct | 0.11 | 0.11 | 0.10 | 0.10 | per 1 million tokens | |
| 48 | +| Large Language Model (LLM) | Mistral 7B Instruct v0.3 | 0.11 | 0.11 | 0.10 | 0.10 | per 1 million tokens | |
| 49 | +| Reasoning LLM | Qwen 3 32B | 0.09 | 0.25 | 0.08 | 0.23 | per 1 million tokens | |
| 50 | +| Reasoning LLM | DeepSeek R1 Distill Llama 70B | 0.74 | 0.74 | 0.67 | 0.67 | per 1 million tokens | |
| 51 | +| Code LLM | Qwen 2.5 Coder 32B Instruct | 0.96 | 0.96 | 0.87 | 0.87 | per 1 million tokens | |
| 52 | +| Code LLM | Mamba Codestral 7B v0.1 | 0.21 | 0.21 | 0.19 | 0.19 | per 1 million tokens | |
| 53 | +| Visual LLM | Mistral Small 3.2 24B Instruct 2506 | 0.10 | 0.31 | 0.09 | 0.28 | per 1 million tokens | |
| 54 | +| Visual LLM | Qwen 2.5 VL 72B Instruct | 1.01 | 1.01 | 0.91 | 0.91 | per 1 million tokens | |
| 55 | +| Visual LLM | Llava Next Mistral 7B | 0.32 | 0.32 | 0.29 | 0.29 | per 1 million tokens | |
| 56 | +| Embeddings | BGE Multilingual Gemma2 | 0.01 | 0.01 | Free | Free | per 1 million tokens | |
| 57 | +| Embeddings | BGE M3 | 0.01 | 0.01 | Free | Free | per 1 million tokens | |
| 58 | +| Embeddings | BGE Base EN v1.5 | 0.01 | 0.01 | Free | Free | per 1 million tokens | |
| 59 | +| Natural Language Processing (NLP) | Roberta Base Go Emotions | Free | Free | Free | Free | per 1 million characters | |
| 60 | +| Natural Language Processing (NLP) | Bert Base Multilingual uncased sentiment | Free | Free | Free | Free | per 1 million characters | |
| 61 | +| Natural Language Processing (NLP) | Bert Base NER | Free | Free | Free | Free | per 1 million characters | |
| 62 | +| Natural Language Processing (NLP) | Bart Large CNN | Free | Free | Free | Free | per 1 million characters | |
| 63 | +| Image generation | Stable Diffusion XL | Free | Free | Free | Free | per image | |
| 64 | +| Audio Analysis | RIVA Automatic Speech Recognition | Free | Free | Free | Free | per hour | |
| 65 | +| Audio Analysis | RIVA Text-to-Speech | Free | Free | Free | Free | per hour | |
| 66 | +| Translation | T5-Large | Free | Free | Free | Free | per 1 million characters | |
| 67 | +| Computer vision | YOLOv11 Object Detection | Free | Free | Free | Free | per image | |
| 68 | +| Computer vision | YOLOv11 Image Segmentation | Free | Free | Free | Free | per image | |
68 | 69 |
|
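Because the updated table prices input and output tokens separately, a rough cost estimate needs both token counts. The sketch below is illustrative only: `estimate_cost` is a hypothetical helper rather than part of any AI Endpoints SDK, it assumes cost scales linearly with token count, and it ignores rounding and billing granularity, which this page does not specify. The example prices are the euro values from the Qwen 3 32B row (0.08 input, 0.23 output, per 1 million tokens).

```python
from decimal import Decimal

TOKENS_PER_UNIT = Decimal(1_000_000)  # table prices are quoted per 1 million tokens


def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price: Decimal, output_price: Decimal) -> Decimal:
    """Estimate the cost of a workload from per-1M-token prices.

    Hypothetical helper: assumes linear pricing with no rounding or minimum charge.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_UNIT * input_price
    output_cost = Decimal(output_tokens) / TOKENS_PER_UNIT * output_price
    return input_cost + output_cost


# Euro prices copied from the Qwen 3 32B row of the table above.
qwen3_input_eur = Decimal("0.08")
qwen3_output_eur = Decimal("0.23")

# Example workload: 2 million input tokens and 500,000 output tokens.
cost = estimate_cost(2_000_000, 500_000, qwen3_input_eur, qwen3_output_eur)
print(f"Estimated cost: {cost} EUR")
```

`Decimal` is used instead of `float` so that small per-token amounts do not pick up binary floating-point error when summed across many requests.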
69 | 70 | ## Feedback
|
70 | 71 |
|