You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
|[`gemma-3-27b-it`](#gemma-3-27b-it)| Google | 32k | Text | H100, H100-2 |[Gemma](https://ai.google.dev/gemma/terms)|
22
+
|[`llama-3.1-70b-instruct`](#llama-31-70b-instruct)| Meta | up to 128k tokens | Text | H100, H100-2 |[Llama 3.1 community](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct/blob/main/LICENSE)|
23
+
|[`llama-3.1-8b-instruct`](#llama-31-8b-instruct)| Meta | up to 128k tokens | Text | L4, L40S, H100, H100-2 |[Llama 3.1 community](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct/blob/main/LICENSE)|
24
+
|[`llama-3-70b-instruct`](#llama-3-70b-instruct)| Meta | 8k tokens | Text | H100 |[Llama 3 community](https://huggingface.co/meta-llama/Meta-Llama-3-8B/blob/main/LICENSE)|
25
+
|[`llama-3.3-70b-instruct`](#llama-33-70b-instruct)| Meta | up to 131k tokens | Text | H100, H100-2 |[Llama 3.3 community](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct)|
26
+
|[`llama-3-nemotron-70b`](#llama-31-nemotron-70b-instruct)| Nvidia | up to 128k tokens | Text | H100, H100-2 |[Llama 3.1 community](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct/blob/main/LICENSE)|
27
+
|[`deepseek-r1-distill-70b`](#deepseek-r1-distill-llama-70b)| Deepseek | up to 131k tokens | Text | H100, H100-2 |[MIT](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B/blob/main/LICENSE) and [Llama 3.3 Community](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct/blob/main/LICENSE)|
28
+
|[`deepseek-r1-distill-8b`](#deepseek-r1-distill-llama-8b)| Deepseek | up to 131k tokens | Text | L4, L40S, H100 |[MIT](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B/blob/main/LICENSE) and [Llama 3.1 Community](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct/blob/main/LICENSE)|
|`bge-multilingual-gemma2`| No | No | English, French, Chinese, Japanese, Korean |
60
64
|`sentence-t5-xxl`| No | No | English |
61
65
66
+
62
67
## Model details
63
68
<Messagetype="note">
64
69
Despite efforts for accuracy, the possibility of generated text containing inaccuracies or [hallucinations](/managed-inference/concepts/#hallucinations) exists. Always verify the content generated independently.
0 commit comments