Skip to content

Commit 80ab5af

Browse files
committed
Semi-generated page for all providers
1 parent b98a6be commit 80ab5af

23 files changed

+614
-20
lines changed

docs/inference-providers/_toctree.yml

Lines changed: 16 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,8 +13,24 @@
1313

1414
- title: Providers
1515
sections:
16+
- local: providers/cerebras
17+
title: Cerebras
18+
- local: providers/fal-ai
19+
title: Fal AI
20+
- local: providers/fireworks-ai
21+
title: Fireworks
22+
- local: providers/hyperbolic
23+
title: Hyperbolic
24+
- local: providers/nebius
25+
title: Nebius
26+
- local: providers/novita
27+
title: Novita
28+
- local: providers/replicate
29+
title: Replicate
1630
- local: providers/sambanova
1731
title: SambaNova
32+
- local: providers/together
33+
title: Together
1834
- title: API Reference
1935
sections:
2036
- local: tasks/index

docs/inference-providers/index.md

Lines changed: 16 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,22 @@ Hugging Face Inference Providers simplify and unify how developers access and ru
99

1010
To learn more about the launch of Inference Providers, check out our [announcement blog post](https://huggingface.co/blog/inference-providers).
1111

12+
## Partners
13+
14+
Here is the complete list of partners integrated with Inference Providers, and the supported tasks for each of them:
15+
16+
| Provider | Chat completion (LLM) | Chat completion (VLM) | Feature-extraction | Text-to-Image | Text-to-video |
17+
| ------------------------------------- | :-------------------: | :-------------------: | :----------------: | :-----------: | :-----------: |
18+
| [Cerebras](./providers/cerebras) || | | | |
19+
| [Fal AI](./providers/fal-ai) | | | |||
20+
| [Fireworks](./providers/fireworks-ai) ||| | | |
21+
| [Hyperbolic](./providers/hyperbolic) ||| | | |
22+
| [Nebius](./providers/nebius) ||| || |
23+
| [Novita](./providers/novita) ||| | ||
24+
| [Replicate](./providers/replicate) | | | |||
25+
| [SambaNova](./providers/sambanova) || || | |
26+
| [Together](./providers/together) ||| || |
27+
1228
## Why use Inference Providers?
1329

1430
Inference Providers offers a fast and simple way to explore thousands of models for a variety of tasks. Whether you're experimenting with ML capabilities or building a new application, this API gives you instant access to high-performing models across multiple domains:
@@ -28,14 +44,6 @@ Inference Providers offers a fast and simple way to explore thousands of models
2844
- **🔧 Developer-Friendly**: Simple requests, fast responses, and a consistent developer experience across Python and JavaScript clients.
2945
- **💰 Cost-Effective**: No extra markup on provider rates.
3046

31-
## Partners
32-
33-
Here is the complete list of partners integrated with Inference Providers, and the supported tasks for each of them:
34-
35-
| Provider | Chat completion | Feature-extraction | Text-to-Image | Text-to-video |
36-
| ---------------------------------- | :-------------: | :----------------: | :-----------: | :-----------: |
37-
| [SambaNova](./providers/sambanova) || | | |
38-
3947
## Inference Playground
4048

4149
To get started quickly with [Chat Completion models](http://huggingface.co/models?inference_provider=all&sort=trending&other=conversational), use the [Inference Playground](https://huggingface.co/playground) to easily test and compare models with your prompts.
Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,28 @@
1+
<!---
2+
WARNING
3+
4+
This markdown file has been generated from a script. Please do not edit it directly.
5+
6+
If you want to update the content related to cerebras's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/cerebras.handlebars`.
7+
8+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
9+
--->
10+
11+
# Cerebras
12+
13+
![https://www.cerebras.ai/](https://upload.wikimedia.org/wikipedia/commons/thumb/1/15/Cerebras_logo.svg/512px-Cerebras_logo.svg.png)
14+
15+
Cerebras stands alone as the world’s fastest AI inference and training platform. Organizations across fields like medical research, cryptography, energy, and agentic AI use our CS-2 and CS-3 systems to build on-premise supercomputers, while developers and enterprises everywhere can access the power of Cerebras through our pay-as-you-go cloud offerings.
16+
17+
## Supported tasks
18+
19+
20+
### Chat Completion (LLM)
21+
22+
Find out more about Chat Completion (LLM) [here](../tasks/chat-completion).
23+
24+
<InferenceSnippet
25+
pipeline=text-generation
26+
providersMapping={ {"cerebras":{"modelId":"meta-llama/Llama-3.3-70B-Instruct","providerModelId":"llama-3.3-70b"} } }
27+
conversational />
28+
Lines changed: 48 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,48 @@
1+
<!---
2+
WARNING
3+
4+
This markdown file has been generated from a script. Please do not edit it directly.
5+
6+
If you want to update the content related to fal-ai's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/fal-ai.handlebars`.
7+
8+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
9+
--->
10+
11+
# Fal
12+
13+
![https://fal.ai/](https://images.seeklogo.com/logo-png/61/1/fal-ai-logo-png_seeklogo-611592.png)
14+
15+
Founded in 2021 by Burkay Gur and Gorkem Yurtseven, fal.ai was born out of a shared passion for AI and a desire to address the challenges in AI infrastructure observed during their tenures at Coinbase and Amazon.
16+
17+
## Supported tasks
18+
19+
20+
### Automatic Speech Recognition
21+
22+
Find out more about Automatic Speech Recognition [here](../tasks/automatic_speech_recognition).
23+
24+
<InferenceSnippet
25+
pipeline=automatic-speech-recognition
26+
providersMapping={ {"fal-ai":{"modelId":"openai/whisper-large-v3","providerModelId":"fal-ai/whisper"} } }
27+
/>
28+
29+
30+
### Text To Image
31+
32+
Find out more about Text To Image [here](../tasks/text_to_image).
33+
34+
<InferenceSnippet
35+
pipeline=text-to-image
36+
providersMapping={ {"fal-ai":{"modelId":"black-forest-labs/FLUX.1-dev","providerModelId":"fal-ai/flux/dev"} } }
37+
/>
38+
39+
40+
### Text To Video
41+
42+
Find out more about Text To Video [here](../tasks/text_to_video).
43+
44+
<InferenceSnippet
45+
pipeline=text-to-video
46+
providersMapping={ {"fal-ai":{"modelId":"Wan-AI/Wan2.1-T2V-14B","providerModelId":"fal-ai/wan-t2v"} } }
47+
/>
48+
Lines changed: 37 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,37 @@
1+
<!---
2+
WARNING
3+
4+
This markdown file has been generated from a script. Please do not edit it directly.
5+
6+
If you want to update the content related to fireworks-ai's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/fireworks-ai.handlebars`.
7+
8+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
9+
--->
10+
11+
# Fireworks AI
12+
13+
![https://fireworks.ai/](https://d1.awsstatic.com/fireworks-ai-wordmark-color-dark.93b1f27fdf77899fa02afb949fb27317ee4081ad.png)
14+
15+
Fireworks AI is a developer-centric platform that delivers high-performance generative AI solutions, enabling efficient deployment and fine-tuning of large language models (LLMs) and image models.
16+
## Supported tasks
17+
18+
19+
### Chat Completion (LLM)
20+
21+
Find out more about Chat Completion (LLM) [here](../tasks/chat-completion).
22+
23+
<InferenceSnippet
24+
pipeline=text-generation
25+
providersMapping={ {"fireworks-ai":{"modelId":"deepseek-ai/DeepSeek-V3-0324","providerModelId":"accounts/fireworks/models/deepseek-v3-0324"} } }
26+
conversational />
27+
28+
29+
### Chat Completion (VLM)
30+
31+
Find out more about Chat Completion (VLM) [here](../tasks/chat-completion).
32+
33+
<InferenceSnippet
34+
pipeline=image-text-to-text
35+
providersMapping={ {"fireworks-ai":{"modelId":"meta-llama/Llama-4-Scout-17B-16E-Instruct","providerModelId":"accounts/fireworks/models/llama4-scout-instruct-basic"} } }
36+
conversational />
37+
Lines changed: 38 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,38 @@
1+
<!---
2+
WARNING
3+
4+
This markdown file has been generated from a script. Please do not edit it directly.
5+
6+
If you want to update the content related to hyperbolic's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/hyperbolic.handlebars`.
7+
8+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
9+
--->
10+
11+
# Hyperbolic
12+
13+
![https://hyperbolic.xyz/](https://cdn-images-1.medium.com/max/330/1*[email protected])
14+
15+
Hyperbolic is building an open-access platform for AI development by aggregating idle computing resources and making it seamlessly simple to use them.
16+
17+
## Supported tasks
18+
19+
20+
### Chat Completion (LLM)
21+
22+
Find out more about Chat Completion (LLM) [here](../tasks/chat-completion).
23+
24+
<InferenceSnippet
25+
pipeline=text-generation
26+
providersMapping={ {"hyperbolic":{"modelId":"deepseek-ai/DeepSeek-V3-0324","providerModelId":"deepseek-ai/DeepSeek-V3-0324"} } }
27+
conversational />
28+
29+
30+
### Chat Completion (VLM)
31+
32+
Find out more about Chat Completion (VLM) [here](../tasks/chat-completion).
33+
34+
<InferenceSnippet
35+
pipeline=image-text-to-text
36+
providersMapping={ {"hyperbolic":{"modelId":"Qwen/Qwen2.5-VL-7B-Instruct","providerModelId":"Qwen/Qwen2.5-VL-7B-Instruct"} } }
37+
conversational />
38+
Lines changed: 48 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,48 @@
1+
<!---
2+
WARNING
3+
4+
This markdown file has been generated from a script. Please do not edit it directly.
5+
6+
If you want to update the content related to nebius's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/nebius.handlebars`.
7+
8+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
9+
--->
10+
11+
# Nebius
12+
13+
![https://nebius.com/](https://companieslogo.com/img/orig/NBIS_BIG-446495ba.png?t=1729269594)
14+
15+
​Nebius AI is a technology company specializing in AI-centric cloud platforms, offering scalable GPU clusters, managed services, and developer tools designed for intensive AI workloads. Headquartered in Amsterdam, Nebius provides flexible architecture and high-performance infrastructure to support AI model training and inference at any scale.
16+
17+
## Supported tasks
18+
19+
20+
### Chat Completion (LLM)
21+
22+
Find out more about Chat Completion (LLM) [here](../tasks/chat-completion).
23+
24+
<InferenceSnippet
25+
pipeline=text-generation
26+
providersMapping={ {"nebius":{"modelId":"deepseek-ai/DeepSeek-V3-0324","providerModelId":"deepseek-ai/DeepSeek-V3-0324-fast"} } }
27+
conversational />
28+
29+
30+
### Chat Completion (VLM)
31+
32+
Find out more about Chat Completion (VLM) [here](../tasks/chat-completion).
33+
34+
<InferenceSnippet
35+
pipeline=image-text-to-text
36+
providersMapping={ {"nebius":{"modelId":"google/gemma-3-27b-it","providerModelId":"google/gemma-3-27b-it-fast"} } }
37+
conversational />
38+
39+
40+
### Text To Image
41+
42+
Find out more about Text To Image [here](../tasks/text_to_image).
43+
44+
<InferenceSnippet
45+
pipeline=text-to-image
46+
providersMapping={ {"nebius":{"modelId":"black-forest-labs/FLUX.1-dev","providerModelId":"black-forest-labs/flux-dev"} } }
47+
/>
48+
Lines changed: 48 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,48 @@
1+
<!---
2+
WARNING
3+
4+
This markdown file has been generated from a script. Please do not edit it directly.
5+
6+
If you want to update the content related to novita's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/novita.handlebars`.
7+
8+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
9+
--->
10+
11+
# Novita
12+
13+
![https://novita.ai/](https://novita.ai/logo/logo.svg)
14+
15+
​Novita AI is a comprehensive AI cloud platform that provides developers and businesses with access to over 200 APIs for tasks such as image generation, video processing, audio synthesis, and large language models.
16+
17+
## Supported tasks
18+
19+
20+
### Chat Completion (LLM)
21+
22+
Find out more about Chat Completion (LLM) [here](../tasks/chat-completion).
23+
24+
<InferenceSnippet
25+
pipeline=text-generation
26+
providersMapping={ {"novita":{"modelId":"deepseek-ai/DeepSeek-V3-0324","providerModelId":"deepseek/deepseek-v3-0324"} } }
27+
conversational />
28+
29+
30+
### Chat Completion (VLM)
31+
32+
Find out more about Chat Completion (VLM) [here](../tasks/chat-completion).
33+
34+
<InferenceSnippet
35+
pipeline=image-text-to-text
36+
providersMapping={ {"novita":{"modelId":"meta-llama/Llama-4-Scout-17B-16E-Instruct","providerModelId":"meta-llama/llama-4-scout-17b-16e-instruct"} } }
37+
conversational />
38+
39+
40+
### Text To Video
41+
42+
Find out more about Text To Video [here](../tasks/text_to_video).
43+
44+
<InferenceSnippet
45+
pipeline=text-to-video
46+
providersMapping={ {"novita":{"modelId":"Wan-AI/Wan2.1-T2V-14B","providerModelId":"wan-t2v"} } }
47+
/>
48+
Lines changed: 38 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,38 @@
1+
<!---
2+
WARNING
3+
4+
This markdown file has been generated from a script. Please do not edit it directly.
5+
6+
If you want to update the content related to replicate's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/replicate.handlebars`.
7+
8+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
9+
--->
10+
11+
# Replicate
12+
13+
![https://replicate.com/](https://cdn.sanity.io/images/50q6fr1p/production/2542fad4ab944c0f5e1ab7507a3333a2d5f7f464-2626x684.png?auto=format)
14+
15+
Replicate is building tools so all software engineers can use AI as if it were normal software. You should be able to import an image generator the same way you import an npm package. You should be able to customize a model as easily as you can fork something on GitHub.
16+
17+
## Supported tasks
18+
19+
20+
### Text To Image
21+
22+
Find out more about Text To Image [here](../tasks/text_to_image).
23+
24+
<InferenceSnippet
25+
pipeline=text-to-image
26+
providersMapping={ {"replicate":{"modelId":"black-forest-labs/FLUX.1-dev","providerModelId":"black-forest-labs/flux-dev"} } }
27+
/>
28+
29+
30+
### Text To Video
31+
32+
Find out more about Text To Video [here](../tasks/text_to_video).
33+
34+
<InferenceSnippet
35+
pipeline=text-to-video
36+
providersMapping={ {"replicate":{"modelId":"Wan-AI/Wan2.1-T2V-14B","providerModelId":"wavespeedai/wan-2.1-t2v-480p"} } }
37+
/>
38+

0 commit comments

Comments
 (0)