You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Inference Providers offers a fast and simple way to explore thousands of models for a variety of tasks. Whether you're experimenting with ML capabilities or building a new application, this API gives you instant access to high-performing models across multiple domains:
@@ -28,14 +44,6 @@ Inference Providers offers a fast and simple way to explore thousands of models
28
44
-**🔧 Developer-Friendly**: Simple requests, fast responses, and a consistent developer experience across Python and JavaScript clients.
29
45
-**💰 Cost-Effective**: No extra markup on provider rates.
30
46
31
-
## Partners
32
-
33
-
Here is the complete list of partners integrated with Inference Providers, and the supported tasks for each of them:
To get started quickly with [Chat Completion models](http://huggingface.co/models?inference_provider=all&sort=trending&other=conversational), use the [Inference Playground](https://huggingface.co/playground) to easily test and compare models with your prompts.
This markdown file has been generated from a script. Please do not edit it directly.
5
+
6
+
If you want to update the content related to cerebras's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/cerebras.handlebars`.
7
+
8
+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
Cerebras stands alone as the world’s fastest AI inference and training platform. Organizations across fields like medical research, cryptography, energy, and agentic AI use our CS-2 and CS-3 systems to build on-premise supercomputers, while developers and enterprises everywhere can access the power of Cerebras through our pay-as-you-go cloud offerings.
16
+
17
+
## Supported tasks
18
+
19
+
20
+
### Chat Completion (LLM)
21
+
22
+
Find out more about Chat Completion (LLM) [here](../tasks/chat-completion).
This markdown file has been generated from a script. Please do not edit it directly.
5
+
6
+
If you want to update the content related to fal-ai's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/fal-ai.handlebars`.
7
+
8
+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
Founded in 2021 by Burkay Gur and Gorkem Yurtseven, fal.ai was born out of a shared passion for AI and a desire to address the challenges in AI infrastructure observed during their tenures at Coinbase and Amazon.
16
+
17
+
## Supported tasks
18
+
19
+
20
+
### Automatic Speech Recognition
21
+
22
+
Find out more about Automatic Speech Recognition [here](../tasks/automatic_speech_recognition).
This markdown file has been generated from a script. Please do not edit it directly.
5
+
6
+
If you want to update the content related to fireworks-ai's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/fireworks-ai.handlebars`.
7
+
8
+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
Fireworks AI is a developer-centric platform that delivers high-performance generative AI solutions, enabling efficient deployment and fine-tuning of large language models (LLMs) and image models.
16
+
## Supported tasks
17
+
18
+
19
+
### Chat Completion (LLM)
20
+
21
+
Find out more about Chat Completion (LLM) [here](../tasks/chat-completion).
This markdown file has been generated from a script. Please do not edit it directly.
5
+
6
+
If you want to update the content related to hyperbolic's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/hyperbolic.handlebars`.
7
+
8
+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
This markdown file has been generated from a script. Please do not edit it directly.
5
+
6
+
If you want to update the content related to nebius's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/nebius.handlebars`.
7
+
8
+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
Nebius AI is a technology company specializing in AI-centric cloud platforms, offering scalable GPU clusters, managed services, and developer tools designed for intensive AI workloads. Headquartered in Amsterdam, Nebius provides flexible architecture and high-performance infrastructure to support AI model training and inference at any scale.
16
+
17
+
## Supported tasks
18
+
19
+
20
+
### Chat Completion (LLM)
21
+
22
+
Find out more about Chat Completion (LLM) [here](../tasks/chat-completion).
This markdown file has been generated from a script. Please do not edit it directly.
5
+
6
+
If you want to update the content related to novita's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/novita.handlebars`.
7
+
8
+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
Novita AI is a comprehensive AI cloud platform that provides developers and businesses with access to over 200 APIs for tasks such as image generation, video processing, audio synthesis, and large language models.
16
+
17
+
## Supported tasks
18
+
19
+
20
+
### Chat Completion (LLM)
21
+
22
+
Find out more about Chat Completion (LLM) [here](../tasks/chat-completion).
This markdown file has been generated from a script. Please do not edit it directly.
5
+
6
+
If you want to update the content related to replicate's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/replicate.handlebars`.
7
+
8
+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
Replicate is building tools so all software engineers can use AI as if it were normal software. You should be able to import an image generator the same way you import an npm package. You should be able to customize a model as easily as you can fork something on GitHub.
16
+
17
+
## Supported tasks
18
+
19
+
20
+
### Text To Image
21
+
22
+
Find out more about Text To Image [here](../tasks/text_to_image).
0 commit comments