You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/inference-providers/pricing.md
+12-43Lines changed: 12 additions & 43 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,62 +1,31 @@
1
1
# Pricing and Billing
2
2
3
-
Access 200+ models from leading AI inference providers with centralized, transparent, pay-as-you-go pricing. No infrastructure management required—just pay for what you use, with no markup from Hugging Face.
3
+
Inference Providers is a production-ready service involving external partners and is therefore a paid product. However, as a Hugging Face user, you get monthly credits to run experiments. The amount of credits you get depends on your type of account:
4
4
5
-
## Free Credits to Get Started
6
-
7
-
Every Hugging Face user receives monthly credits to experiment with Inference Providers:
| Enterprise Hub Organizations | Per-seat access |**$2.00 per seat**|
14
-
15
-
<Tip>
16
-
17
-
Your monthly credits automatically apply when you route requests through Hugging Face. For Enterprise organizations, credits are shared among all members.
18
-
19
-
</Tip>
20
-
21
-
## How Billing Works: Choose Your Approach
22
-
23
-
Inference Providers offers flexibility in how you're billed. Understanding these options upfront helps you choose the best approach for your needs:
24
-
25
-
| Feature |**Routed by Hugging Face**|**Custom Provider Key**|**Direct Calls**|
26
-
| :--- | :--- | :--- | :--- |
27
-
|**How it Works**| Your request routes through HF to the provider | You set a custom provider key in HF settings | You provide the provider key directly in your code |
28
-
|**Billing**| Pay-as-you-go on your HF account | Billed directly by the provider | Billed directly by the provider |
| Free Users | subject to change, less than $0.10 |
8
+
| PRO Users | $2.00 |
9
+
| Enterprise Hub Organizations | $2.00 per seat, shared among members |
41
10
42
11
To benefit from Enterprise Hub included credits, you need to explicitly specify the organization to be billed when performing the inference requests.
43
12
See the [Organization Billing section](#organization-billing) below for more details.
44
13
45
-
**PRO users and Enterprise Hub organizations** can continue using the API after exhausting their monthly credits. This ensures uninterrupted access to models for production workloads.
14
+
## Pay-as-you-Go
46
15
47
-
**Current Status**: Pay-as-you-Gois available for providers integrated with our billing system. We're actively integrating remaining providers—those not yet integrated will be blocked once free credits are exhausted.
16
+
**PRO users and Enterprise Hub organizations** can continue using the API once their monthly included credits are exhausted. This billing model, known as "Pay-as-you-Go" (PAYG), is charged on top of the monthly subscription. PAYG is only available for providers that are integrated with our billing system. We're actively working to integrate all providers, but in the meantime, any providers that are not yet integrated will be blocked once the free-tier limit is reached.
48
17
49
18
If you have remaining credits, we estimate costs for providers that aren’t fully integrated with our billing system. These estimates are usually higher than the actual cost to prevent abuse, which is why PAYG is currently disabled for those providers.
50
19
20
+
You can track your spending on your [billing page](https://huggingface.co/settings/billing).
21
+
51
22
<Tip>
52
23
53
24
Hugging Face charges you the same rates as the provider, with no additional fees. We just pass through the provider costs directly.
54
25
55
26
</Tip>
56
27
57
-
You can track your spending anytime on your [billing page](https://huggingface.co/settings/billing).
58
-
59
-
## Routed Requests vs Direct Calls (Detailed Comparison)
28
+
## Routed requests vs direct calls
60
29
61
30
The documentation above assumes you are making routed requests to external providers. In practice, there are 3 different ways to run inference, each with unique billing implications:
62
31
@@ -82,7 +51,7 @@ For instance, a request to [black-forest-labs/FLUX.1-dev](https://huggingface.co
82
51
83
52
The `"hf-inference"` provider is currently the default provider when working with the JavaScript and Python SDKs. Note that this default might change in the future.
84
53
85
-
## Billing forEnterprise Hub organizations
54
+
## Organization billing
86
55
87
56
For Enterprise Hub organizations, it is possible to centralize billing for all of your users. Each user still uses their own User Access Token but the requests are billed to your organization. This can be done by passing `"X-HF-Bill-To: my-org-name"` as a header in your HTTP requests.
0 commit comments