quickfix re. pricing system #1597
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
docs/api-inference/rate-limits.md (Outdated)
> You get charged for every inference request, based on the compute time x price of the underlying hardware.
>
> Serverless API is not meant to be used for heavy production applications. If you need higher rate limits, consider [Inference Endpoints](https://huggingface.co/docs/inference-endpoints) to have dedicated resources.
>
> For instance, a request to [deepseek-ai/DeepSeek-R1](https://huggingface.co/deepseek-ai/DeepSeek-R1) that takes 10 seconds to complete on a GPU machine that costs $0.00012 per second to run, will be billed $0.0012.
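For reference, the billing arithmetic in the quoted example is just compute time multiplied by the hardware's per-second price. A minimal sketch, using the illustrative figures from the example above rather than official rates:

```python
# Worked example of the compute-time pricing described in the quoted doc text.
# The 10-second duration and $0.00012/second rate come from the example above,
# not from an official price sheet.

def billed_amount(compute_seconds: float, price_per_second: float) -> float:
    """Cost of one request: compute time multiplied by the hardware's per-second price."""
    return compute_seconds * price_per_second

print(f"${billed_amount(10, 0.00012):.4f}")  # -> $0.0012
```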
This is slightly confusing - isn't it based more on the inference provider and what they charge? And, specifically for LLMs, more on the basis of the tokens processed and generated?
Maybe we can use a different example here, say Flux, where each generation is X dollars?
(so it's a bit easier to grok)
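For comparison, the two billing models brought up here (per-token pricing for LLMs versus a flat price per generation for an image model like Flux) look roughly like this. A minimal sketch with made-up placeholder prices, not actual provider rates:

```python
# Illustrative comparison of the two pricing models mentioned in this thread.
# All prices below are hypothetical placeholders, not real rates.

def llm_cost(input_tokens: int, output_tokens: int,
             price_per_input_token: float, price_per_output_token: float) -> float:
    """Token-based billing: input and output tokens are priced separately."""
    return input_tokens * price_per_input_token + output_tokens * price_per_output_token

def image_cost(num_generations: int, price_per_generation: float) -> float:
    """Flat per-generation billing, e.g. for an image model."""
    return num_generations * price_per_generation

# Hypothetical figures for illustration only:
print(f"${llm_cost(1_000, 500, 1e-6, 3e-6):.4f}")  # -> $0.0025
print(f"${image_cost(4, 0.05):.2f}")               # -> $0.20
```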
essentially talking about the routed requests here (maybe it's supposed to be somewhere else and I'm confused)
This doc is about HF's own Inference API, not Inference Providers, but I agree it's a tad confusing :)
Let's take an example that actually runs on HF's Inference API then?
yes, do you have one on hand?
"deepseek-ai/DeepSeek-R1-Distill-Qwen-32B" if we still want to ride the deepseek wave
Oops, I just merged vb's suggestion (Flux), but we can add more examples in the future.
Totally fine with Flux! Makes sense to use an example with a fixed cost.
It gets tons of abuse though, and is down quite a lot - I'd recommend BFL Flux instead, which always works 😅
Co-authored-by: Lucain <[email protected]>
Co-authored-by: vb <[email protected]>
Co-authored-by: vb <[email protected]>
Thanks
Great.
The important part is in rate-limits.md.