Inference Providers docs

_List taken from https://github.com/huggingface/hub-docs/pull/1652#issue-2950293503._

### TODO list:
- [x] add link to announcement blog: https://huggingface.co/blog/inference-providers
- [ ] things to explain somewhere
  - [x] "routed request" vs "direct calls"
  - [x] mention it is production ready (wasn't the case with Inference API)
  - [ ] many providers: some are faster, some are cheaper, some are more reliable, some offer more models, some support different tasks, etc.
- [x] billing
  - [x] case "routed request with HF token" vs "routed request with provider token" vs "direct call"
  - [x] case "billing enabled" (Pay as you go) vs "not enabled" (only within free-tier)
  - [x] no extra fee from HF
  - [x] bill to organization 
- [x] website UI
  - [x] search for models (UI): https://huggingface.co/models?inference_provider=fireworks-ai&sort=trending
  - [x] search all models (UI): https://huggingface.co/models?inference_provider=all&sort=trending
  - [x] set custom API key from provider (optional)
  - [x] order providers by preference (apply to widget and code snippets)
  - [x] widgets
  - [x] playground hf.co/playground
- [x] Pages: 1 provider == 1 doc page with 1 description + 1 list of supported tasks + 1 snippet 
- [x] add list of providers on landing page
- [x] Page Hub API: 
  - [x] search for models (API): https://huggingface.co/api/models?inference_provider=fireworks-ai
  - [x] get warm models (any provider): https://huggingface.co/api/models?inference=warm
  - [x] get warm status for a model: https://huggingface.co/api/models/google/gemma-3-27b-it?expand[]=inference
  - [x] get providers for a model: https://huggingface.co/api/models/Qwen/QwQ-32B?expand[]=inferenceProviderMapping
- [x] Page: "how to be registered as a provider"? (https://github.com/huggingface/hub-docs/issues/1664)
  - [x] get in touch!
  - [x] JS client integration
  - [x] Model mapping integration
  - [x] Billing integration => how does it work
  - [x] Python client integration
  - [x] how to test your endpoints?
- [x] tasks page with code snippets / API schema / recommended models / etc. (started in https://github.com/huggingface/hub-docs/pull/1643)
  - [x] add text to video (https://github.com/huggingface/hub-docs/pull/1672)
  - [x] split between "favorite tasks" (LLMs, T2I, T2V, etc.) and "other tasks"
- [x] current pages:
  - [x] index/intro => adapt
  - [x] getting started => removed in favor of intro
  - [x] Supported Models => removed
  - [x] pricing and rate limits => merged with "billing"
  - [x] security / data policy => kept with new wording
- [x] rename https://huggingface.co/docs/api-inference to https://huggingface.co/docs/inference-providers => https://github.com/huggingface/hub-docs/pull/1666
- [ ] mention Inference Endpoints somewhere
- [x] add nice visual of all providers https://github.com/huggingface/hub-docs/pull/1669
- [x] remove legacy headers https://github.com/huggingface/hub-docs/pull/1673
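
For the "Page Hub API" items above, here is a minimal Python sketch that builds the four listed query URLs using only the standard library. The helper names (`models_url`, `model_info_url`, `fetch_json`) are hypothetical and not part of any Hugging Face client. The shape of the JSON responses is not described in this issue, so the sketch does not assume one.

```python
import json
from urllib.parse import quote, urlencode
from urllib.request import urlopen

# Base URL taken from the endpoints listed in this issue.
HUB_API = "https://huggingface.co/api/models"


def models_url(**params: str) -> str:
    """Build a model-search URL (hypothetical helper).

    Example filters from the list: inference_provider="fireworks-ai",
    inference="warm".
    """
    return f"{HUB_API}?{urlencode(params)}"


def model_info_url(model_id: str, expand: str) -> str:
    """Build a single-model URL with one `expand[]` field (hypothetical helper).

    Example fields from the list: "inference", "inferenceProviderMapping".
    """
    # Keep the "/" between the namespace and the model name unescaped.
    return f"{HUB_API}/{quote(model_id, safe='/')}?expand[]={quote(expand)}"


def fetch_json(url: str, timeout: float = 10.0):
    """GET a URL and decode the JSON body (needs network access)."""
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


# Build the URLs only; nothing below touches the network.
print(models_url(inference_provider="fireworks-ai"))
print(models_url(inference="warm"))
print(model_info_url("google/gemma-3-27b-it", "inference"))
print(model_info_url("Qwen/QwQ-32B", "inferenceProviderMapping"))
```

Passing any of the printed URLs to `fetch_json` would perform the actual request. Inspect the returned JSON before relying on specific keys.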

Inference Providers docs #1662
