[Inference doc] Next gen inference snippets #1643

Wauplin · 2025-03-21T15:13:45Z

How to review

Go to https://moon-ci-docs.huggingface.co/docs/inference-providers/pr_1643/en/tasks/index

Description

Opening this PR as a draft as it'll requires some deeper changes in doc-builder cc @julien-c @gary149 @mishig25 @SBrandeis @hanouticelina

This PR updates the script to generate the tasks pages of the Inference API docs (e.g. text-to-image page). The short-term goal is to revamp the "Inference API" docs as an "Inference Providers" one, making them more provider-centric. With this PR, each task page will have

a Recommended models section => same as today. Recommended models taken from hf.co/tasks + filtered to keep only warm ones
an API specification section => same as today. Specs are taken from the jsonschema specs that we have and rendered in a table.
a Using the API section => this is the revamped one. We want to display snippets for all providers supporting the task, and for all clients for which we have code snippets. To chose which model to highlight, I'm taking for each provider the "live" model with the most "likes30d". My only concern with this strategy is that it might lead to recurrent changes in the docs (which are automated daily).
- In practice, it would be very nice to extend the current <inferencesnippets> tag to have an interface similar to the one on the model page

What I did in this PR is to generate the snippets and serialize them inside some <snippet provider="..." language="..." client="..."> tags. This is where doc-builder should be updated to take them into account.

EDIT: using the new </InferenceSnippet> svelte component from huggingface/doc-builder#549 (thanks @mishig25!)

HuggingFaceDocBuilderDev · 2025-03-21T15:16:38Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

mishig25 · 2025-03-31T07:37:23Z

Handled by huggingface/doc-builder#549

see example: #1656

julien-c · 2025-04-02T10:44:21Z

hmm https://moon-ci-docs.huggingface.co/docs/api-inference/pr_1643/en/index has not been updated no?

Wauplin · 2025-04-02T15:00:39Z

hmm https://moon-ci-docs.huggingface.co/docs/api-inference/pr_1643/en/index has not been updated no?

It's now under https://moon-ci-docs.huggingface.co/docs/inference-providers/pr_1643/en/index

(but in general this PR is not ready, I finishing it up)

Wauplin · 2025-04-02T16:31:35Z

@julien-c @gary149 @hanouticelina @SBrandeis PR is finally ready for review. Best way to review is to go on https://moon-ci-docs.huggingface.co/docs/inference-providers/pr_1643/en/tasks/index

julien-c · 2025-04-02T17:43:52Z

docs/inference-providers/tasks/automatic-speech-recognition.md

+<InferenceSnippet
+    pipeline=automatic-speech-recognition
+    providersMapping={ {"fal-ai":{"modelId":"openai/whisper-large-v3","providerModelId":"fal-ai/whisper"},"hf-inference":{"modelId":"openai/whisper-large-v3-turbo","providerModelId":"openai/whisper-large-v3-turbo"}} }
+/>


so in the end you don't embed the full snippets in the markdown. Correct?

but we still need to regenerate the .md from time to time to pick up the mapping?

Yes exactly. If we want we can still have an .md export for llms but not for now

but we still need to regenerate the .md from time to time to pick up the mapping?

yep, written here

so in the end you don't embed the full snippets in the markdown. Correct?

inference snippets strings are directly coming from https://github.com/huggingface/huggingface.js (single source of truth)

^ I think that statement does not hold anymore - we moved snippets code to moon-landing

I think there is a confusion here 😬

Inference snippets are indeed located in huggingface.js/inference (or I'm not aware of the change) and providers mapping is on the Hub => and pulled from the Hub by the script in this PR to make sure we always get up-to-date models in the docs => done in CI once a day

So all good, right?

Sorry, I've mistaken 'snippets' and 'widgets' - my bad!

julien-c · 2025-04-02T17:51:37Z

docs/inference-providers/tasks/fill-mask.md

-</js>
-
-</inferencesnippet>
+No snippet available for this task.


there's no warm hf-inference for this seminal task? Que pena!!

Wauplin added 3 commits March 20, 2025 17:47

more advanced

d539697

better now? :)

8331942

update

111eca8

Wauplin marked this pull request as draft March 21, 2025 15:13

Wauplin mentioned this pull request Mar 26, 2025

Revamp Inference Providers doc #1652

Merged

43 tasks

mishig25 closed this Mar 31, 2025

mishig25 reopened this Mar 31, 2025

Wauplin mentioned this pull request Apr 1, 2025

Inference Providers docs #1662

Closed

44 tasks

Merge branch 'main' into snipets-next-gen

2129a28

Wauplin added 3 commits April 2, 2025 18:06

big step

cf5d962

better?

a11736a

typo

b28b739

Wauplin marked this pull request as ready for review April 2, 2025 16:30

Wauplin requested review from SBrandeis, gary149, hanouticelina and julien-c April 2, 2025 16:31

julien-c reviewed Apr 2, 2025

View reviewed changes

hanouticelina approved these changes Apr 3, 2025

View reviewed changes

SBrandeis approved these changes Apr 3, 2025

View reviewed changes

Wauplin merged commit 4c01d39 into main Apr 3, 2025
2 checks passed

Wauplin deleted the snippets-next-gen branch April 3, 2025 09:02

Wauplin mentioned this pull request Jun 25, 2025

Add back "Image Text to Text" page #1796

Merged

[Inference doc] Next gen inference snippets #1643

[Inference doc] Next gen inference snippets #1643

Uh oh!

Conversation

Wauplin commented Mar 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

How to review

Description

Uh oh!

HuggingFaceDocBuilderDev commented Mar 21, 2025

Uh oh!

mishig25 commented Mar 31, 2025

Uh oh!

julien-c commented Apr 2, 2025

Uh oh!

Wauplin commented Apr 2, 2025

Uh oh!

Wauplin commented Apr 2, 2025

Uh oh!

julien-c Apr 2, 2025

Choose a reason for hiding this comment

Uh oh!

julien-c Apr 2, 2025

Choose a reason for hiding this comment

Uh oh!

Wauplin Apr 2, 2025

Choose a reason for hiding this comment

Uh oh!

mishig25 Apr 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mishig25 Apr 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SBrandeis Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

Wauplin Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

SBrandeis Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

julien-c Apr 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Wauplin commented Mar 21, 2025 •

edited

Loading

mishig25 Apr 2, 2025 •

edited

Loading

mishig25 Apr 2, 2025 •

edited

Loading