
Conversation

hanouticelina (Contributor) commented:

2. [Inference Endpoints](https://huggingface.co/docs/inference-endpoints/index): a product to easily deploy models to production. Inference is run by Hugging Face in a dedicated, fully managed infrastructure on a cloud provider of your choice (see the sketch just after this list).
3. Local endpoints: you can also run inference with local inference servers like [llama.cpp](https://github.com/ggerganov/llama.cpp), [Ollama](https://ollama.com/), [vLLM](https://github.com/vllm-project/vllm), [LiteLLM](https://docs.litellm.ai/docs/simple_proxy), or [Text Generation Inference (TGI)](https://github.com/huggingface/text-generation-inference) by connecting the client to these local endpoints.
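
For option 2, the client can be pointed directly at a deployed endpoint. Below is a minimal sketch assuming a hypothetical token and endpoint URL (both are placeholders to replace with your own deployment):

```typescript
import { InferenceClient } from "@huggingface/inference";

// Placeholder token and endpoint URL: substitute your own deployment.
const client = new InferenceClient("hf_xxx").endpoint(
  "https://my-endpoint.us-east-1.aws.endpoints.huggingface.cloud"
);

const out = await client.chatCompletion({
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(out.choices[0].message.content);
```

The same pattern works for option 3: swap in your local server's URL.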

You can also try out a live [interactive notebook](https://observablehq.com/@huggingface/hello-huggingface-js-inference), see some demos on [hf.co/huggingfacejs](https://huggingface.co/huggingfacejs), or watch a [Scrimba tutorial that explains how Inference Endpoints works](https://scrimba.com/scrim/cod8248f5adfd6e129582c523).
hanouticelina (Contributor, Author) replied:


I removed the broken/outdated demos link.

This task reads some text and outputs raw float values that are usually consumed as part of a semantic database or semantic search.
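
As a minimal sketch with `InferenceClient` (the token and model name are placeholders; any feature-extraction model works):

```typescript
import { InferenceClient } from "@huggingface/inference";

const hf = new InferenceClient("hf_xxx"); // placeholder HF token

// Returns raw float values (an embedding) for the input text.
const embedding = await hf.featureExtraction({
  model: "sentence-transformers/all-MiniLM-L6-v2", // example model
  inputs: "That is a happy person",
});

console.log(embedding);
```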

```typescript
import { InferenceClient } from "@huggingface/inference";
const openai = new InferenceClient(OPENAI_TOKEN).endpoint("https://api.openai.com");
```
hanouticelina (Contributor, Author) replied:


No need for this IMO + I added a tip at line 635.

@HuggingFaceDocBuilderDev commented:

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@julien-c (Member) left a comment:


looks great!

Comment on lines +612 to +639
## Using local endpoints

You can use `InferenceClient` to run chat completion with local inference servers (llama.cpp, vLLM, LiteLLM server, TGI, MLX, etc.) running on your own machine, as long as the server exposes an OpenAI API-compatible API.

```typescript
import { InferenceClient } from '@huggingface/inference';

const hf = new InferenceClient(undefined, {
  endpointUrl: "http://localhost:8080",
});

const response = await hf.chatCompletion({
  messages: [
    {
      role: "user",
      content: "What is the capital of France?",
    },
  ],
});

console.log(response.choices[0].message.content);
```

<Tip>

Similarly to the OpenAI JS client, `InferenceClient` can be used to run Chat Completion inference with any OpenAI REST API-compatible endpoint.

</Tip>
A reviewer (Member) commented:


looks great

@SBrandeis (Contributor) left a comment:


very nice improvement !!!

@hanouticelina merged commit 47c3ccb into main on May 16, 2025, with 6 checks passed, and deleted the update-inference-docs branch at 10:24 the same day.
@Wauplin (Contributor) left a comment:


🔥
