@Wauplin (Contributor) commented on Jun 4, 2025

Follow-up PR after huggingface/huggingface.js#1514

SBrandeis added a commit to huggingface/huggingface.js that referenced this pull request on Jun 4, 2025:
Solve #1361.

Long-awaited feature for @gary149. I did not go for the cleanest
solution, but it works well and should be robust and flexible enough if
we need to fix something in the future.

## EDIT: breaking change

The access token must now be passed as `opts.accessToken` in `snippets.getInferenceSnippets`.
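As a rough sketch of the migration (everything here except the `accessToken` field name is a hypothetical stand-in; check the PR diff for the real signature and types):

```typescript
// Sketch only: `GetInferenceSnippetsOpts` is a hypothetical stand-in for the
// real options type; only the `accessToken` field name comes from this PR.
interface GetInferenceSnippetsOpts {
  accessToken?: string;
}

// Before (sketch): the token was a dedicated argument.
//   getInferenceSnippets(model, accessToken, ...)
// After (sketch): the token travels in the options bag.
//   getInferenceSnippets(model, ..., { accessToken })
function makeOpts(token: string | undefined): GetInferenceSnippetsOpts {
  // Omit the field entirely when no token is available, so generated
  // snippets can fall back to a placeholder instead of embedding `undefined`.
  return token ? { accessToken: token } : {};
}

console.log(JSON.stringify(makeOpts("hf_xxx"))); // {"accessToken":"hf_xxx"}
console.log(JSON.stringify(makeOpts(undefined))); // {}
```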

## TODO

Once merged:
- [ ] adapt in moon-landing for snippets on the model page huggingface-internal/moon-landing#13964
- [ ] adapt in doc-builder for the `<inferencesnippet>` HTML tag (used in hub-docs) huggingface/doc-builder#570
- [ ] update hardcoded examples in hub-docs huggingface/hub-docs#1764

## Some examples:

### JS client
```js
import { InferenceClient } from "@huggingface/inference";

const client = new InferenceClient(process.env.HF_TOKEN);

const chatCompletion = await client.chatCompletion({
    provider: "hf-inference",
    model: "meta-llama/Llama-3.1-8B-Instruct",
    messages: [
        {
            role: "user",
            content: "What is the capital of France?",
        },
    ],
});

console.log(chatCompletion.choices[0].message);
```

### Python client
```py
import os
from huggingface_hub import InferenceClient

client = InferenceClient(
    provider="hf-inference",
    api_key=os.environ["HF_TOKEN"],
)

completion = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",
    messages=[
        {
            "role": "user",
            "content": "What is the capital of France?"
        }
    ],
)

print(completion.choices[0].message)
```

### OpenAI client
```py
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://router.huggingface.co/hf-inference/models/meta-llama/Llama-3.1-8B-Instruct/v1",
    api_key=os.environ["HF_TOKEN"],
)

completion = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",
    messages=[
        {
            "role": "user",
            "content": "What is the capital of France?"
        }
    ],
)

print(completion.choices[0].message)
```

### curl
```sh
curl https://router.huggingface.co/hf-inference/models/meta-llama/Llama-3.1-8B-Instruct/v1/chat/completions \
    -H "Authorization: Bearer $HF_TOKEN" \
    -H 'Content-Type: application/json' \
    -d '{
        "messages": [
            {
                "role": "user",
                "content": "What is the capital of France?"
            }
        ],
        "model": "meta-llama/Llama-3.1-8B-Instruct",
        "stream": false
    }'
```

### Check out the PR diff for more examples

---------

Co-authored-by: Simon Brandeis <[email protected]>
Wauplin marked this pull request as ready for review on June 4, 2025 12:22
@Wauplin (Author) commented on Jun 4, 2025

Let's merge now that @huggingface/inference is shipped :)

Wauplin merged commit 0967fa8 into main on Jun 4, 2025 (5 checks passed)
Wauplin deleted the access-token-from-env-in-snippets branch on June 4, 2025 12:26