Skip to content

Commit 7927491

Browse files
authored
Merge pull request #8910 from burtenshaw/update-hf-inference
docs: update hugging face inference
2 parents 6e9c43a + 0d645fc commit 7927491

File tree

4 files changed

+91
-43
lines changed

4 files changed

+91
-43
lines changed

docs/customize/model-providers/more/huggingfaceinferenceapi.mdx

Lines changed: 0 additions & 41 deletions
This file was deleted.

docs/customize/model-providers/overview.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -20,6 +20,7 @@ These are the most commonly used model providers that offer a wide range of capa
2020
| [xAI](/customize/model-providers/more/xAI) | Grok models from xAI | Chat, Edit, Apply |
2121
| [Vertex AI](/customize/model-providers/top-level/vertexai) | Google Cloud's machine learning platform | Chat, Edit, Apply, Embeddings |
2222
| [Inception](/customize/model-providers/top-level/inception) | On-premises open-source model runners | Chat, Edit, Apply |
23+
| [HuggingFace](/customize/model-providers/top-level/huggingfaceinference) | Platform for open source models with inference providers and endpoints | Chat, Edit, Apply, Embeddings |
2324

2425
## Additional Model Providers
2526

@@ -37,7 +38,6 @@ Beyond the top-level providers, Continue supports many other options:
3738
| [Cohere](/customize/model-providers/more/cohere) | Models specialized for semantic search and text generation |
3839
| [NVIDIA](/customize/model-providers/more/nvidia) | GPU-accelerated model hosting |
3940
| [Cloudflare](/customize/model-providers/more/cloudflare) | Edge-based AI inference services |
40-
| [HuggingFace](/customize/model-providers/more/huggingfaceinferenceapi) | Platform for open source models |
4141

4242
### Local Model Options
4343

Lines changed: 89 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,89 @@
1+
---
2+
title: "Hugging Face"
3+
---
4+
5+
Hugging Face is the main platform for sharing open AI models. It provides inference in two ways. [Inference Providers](https://huggingface.co/docs/inference-providers/index) and [Inference Endpoints](https://huggingface.co/docs/inference-endpoints/index).
6+
7+
## Inference Providers
8+
9+
Inference Providers is a serverless service powered by external inference providers and routed through Hugging Face and paid per token.
10+
11+
<Info>
12+
13+
You can access your access token from [Hugging Face](https://huggingface.co/settings/tokens) and prioritize your [providers in settings](https://huggingface.co/settings/inference-providers/overview).
14+
15+
</Info>
16+
17+
<Tabs>
18+
<Tab title="YAML">
19+
```yaml title="config.yaml"
20+
name: My Config
21+
version: 0.0.1
22+
schema: v1
23+
24+
models:
25+
- name: deepseek
26+
provider: huggingface-inference-providers
27+
model: deepseek-ai/DeepSeek-V3.2-Exp
28+
apiKey: <YOUR_HF_TOKEN>
29+
apiBase: https://router.huggingface.co/v1
30+
```
31+
</Tab>
32+
<Tab title="JSON (Deprecated)">
33+
```json title="config.json"
34+
{
35+
"models": [
36+
{
37+
"title": "deepseek",
38+
"provider": "huggingface-inference-providers",
39+
"model": "deepseek-ai/DeepSeek-V3.2-Exp",
40+
"apiKey": "<YOUR_HF_TOKEN>",
41+
"apiBase": "https://router.huggingface.co/v1"
42+
}
43+
]
44+
}
45+
```
46+
</Tab>
47+
</Tabs>
48+
49+
## Inference Endpoints
50+
51+
Inference Endpoints is a dedicated service that allows you to run your open models dedicated hardware. It is a more advanced way to get inference from Hugging Face models where you have more control over the whole process.
52+
53+
<Info>
54+
55+
Before you can use Inference Endpoints, you need to create an endpoint. You can do this by going to [Inference Endpoints](https://endpoints.huggingface.co/burtenshaw/endpoints/dedicated) and clicking on "Create Endpoint".
56+
57+
</Info>
58+
59+
<Tabs>
60+
<Tab title="YAML">
61+
```yaml title="config.yaml"
62+
name: My Config
63+
version: 0.0.1
64+
schema: v1
65+
66+
models:
67+
- name: deepseek
68+
provider: huggingface-inference-endpoints
69+
model: <ENDPOINT_ID>
70+
apiKey: <YOUR_HF_TOKEN>
71+
apiBase: https://<YOUR_ENDPOINT_ID>.aws.endpoints.huggingface.cloud
72+
```
73+
</Tab>
74+
<Tab title="JSON (Deprecated)">
75+
```json title="config.json"
76+
{
77+
"models": [
78+
{
79+
"title": "deepseek",
80+
"provider": "huggingface-inference-endpoints",
81+
"model": "<ENDPOINT_ID>",
82+
"apiKey": "<YOUR_HF_TOKEN>",
83+
"apiBase": "https://<YOUR_ENDPOINT_ID>.aws.endpoints.huggingface.cloud"
84+
}
85+
]
86+
}
87+
```
88+
</Tab>
89+
</Tabs>

docs/docs.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -184,6 +184,7 @@
184184
"customize/model-providers/top-level/azure",
185185
"customize/model-providers/top-level/bedrock",
186186
"customize/model-providers/top-level/gemini",
187+
"customize/model-providers/top-level/huggingfaceinference",
187188
"customize/model-providers/top-level/inception",
188189
"customize/model-providers/top-level/lmstudio",
189190
"customize/model-providers/top-level/ollama",
@@ -200,7 +201,6 @@
200201
"customize/model-providers/more/deepseek",
201202
"customize/model-providers/more/deepinfra",
202203
"customize/model-providers/more/groq",
203-
"customize/model-providers/more/huggingfaceinferenceapi",
204204
"customize/model-providers/more/llamacpp",
205205
"customize/model-providers/more/llamastack",
206206
"customize/model-providers/more/mistral",

0 commit comments

Comments
 (0)