Skip to content

Commit 5806d38

Browse files
authored
Merge branch 'main' into update-inference-providers-docs-automated-pr
2 parents 2985a78 + a40c87e commit 5806d38

File tree

7 files changed

+145
-50
lines changed

7 files changed

+145
-50
lines changed

docs/inference-providers/_toctree.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -23,6 +23,8 @@
2323
title: Featherless AI
2424
- local: providers/fireworks-ai
2525
title: Fireworks
26+
- local: providers/groq
27+
title: Groq
2628
- local: providers/hyperbolic
2729
title: Hyperbolic
2830
- local: providers/hf-inference

docs/inference-providers/index.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -20,6 +20,7 @@ Here is the complete list of partners integrated with Inference Providers, and t
2020
| [Fal AI](./providers/fal-ai) | | | |||
2121
| [Featherless AI](./providers/featherless-ai) || | | | |
2222
| [Fireworks](./providers/fireworks-ai) ||| | | |
23+
| [Groq](./providers/groq) || | | | |
2324
| [HF Inference](./providers/hf-inference) ||||| |
2425
| [Hyperbolic](./providers/hyperbolic) ||| | | |
2526
| [Nebius](./providers/nebius) ||||| |
Lines changed: 69 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,69 @@
1+
<!---
2+
WARNING
3+
4+
This markdown file has been generated from a script. Please do not edit it directly.
5+
6+
### Template
7+
8+
If you want to update the content related to groq's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/groq.handlebars`.
9+
10+
### Logos
11+
12+
If you want to update groq's logo, upload a file by opening a PR on https://huggingface.co/datasets/huggingface/documentation-images/tree/main/inference-providers/logos. Ping @wauplin and @celinah on the PR to let them know you uploaded a new logo.
13+
Logos must be in .png format and be named `groq-light.png` and `groq-dark.png`. Visit https://huggingface.co/settings/theme to switch between light and dark mode and check that the logos are displayed correctly.
14+
15+
### Generation script
16+
17+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
18+
--->
19+
20+
# Groq
21+
22+
<div class="flex justify-center">
23+
<a href="https://groq.com/" target="_blank">
24+
<img class="block dark:hidden" src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/inference-providers/logos/groq-light.png"/>
25+
<img class="hidden dark:block" src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/inference-providers/logos/groq-dark.png"/>
26+
</a>
27+
</div>
28+
29+
<div class="flex">
30+
<a href="https://huggingface.co/groq" target="_blank">
31+
<img class="block dark:hidden" src="https://huggingface.co/datasets/huggingface/badges/resolve/main/follow-us-on-hf-lg.svg"/>
32+
<img class="hidden dark:block" src="https://huggingface.co/datasets/huggingface/badges/resolve/main/follow-us-on-hf-lg-dark.svg"/>
33+
</a>
34+
</div>
35+
36+
Groq is fast AI inference. Their groundbreaking LPU technology delivers record-setting performance and efficiency for GenAI models. With custom chips specifically designed for AI inference workloads and a deterministic, software-first approach, Groq eliminates the bottlenecks of conventional hardware to enable real-time AI applications with predictable latency and exceptional throughput so developers can build fast.
37+
38+
For latest pricing, visit our [pricing page](https://groq.com/pricing/).
39+
40+
## Resources
41+
- **Website**: https://groq.com/
42+
- **Documentation**: https://console.groq.com/docs
43+
- **Community Forum**: https://community.groq.com/
44+
- **X**: [@GroqInc](https://x.com/GroqInc)
45+
- **LinkedIn**: [Groq](https://www.linkedin.com/company/groq/)
46+
- **YouTube**: [Groq](https://www.youtube.com/@GroqInc)
47+
48+
## Supported tasks
49+
50+
51+
### Chat Completion (LLM)
52+
53+
Find out more about Chat Completion (LLM) [here](../tasks/chat-completion).
54+
55+
<InferenceSnippet
56+
pipeline=text-generation
57+
providersMapping={ {"groq":{"modelId":"Qwen/Qwen3-32B","providerModelId":"qwen/qwen3-32b"} } }
58+
conversational />
59+
60+
61+
### Chat Completion (VLM)
62+
63+
Find out more about Chat Completion (VLM) [here](../tasks/chat-completion).
64+
65+
<InferenceSnippet
66+
pipeline=image-text-to-text
67+
providersMapping={ {"groq":{"modelId":"meta-llama/Llama-4-Scout-17B-16E-Instruct","providerModelId":"meta-llama/llama-4-scout-17b-16e-instruct"} } }
68+
conversational />
69+

docs/inference-providers/tasks/chat-completion.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -25,6 +25,7 @@ This is a subtask of [`text-generation`](https://huggingface.co/docs/inference-p
2525
- [deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B): Smaller variant of one of the most powerful models.
2626
- [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct): Very powerful text generation model trained to follow instructions.
2727
- [microsoft/phi-4](https://huggingface.co/microsoft/phi-4): Powerful text generation model by Microsoft.
28+
- [Qwen/Qwen2.5-7B-Instruct-1M](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct-1M): Strong conversational model that supports very long instructions.
2829
- [Qwen/Qwen2.5-Coder-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct): Text generation model used to write code.
2930
- [deepseek-ai/DeepSeek-R1](https://huggingface.co/deepseek-ai/DeepSeek-R1): Powerful reasoning based open large language model.
3031

docs/inference-providers/tasks/text-generation.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -30,6 +30,7 @@ For more details about the `text-generation` task, check out its [dedicated page
3030
- [deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B): Smaller variant of one of the most powerful models.
3131
- [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct): Very powerful text generation model trained to follow instructions.
3232
- [microsoft/phi-4](https://huggingface.co/microsoft/phi-4): Powerful text generation model by Microsoft.
33+
- [Qwen/Qwen2.5-7B-Instruct-1M](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct-1M): Strong conversational model that supports very long instructions.
3334
- [Qwen/Qwen2.5-Coder-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct): Text generation model used to write code.
3435
- [deepseek-ai/DeepSeek-R1](https://huggingface.co/deepseek-ai/DeepSeek-R1): Powerful reasoning based open large language model.
3536

0 commit comments

Comments
 (0)