Cerebras provider

daisyfaithauma · daisyfaithauma · commit 346734e96792 · 2025-02-04T15:24:55.000Z
diff --git a/src/content/docs/ai-gateway/providers/cerebras.mdx b/src/content/docs/ai-gateway/providers/cerebras.mdx
@@ -0,0 +1,61 @@
+---
+title: Cerebras
+pcx_content_type: get-started
+---
+
+[Cerebras](https://inference-docs.cerebras.ai/) offers developers a low-latency solution for AI model inference.
+
+## Endpoint
+
+```txt
+https://gateway.ai.cloudflare.com/v1/{account_id}/{gateway_id}/cerebras-ai
+```
+
+## Prerequisites
+
+When making requests to Cerebras, ensure you have the following:
+
+- Your AI Gateway Account ID (`{account_id}` in the endpoint).
+- Your AI Gateway gateway name (`{gateway_id}` in the endpoint).
+- An active Cerebras API token.
+- The name of the Cerebras model you want to use.
+
+## Examples
+
+### cURL
+
+```bash title="Example cURL request"
+curl -X POST https://gateway.ai.cloudflare.com/v1/ACCOUNT_TAG/GATEWAY/cerebras/chat/completions \
+ --header 'content-type: application/json' \
+ --header 'Authorization: Bearer CEREBRAS_TOKEN' \
+ --data '{
+    "model": "llama3.1-8b",
+    "messages": [
+        {
+            "role": "user",
+            "content": "What is Cloudflare?"
+        }
+    ]
+}'
+```
+
+### Use the Cerebras Cloud SDK with JavaScript
+
+```js title="JavaScript"
+import Cerebras from "@cerebras/cerebras_cloud_sdk";
+
+const client = new Cerebras({
+	apiKey: process.env["CEREBRAS_API_KEY"], // This is the default and can be omitted
+});
+
+async function main() {
+	const completionCreateResponse = await client.chat.completions.create({
+		messages: [{ role: "user", content: "Why is fast inference important?" }],
+		model: "llama3.1-8b",
+	});
+
+	console.log(completionCreateResponse);
+}
+
+main();
+```
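
A possible companion to the cURL example in this page is the same request expressed with `fetch`, which makes the gateway URL explicit (the SDK example above sets only `apiKey`). The following is a minimal sketch, not part of the commit. The helper names `buildCerebrasRequest` and `chat` are hypothetical, and the `cerebras` path segment is copied from the cURL example; the Endpoint section shows `cerebras-ai` instead, so the correct segment should be confirmed before relying on this.

```javascript
// Sketch only: mirrors the cURL example as a fetch request.
// Helper names are hypothetical and not part of any SDK.
const GATEWAY_BASE = "https://gateway.ai.cloudflare.com/v1";

// Builds the URL and fetch options without sending anything.
function buildCerebrasRequest({ accountId, gatewayId, token, model, prompt }) {
	// Assumption: the "cerebras" segment follows the cURL example in this page.
	const url = `${GATEWAY_BASE}/${accountId}/${gatewayId}/cerebras/chat/completions`;
	const init = {
		method: "POST",
		headers: {
			"content-type": "application/json",
			Authorization: `Bearer ${token}`,
		},
		body: JSON.stringify({
			model,
			messages: [{ role: "user", content: prompt }],
		}),
	};
	return { url, init };
}

// Sends the request and returns the parsed JSON body.
// Requires a runtime with a global fetch (for example Node.js 18 or later).
async function chat(params) {
	const { url, init } = buildCerebrasRequest(params);
	const response = await fetch(url, init);
	if (!response.ok) {
		throw new Error(`Gateway request failed with status ${response.status}`);
	}
	return response.json();
}
```

Calling `chat({ accountId: "ACCOUNT_TAG", gatewayId: "GATEWAY", token: "CEREBRAS_TOKEN", model: "llama3.1-8b", prompt: "What is Cloudflare?" })` would send the same body as the cURL example. Keeping request construction separate from sending lets the URL and headers be inspected without a network call.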