AI Gateway offers multiple endpoints for each Gateway you create: one endpoint per provider, and one Universal Endpoint. The Universal Endpoint requires some adjustments to your schema but supports additional features, such as retrying a failed request or configuring a [fallback model/provider](/ai-gateway/configuration/fallbacks/).
You can use the Universal endpoint to contact every provider. The payload is expected to be an array of messages, and each message is an object with the following parameters:
@@ -25,17 +23,17 @@ You can use the Universal endpoint to contact every provider. The payload is exp
- `authorization`: the content of the Authorization HTTP Header that should be used when contacting this provider. This usually starts with “Token” or “Bearer”.
- `query`: the payload as the provider expects it in their official API.
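As a rough illustration of the shape described above, the following sketch assembles one entry of the payload array. Only `authorization` and `query` come from the list above; the `provider` and `endpoint` keys, the helper name `buildMessage`, the model name, and the token placeholder are assumptions made for illustration and should be checked against the provider's official API.

```javascript
// Hypothetical helper: builds one entry of the Universal Endpoint payload array.
// `authorization` and `query` are the parameters listed above; `provider` and
// `endpoint` are assumed here and are not shown in this excerpt.
function buildMessage({ provider, endpoint, token, query }) {
  if (typeof query !== "object" || query === null) {
    throw new TypeError("query must be the provider's request body as an object");
  }
  return {
    provider,
    endpoint,
    // Usually starts with "Token" or "Bearer", depending on the provider.
    authorization: `Bearer ${token}`,
    // Passed through unchanged, in the shape the provider's own API expects.
    query,
  };
}

const payload = [
  buildMessage({
    provider: "openai", // assumed provider identifier
    endpoint: "chat/completions", // assumed endpoint path
    token: "YOUR_OPENAI_TOKEN", // placeholder, not a real credential
    query: {
      model: "gpt-4o-mini", // placeholder model name
      messages: [{ role: "user", content: "What is Cloudflare?" }],
    },
  }),
];

console.log(JSON.stringify(payload, null, 2));
```

Keeping `query` untouched is the point of the design: the gateway-specific fields wrap the provider's native request body rather than replacing it.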
-## Example
+## cURL example
<Render file="universal-gateway-example" />
The above will send a request to the Workers AI Inference API; if that request fails, it will proceed to OpenAI. You can add as many fallbacks as you need by adding another JSON object to the array.
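The fallback order can be pictured with a small local simulation. This is only a sketch of the ordering logic, not the gateway's implementation: `firstSuccessful` and `fakeSend` are hypothetical names, the `provider` key and its values are assumed, and the failure is faked rather than coming from a real provider.

```javascript
// Hypothetical local model of the fallback order: try each entry of the
// array in turn and return the first one that succeeds.
function firstSuccessful(steps, trySend) {
  const errors = [];
  for (const step of steps) {
    const result = trySend(step);
    if (result.ok) {
      return { provider: step.provider, body: result.body, errors };
    }
    errors.push(`${step.provider}: ${result.error}`);
  }
  throw new Error(`all providers failed: ${errors.join("; ")}`);
}

// Two entries in the order described above (assumed `provider` values).
const steps = [
  { provider: "workers-ai", query: { prompt: "What is Cloudflare?" } },
  { provider: "openai", query: { messages: [{ role: "user", content: "What is Cloudflare?" }] } },
];

// Fake transport: pretend the first provider is unavailable.
const fakeSend = (step) =>
  step.provider === "workers-ai"
    ? { ok: false, error: "simulated failure" }
    : { ok: true, body: "simulated response" };

const outcome = firstSuccessful(steps, fakeSend);
console.log(outcome.provider); // the entry that ended up serving the request
```

Because the order of the array is the order of the attempts, the preferred provider goes first and each later object is only consulted when every earlier one has failed.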
-## Websockets API <Badge text="beta" variant="tip" size="small" />
+## WebSockets API <Badge text="beta" variant="tip" size="small" />
The Universal Endpoint can also be accessed via a [WebSockets API](/ai-gateway/configuration/websockets-api/) which provides a single persistent connection, enabling continuous communication. This API supports all AI providers connected to AI Gateway, including those that do not natively support WebSockets.
-## Example request
+## WebSockets example
```javascript
import WebSocket from "ws";
@@ -87,7 +85,7 @@ Since the same settings can be configured in multiple locations, AI Gateway appl
This hierarchy ensures consistent behavior, prioritizing the most specific configurations. Use provider-level and request-level headers for fine-tuned control, and gateway settings for general defaults.
-## Example
+## Hierarchy example
This example demonstrates how headers set at different levels impact caching behavior: