wip

taras-yemets · taras-yemets · commit 3551c9d19c2e · 2024-11-04T14:23:34.000+02:00
diff --git a/docs/cody/model-configuration/examples.mdx b/docs/cody/model-configuration/examples.mdx
@@ -417,15 +417,79 @@ In the configuration above, we:
 
 </Accordion>
 
-<Accordion title="Google Vertex (Gemini)"></Accordion>
+<Accordion title="Google Vertex (Gemini)">
+
+TODO
+
+</Accordion>
 
 <Accordion title="Google Vertex (public)">
 
 TODO
 
 </Accordion>
 
-<Accordion title="AWS Bedrock"></Accordion>
+<Accordion title="AWS Bedrock">
+
+```json
+"cody.enabled": true,
+"modelConfiguration": {
+  "sourcegraph": null,
+  "providerOverrides": [
+      {
+          "id": "aws-bedrock",
+          "displayName": "AWS Bedrock",
+          "serverSideConfig": {
+              "type": "awsBedrock",
+              "accessToken": "token",
+              "endpoint": "us-west-2",
+              "region": "us-west-2"
+          }
+      }
+  ],
+  "modelOverrides": [
+      {
+          "modelRef": "aws-bedrock::2023-06-01::claude-3-opus",
+          "displayName": "Claude 3 Opus (AWS Bedrock)",
+          "modelName": "anthropic.claude-3-opus-20240229-v1:0",
+          "capabilities": ["edit", "chat", "autocomplete"],
+          "category": "other",
+          "status": "stable",
+          "tier": "pro",
+          "contextWindow": {
+              "maxInputTokens": 45000,
+              "maxOutputTokens": 4000
+          }
+      }
+  ],
+  "defaultModels": {
+      "chat": "aws-bedrock::2023-06-01::claude-3-opus",
+      "fastChat": "aws-bedrock::2023-06-01::claude-3-opus",
+      "autocomplete": "aws-bedrock::2023-06-01::claude-3-opus",
+  }
+}
+```
+
+In the configuration described above, we:
+
+-   Set up a provider override for AWS Bedrock, routing requests for this provider directly to the specified endpoint,
+    bypassing Cody Gateway.
+-   Add the `"aws-bedrock::2023-06-01::claude-3-opus"` model, which is used for all Cody features.
+    We do not add other models for simplicity, as adding multiple models is already covered in the examples above.
+
+Provider override `serverSideConfig` fields:
+
+| Field         | Description                                                                                                                                                                                                                                                                                                     |
+| ------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
+| `type`        | Must be `"awsBedrock"`.                                                                                                                                                                                                                                                                                         |
+| `accessToken` | Leave empty to rely on instance role bindings or other AWS configurations in the frontend service. Use `<ACCESS_KEY_ID>:<SECRET_ACCESS_KEY>` for direct credential configuration, or `<ACCESS_KEY_ID>:<SECRET_ACCESS_KEY>:<SESSION_TOKEN>` if a session token is also required.                                 |
+| `endpoint`    | For Pay-as-you-go, set it to an AWS region code (e.g., `us-west-2`) when using a public Amazon Bedrock endpoint. For Provisioned Throughput, set it to the provisioned VPC endpoint for the bedrock-runtime API (e.g., `https://vpce-0a10b2345cd67e89f-abc0defg.bedrock-runtime.us-west-2.vpce.amazonaws.com`). |
+| `region`      | The region to use when configuring API clients. This is necessary because the 'frontend' binary's container cannot access environment variables from the host OS.                                                                                                                                               |
+
+Provisioned throughput for AWS Bedrock models can be configured using the `"awsBedrockProvisionedThroughput"` server-side
+configuration type. For more details, refer to the [Model Overrides](/cody/model-configuration#model-overrides) section.
+
+</Accordion>
 
 ## Self-hosted models
 
diff --git a/docs/cody/model-configuration/index.mdx b/docs/cody/model-configuration/index.mdx
@@ -230,7 +230,7 @@ This field is an array of items, each with the following fields:
     It includes two fields:
     -   `maxInputTokens` - Specifies the maximum number of tokens for the contextual data in the prompt (e.g., question, relevant snippets).
     -   `maxOutputTokens` - Specifies the maximum number of tokens allowed in the response.
--   `serverSideConfig` - Additional configuration for the model. The available fields include:
+-   `serverSideConfig` - Additional configuration for the model. Can be one of the following:
 
     -   `awsBedrockProvisionedThroughput` - Specifies provisioned throughput settings for AWS Bedrock models, with the following fields: