Skip to content

Commit 79ccef5

Browse files
committed
Set bedrock embed model batch size (#905)
1 parent c450584 commit 79ccef5

File tree

2 files changed

+3
-1
lines changed

2 files changed

+3
-1
lines changed

services/api/src/owl/configs/oss.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -237,7 +237,7 @@ def azure_ai_api_key_plain(self) -> str:
237237

238238
@property
239239
def bedrock_api_key_plain(self) -> str:
240-
return self.azure_ai_api_key.get_secret_value()
240+
return self.bedrock_api_key.get_secret_value()
241241

242242
@property
243243
def cerebras_api_key_plain(self) -> str:

services/api/src/owl/utils/lm.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1285,6 +1285,8 @@ async def embedding(
12851285
else:
12861286
hyperparams["input_type"] = "search_document"
12871287
batch_size = 96 # limit on cohere server
1288+
elif ctx.deployment.provider == CloudProvider.BEDROCK:
1289+
batch_size = 96
12881290
elif ctx.deployment.provider == CloudProvider.JINA_AI:
12891291
batch_size = 128 # don't know limit, but too large will timeout
12901292
elif ctx.deployment.provider == CloudProvider.VOYAGE:

0 commit comments

Comments
 (0)