Commit 55b68da

Increase boto3 max_pool_connections from default 10 to 50
The default urllib3 connection pool size of 10 causes HTTP connection queuing under concurrent LLM requests, degrading total response time by ~60% at 20 concurrent requests.
1 parent 4696f44 commit 55b68da

1 file changed, +1 -0 lines changed
singlestoredb/ai/chat.py

Lines changed: 1 addition & 0 deletions
@@ -114,6 +114,7 @@ def obo_token_getter_fn() -> Optional[str]:
 cfg_kwargs = {
     'signature_version': UNSIGNED,
     'retries': {'max_attempts': 1, 'mode': 'standard'},
+    'max_pool_connections': 50,
 }
 if read_timeout is not None:
     cfg_kwargs['read_timeout'] = read_timeout
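
The queuing effect described in the commit message can be illustrated with a toy model. This is a sketch, not the library's actual behavior: it assumes that requests beyond the pool size wait for a free connection, that all requests arrive at once, and that each takes the same time. The function name `simulated_total_time` and its parameters are hypothetical and do not appear in the commit.

```python
import heapq


def simulated_total_time(n_requests: int, pool_size: int, request_seconds: float) -> float:
    """Return when the last request finishes in a toy blocking-pool model.

    Assumption (not taken from the commit): a request that finds no free
    connection waits until the earliest one is released.
    """
    if pool_size < 1:
        raise ValueError("pool_size must be at least 1")

    # Each heap entry is the time at which one pooled connection becomes free.
    free_at = [0.0] * min(pool_size, n_requests)
    heapq.heapify(free_at)

    finish = 0.0
    for _ in range(n_requests):
        # Run the next request on whichever connection frees up first.
        start = heapq.heappop(free_at)
        end = start + request_seconds
        heapq.heappush(free_at, end)
        finish = max(finish, end)
    return finish


if __name__ == "__main__":
    for pool in (10, 50):
        print(pool, simulated_total_time(20, pool, 1.0))
```

In this idealized model, 20 simultaneous one-second requests finish after 2.0 s with a pool of 10 and after 1.0 s with a pool of 50. The model is deliberately simple and does not reproduce the ~60% figure reported above, which comes from the author's own measurements.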
