-
I'm trying out 1.18.0-alpha and the new Assistants V2 capabilities with Code Interpreter. It works well with file search, but with Code Interpreter I'm mostly getting rate limited by AOAI (e.g. "Rate limit is exceeded. Try again in 9 seconds."). I notice that the thread is polled every second. Is that configurable (e.g. poll every 5 or 10 seconds instead)? Thanks.
Edit: I notice that when using Azure.AI.OpenAI directly I can poll manually, and polling every five seconds avoids the rate limits. I will add this as an issue, since the default polling in SK appears to be problematic.
Replies: 2 comments 1 reply
-
I experienced this issue as well.
-
You should be able to configure the polling frequency using the PollingOptions property: https://github.com/microsoft/semantic-kernel/blob/feature-connectors-openai/dotnet/src/Agents/OpenAI/OpenAIAssistantAgent.cs#L56
I'd like to understand the configuration of your deployment more clearly. I was under the impression that throttling was based on tokens. Are you able to share how your rate limit is configured?
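For anyone who wants to see what the manual polling workaround from the question looks like in general terms, here is a minimal, language-neutral sketch in Python. It is not the Semantic Kernel or Azure.AI.OpenAI API: `poll_until_done`, `get_status`, and the status names are hypothetical stand-ins, and the only idea it illustrates is waiting a configurable interval (for example 5 seconds) between status checks instead of 1 second.

```python
import time
from typing import Callable

# Assumption: illustrative terminal states, not taken from any SDK.
TERMINAL_STATES = {"completed", "failed", "cancelled", "expired"}


def poll_until_done(
    get_status: Callable[[], str],
    interval_seconds: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call get_status() until it returns a terminal state.

    get_status is a hypothetical callback that would wrap whatever
    "retrieve run" call your client library exposes. The sleep function
    is injectable so the loop can be exercised without real delays.
    """
    for _ in range(max_attempts):
        status = get_status()
        if status in TERMINAL_STATES:
            return status
        # Wait before the next check; a longer interval means fewer
        # requests counted against the deployment's rate limit.
        sleep(interval_seconds)
    raise TimeoutError(f"run did not finish after {max_attempts} checks")


if __name__ == "__main__":
    # Simulated run that finishes on the fourth status check.
    fake_statuses = iter(["queued", "in_progress", "in_progress", "completed"])
    final = poll_until_done(
        lambda: next(fake_statuses),
        interval_seconds=5.0,
        sleep=lambda seconds: None,  # skip real waiting in the demo
    )
    print(final)
```

With a 5 second interval, a run that takes 30 seconds costs roughly 6 status requests rather than about 30, which is the effect described in the question's edit.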