You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
## Cost and quota considerations forDeepseek models deployed as serverless API endpoints
1139
+
## Cost and quota considerations forDeepSeek models deployed as serverless API endpoints
1140
1140
1141
1141
Quota is managed per deployment. Each deployment has a rate limit of200,000 tokens per minute and 1,000API requests per minute. However, we currently limit one deployment per model per project. Contact Microsoft Azure Support if the current rate limits aren't sufficient for your scenarios.
0 commit comments