Commit de8f627

Merge pull request #5550 from PatrickFarley/imagen
Imagen
2 parents 401c406 + d007e24 commit de8f627

1 file changed: +10 -0 lines changed

articles/ai-services/openai/quotas-limits.md

Lines changed: 10 additions & 0 deletions
@@ -208,6 +208,16 @@ The rate limits for each `gpt-4o` audio model deployment are 100 K TPM and 1 K R

 M = million | K = thousand

+## GPT-image-1 rate limits
+
+### GPT-image-1 global standard
+
+| Model | Tier | Quota limit in tokens per minute (TPM) | Requests per minute |
+|---|---|:---:|:---:|
+| `gpt-image-1` | Enterprise agreement | N/A | 20 |
+| `gpt-image-1` | Default | N/A | 6 |
+
+
 ## Usage tiers

 Global standard deployments use Azure's global infrastructure, dynamically routing customer traffic to the data center with the best availability for the customer’s inference requests. Similarly, Data zone standard deployments allow you to use Azure global infrastructure to dynamically route traffic to the data center within the Microsoft-defined data zone with the best availability for each request. This enables more consistent latency for customers with low to medium levels of traffic. Customers with high sustained levels of usage might see greater variability in response latency.
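
Reviewer note: the `gpt-image-1` limits added in this commit are expressed only in requests per minute (20 for Enterprise agreement, 6 for Default), so a caller may want to pace requests on the client side. The following is a minimal sketch of a sliding-window throttle, not part of the commit and not an Azure SDK API. The names `RPM_LIMITS` and `RpmThrottle` are hypothetical, and the 60-second sliding window is an assumption about how the limit is enforced.

```python
import time
from collections import deque

# Requests-per-minute values copied from the table added in this commit.
RPM_LIMITS = {"Enterprise agreement": 20, "Default": 6}


class RpmThrottle:
    """Hypothetical helper: allow at most `rpm` calls in any 60-second window."""

    def __init__(self, rpm, clock=time.monotonic, sleep=time.sleep):
        self.rpm = rpm
        self.clock = clock    # injectable so the throttle can be tested
        self.sleep = sleep
        self.sent = deque()   # timestamps of calls made in the last 60 s

    def wait_time(self):
        """Return the seconds to wait before the next call (0.0 if none)."""
        now = self.clock()
        # Drop timestamps that have aged out of the 60-second window.
        while self.sent and now - self.sent[0] >= 60.0:
            self.sent.popleft()
        if len(self.sent) < self.rpm:
            return 0.0
        return 60.0 - (now - self.sent[0])

    def acquire(self):
        """Block until a call is allowed, then record it."""
        while (delay := self.wait_time()) > 0:
            self.sleep(delay)
        self.sent.append(self.clock())
```

A caller would create `RpmThrottle(RPM_LIMITS["Default"])` once and call `acquire()` before each image request. The service can still return a rate-limit error, so this complements retry handling rather than replacing it.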
