Skip to content

Commit e01dc4f

Browse files
Merge pull request #281836 from gojira/keijik/deps-07-25
Updates to GLobal Standard
2 parents d5c42f8 + d1d7a47 commit e01dc4f

File tree

1 file changed

+2
-4
lines changed

1 file changed

+2
-4
lines changed

articles/ai-services/openai/how-to/deployment-types.md

Lines changed: 2 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -55,11 +55,9 @@ Standard deployments are optimized for low to medium volume workloads with high
5555
> [!IMPORTANT]
5656
> Data might be processed outside of the resource's Azure geography, but data storage remains in its Azure geography. [Learn more about data residency](https://azure.microsoft.com/explore/global-infrastructure/data-residency/).
5757
58-
Global deployments are available in the same Azure OpenAI resources as non-global offers but allow you to leverage Azure's global infrastructure to dynamically route traffic to the data center with best availability for each request. Global standard will provide the highest default quota for new models and eliminates the need to load balance across multiple resources.
58+
Global deployments are available in the same Azure OpenAI resources as non-global deployment types but allow you to leverage Azure's global infrastructure to dynamically route traffic to the data center with best availability for each request. Global standard provides the highest default quota and eliminates the need to load balance across multiple resources.
5959

60-
The deployment type is optimized for low to medium volume workloads with high burstiness. Customers with high consistent volume may experience greater latency variability. The threshold is set per model. See the [quotas page to learn more](./quota.md).
61-
62-
For customers that require the lower latency variance at large workload usage, we recommend purchasing provisioned throughput.
60+
Customers with high consistent volume may experience greater latency variability. The threshold is set per model. See the [quotas page to learn more](./quota.md). For applications that require the lower latency variance at large workload usage, we recommend purchasing provisioned throughput.
6361

6462
### How to disable access to global deployments in your subscription
6563

0 commit comments

Comments
 (0)