Commit b4cb667

adding links
1 parent 3bfd198 commit b4cb667

3 files changed: +9 -1 lines changed


articles/ai-services/openai/concepts/provisioned-throughput.md

Lines changed: 4 additions & 1 deletion

@@ -14,6 +14,9 @@ recommendations: false

The provisioned throughput capability allows you to specify the amount of throughput you require in a deployment. The service then allocates the necessary model processing capacity and ensures it's ready for you. Throughput is defined in terms of provisioned throughput units (PTU), which is a normalized way of representing the throughput for your deployment. Each model-version pair requires different amounts of PTU to deploy and provides different amounts of throughput per PTU.

+> [!NOTE]
+> On July 29th 2024, Microsoft switched to an hourly/reservation PTU offering that offers usability improvements. For more details, see the [PTU migration article](../provisioned-migration.md#whats-changing).
+
## What does the provisioned deployment type provide?

- **Predictable performance:** stable max latency and throughput for uniform workloads.

@@ -47,7 +50,7 @@ An Azure OpenAI Deployment is a unit of management for a specific OpenAI Model.

### Provisioned throughput units

-Provisioned throughput units (PTU) are units of model processing capacity that you can reserve and deploy for processing prompts and generating completions. The minimum PTU deployment, increments, and processing capacity associated with each unit varies by model type & version.
+Provisioned throughput units (PTU) are units of model processing capacity that you can reserve and deploy for processing prompts and generating completions. The minimum PTU deployment, increments, and processing capacity associated with each unit varies by model type & version.

### Deployment types

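The PTU paragraphs in the diff above describe capacity as a count of provisioned throughput units attached to a deployment. As a rough illustration of where that number goes, here is a minimal sketch of creating a provisioned deployment with the Azure management SDK for Python. It assumes the `azure-mgmt-cognitiveservices` package, the `ProvisionedManaged` SKU used for provisioned deployments, and placeholder subscription, resource, and model values; treat it as a sketch rather than the documented procedure.

```python
# Minimal sketch (assumptions noted above): create an Azure OpenAI deployment
# whose SKU capacity is a PTU count. Resource names and the PTU value are
# placeholders; minimum PTUs and increments vary by model type and version.
from azure.identity import DefaultAzureCredential
from azure.mgmt.cognitiveservices import CognitiveServicesManagementClient
from azure.mgmt.cognitiveservices.models import (
    Deployment,
    DeploymentModel,
    DeploymentProperties,
    Sku,
)

client = CognitiveServicesManagementClient(
    credential=DefaultAzureCredential(),
    subscription_id="<subscription-id>",
)

deployment = Deployment(
    properties=DeploymentProperties(
        model=DeploymentModel(format="OpenAI", name="gpt-4", version="0613"),
    ),
    # sku.capacity carries the number of PTUs for the deployment.
    sku=Sku(name="ProvisionedManaged", capacity=100),
)

poller = client.deployments.begin_create_or_update(
    resource_group_name="<resource-group>",
    account_name="<azure-openai-resource>",
    deployment_name="gpt-4-provisioned",
    deployment=deployment,
)
print(poller.result().name)
```
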
articles/ai-services/openai/how-to/provisioned-throughput-onboarding.md

Lines changed: 3 additions & 0 deletions

@@ -21,6 +21,9 @@ This article walks you through the process of onboarding to [Provisioned Through

You should consider switching from pay-as-you-go to provisioned throughput when you have well-defined, predictable throughput requirements. Typically, this occurs when the application is ready for production or has already been deployed in production and there's an understanding of the expected traffic. This will allow users to accurately forecast the required capacity and avoid unexpected billing.

+> [!NOTE]
+> On July 29th 2024, Microsoft switched to an hourly/reservation PTU offering that offers usability improvements. For more details, see the [PTU migration article](../provisioned-migration.md#whats-changing).
+
### Typical PTU scenarios

- An application that is ready for production or in production.
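
The onboarding guidance above ties the switch to provisioned throughput to being able to forecast required capacity and avoid unexpected billing, and the commit's note points at the hourly/reservation offering. The sketch below only illustrates that comparison arithmetically; the per-PTU hourly rate, the per-PTU monthly reservation rate, and the assumption that a reservation is priced per PTU per month are placeholders and assumptions to replace with actual pricing and your own forecast.

```python
# Sketch of the hourly-versus-reservation comparison implied above.
# All rates here are placeholders, not real Azure prices.
def monthly_ptu_cost(ptus: int,
                     hourly_rate_per_ptu: float,
                     hours_deployed: float,
                     monthly_reservation_rate_per_ptu: float) -> dict:
    """Project one month of cost under hourly billing and under a reservation."""
    hourly_total = ptus * hourly_rate_per_ptu * hours_deployed
    reservation_total = ptus * monthly_reservation_rate_per_ptu
    return {
        "hourly": hourly_total,
        "reservation": reservation_total,
        "cheaper": "reservation" if reservation_total < hourly_total else "hourly",
    }


# Placeholder inputs: 100 PTUs running around the clock for a 30-day month.
print(monthly_ptu_cost(ptus=100,
                       hourly_rate_per_ptu=1.0,
                       hours_deployed=24 * 30,
                       monthly_reservation_rate_per_ptu=500.0))
```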

articles/ai-services/openai/provisioned-migration.md

Lines changed: 2 additions & 0 deletions

@@ -72,6 +72,7 @@ Customers will no longer obtain quota by contacting their sales teams. Instead,

The target is to respond to all quota requests within two business days. However, many requests will be autoapproved and responses can be expected within 30 minutes in these cases.

+
## New hourly reservation payment model

> [!NOTE]

@@ -104,6 +105,7 @@ Microsoft has introduced a new “Hourly/reservation” payment model for provis
> - Create deployments on new Azure OpenAI resources without commitments.
> - Migrate an existing resource off its commitments.

+
## Hourly reservation model details

Details on the hourly/reservation model can be found in the [Azure OpenAI Provisioned Onboarding Guide](./how-to/provisioned-throughput-onboarding.md).
