
Commit 540992d: combined 4o models (1 parent: d1be6e8)

File tree

1 file changed: +6 −6 lines

articles/ai-services/openai/concepts/provisioned-throughput.md

Lines changed: 6 additions & 6 deletions
@@ -44,12 +44,12 @@ Generating output tokens requires more processing and the more tokens generated,

 To help simplify the sizing effort, the table below outlines the TPM per PTU for the `gpt-4o` and `gpt-4o-mini` models

-| | **gpt-4o**, **2024-05-13** | **gpt-4o**, **2024-08-06** | **gpt-4o-mini**, **2024-07-18** |
-| -- | -- | -- | -- |
-| Deployable Increments | 50 | 50 | 25 |
-| Input TPM per PTU | 2,500 | 2,500 | 37,000 |
-| Output TPM per PTU | 833 | 833 | 12,333 |
-| Latency target | > 25 tokens per second* | > 25 tokens per second* | > 33 tokens per second* |
+| | **gpt-4o**, **2024-05-13** & **gpt-4o**, **2024-08-06** | **gpt-4o-mini**, **2024-07-18** |
+| -- | -- | -- |
+| Deployable Increments | 50 | 25 |
+| Input TPM per PTU | 2,500 | 37,000 |
+| Output TPM per PTU | 833 | 12,333 |
+| Latency target | > 25 tokens per second* | > 33 tokens per second* |

 \* Calculated as the average of the per-call average generated tokens on a 1-minute basis over the month
 \** For a full list, see the [AOAI Studio calculator](https://oai.azure.com/portal/calculator)
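The TPM-per-PTU figures in the table lend themselves to a back-of-the-envelope sizing estimate. The sketch below is an illustration only, not the official sizing method: it assumes the required PTU count is driven by whichever of the input or output workload needs more PTUs, then rounds up to the model's deployable increment. The `estimate_ptus` helper and the max-of-the-two rule are assumptions for illustration; the AOAI Studio calculator linked above is the authoritative tool.

```python
import math

# TPM-per-PTU figures and deployable increments from the combined table above.
MODEL_SPECS = {
    "gpt-4o":      {"input_tpm_per_ptu": 2_500,  "output_tpm_per_ptu": 833,    "increment": 50},
    "gpt-4o-mini": {"input_tpm_per_ptu": 37_000, "output_tpm_per_ptu": 12_333, "increment": 25},
}

def estimate_ptus(model: str, input_tpm: float, output_tpm: float) -> int:
    """Rough PTU estimate (assumption, not the official sizing rule):
    take the larger of the input- and output-driven PTU requirements,
    then round up to the model's deployable increment."""
    spec = MODEL_SPECS[model]
    raw = max(input_tpm / spec["input_tpm_per_ptu"],
              output_tpm / spec["output_tpm_per_ptu"])
    return math.ceil(raw / spec["increment"]) * spec["increment"]

# Example: a gpt-4o workload of 200k input TPM and 40k output TPM.
# Input needs 80 PTUs, output ~48, so rounding 80 up to increments of 50 gives 100.
print(estimate_ptus("gpt-4o", 200_000, 40_000))  # → 100
```

Note that rounding to deployable increments can leave significant headroom, which is one reason to verify any estimate against the calculator before purchasing.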
