
Commit 6051af3

Merge pull request #5707 from MicrosoftDocs/main
Merged by Learn.Build PR Management system
2 parents: 825fd18 + 6ff90e8

10 files changed: +100 -93 lines


articles/ai-services/openai/concepts/prompt-engineering.md

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
 ---
-title: Azure OpenAI in Azure AI Foundry Models | Prompt engineering techniques
+title: Prompt engineering techniques | Azure OpenAI in Azure AI Foundry Models
 titleSuffix: Azure OpenAI
 description: Learn how to use prompt engineering to optimize your work with Azure OpenAI.
 ms.service: azure-ai-openai

articles/ai-services/openai/how-to/prompt-caching.md

Lines changed: 0 additions & 3 deletions
@@ -34,9 +34,6 @@ Currently only the following models support prompt caching with Azure OpenAI:
 - `gpt-4.1-2025-04-14`
 - `gpt-4.1-nano-2025-04-14`
 
-> [!NOTE]
-> Prompt caching is now also available as part of model fine-tuning for `gpt-4o` and `gpt-4o-mini`. Refer to the fine-tuning section of the [pricing page](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) for details.
-
 ## API support
 
 Official support for prompt caching was first added in API version `2024-10-01-preview`. At this time, only the o-series model family supports the `cached_tokens` API response parameter.
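
The `cached_tokens` parameter described in the changed article text can be read straight off a response's `usage` payload. A minimal sketch, assuming the payload is shaped like recent chat completions responses (`prompt_tokens_details.cached_tokens`); confirm field names against the API version you target:

```python
# Sketch: computing the prompt-cache hit ratio from a chat completions
# `usage` payload. The payload shape below is an assumption based on the
# `cached_tokens` response parameter described above.

def cache_hit_ratio(usage: dict) -> float:
    """Fraction of prompt tokens that were served from the prompt cache."""
    prompt_tokens = usage.get("prompt_tokens", 0)
    details = usage.get("prompt_tokens_details") or {}
    cached = details.get("cached_tokens", 0)
    return cached / prompt_tokens if prompt_tokens else 0.0

# Example payload: 1,024 of 2,048 prompt tokens were cache hits.
usage = {
    "prompt_tokens": 2048,
    "prompt_tokens_details": {"cached_tokens": 1024},
    "completion_tokens": 120,
}
print(cache_hit_ratio(usage))  # 0.5
```

A ratio of 0.0 simply means no prefix of the prompt was reused; caching only applies to the models listed in the article.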

articles/ai-services/openai/how-to/provisioned-throughput-onboarding.md

Lines changed: 2 additions & 2 deletions
@@ -3,7 +3,7 @@ title: Understanding costs associated with provisioned throughput units (PTU)
 description: Learn about provisioned throughput costs and billing in Azure AI Foundry.
 ms.service: azure-ai-openai
 ms.topic: conceptual
-ms.date: 06/13/2025
+ms.date: 06/25/2025
 manager: nitinme
 author: aahill
 ms.author: aahi

@@ -83,7 +83,7 @@ For example, for `gpt-4.1:2025-04-14`, 1 output token counts as 4 input tokens t
 |Global & data zone provisioned scale increment| 5 | 5|5| 5 | 5 |5|5|5|5| 100|100|
 |Regional provisioned minimum deployment|25| 50|25| 25 |50 | 25|25|50|25| NA|NA|
 |Regional provisioned scale increment|25| 50|25| 25 | 50 | 25|50|50|25|NA|NA|
-|Input TPM per PTU|5,400 | 3,000|14,900| 59,400 | 600 | 2,500|230|2,500|37,000|4,000|4,000|
+|Input TPM per PTU|5,400 | 3,000|14,900| 59,400 | 3,000 | 2,500|230|2,500|37,000|4,000|4,000|
 |Latency Target Value| 99% > 66 Tokens Per Second\* | 99% > 40 Tokens Per Second\* | 99% > 50 Tokens Per Second\*| 99% > 60 Tokens Per Second\* | 99% > 40 Tokens Per Second\* | 99% > 66 Tokens Per Second\* | 99% > 25 Tokens Per Second\* | 99% > 25 Tokens Per Second\* | 99% > 33 Tokens Per Second\* | 99% > 50 Tokens Per Second\*| 99% > 50 Tokens Per Second\*|
 
 \* Calculated as the average request latency on a per-minute basis across the month.
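
The sizing arithmetic implied by the changed table (output tokens weighted 4x for `gpt-4.1`, then divided by the model's Input TPM per PTU) can be sketched as a back-of-the-envelope estimate. This is not an official calculator: the `ptus_needed` helper is hypothetical, `5,400` is used purely as an example table value, and the minimum-deployment and scale-increment constraints above are ignored:

```python
# Hypothetical back-of-the-envelope PTU estimate; not an official calculator.
# output_weight=4 reflects the gpt-4.1 example above (1 output token counts
# as 4 input tokens); tpm_per_ptu=5_400 is one example value from the table.

def ptus_needed(input_tpm: int, output_tpm: int,
                tpm_per_ptu: int = 5_400, output_weight: int = 4) -> int:
    """Estimate PTUs for a per-minute token workload, rounding up."""
    effective_tpm = input_tpm + output_weight * output_tpm
    # Ceiling division: you can't provision a fractional PTU.
    return -(-effective_tpm // tpm_per_ptu)

# 90,000 input TPM + 9,000 output TPM -> 126,000 effective TPM -> 24 PTUs.
print(ptus_needed(input_tpm=90_000, output_tpm=9_000))  # 24
```

Any result would still need rounding up to the applicable minimum deployment size and scale increment for the model and deployment type.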

articles/machine-learning/how-to-use-batch-pipeline-deployments.md

Lines changed: 1 addition & 1 deletion
@@ -236,7 +236,7 @@ ml_client.compute.begin_delete(name="batch-cluster")
 
 ## Next steps
 
-- [How to deploy a training pipeline with batch endpoints)](how-to-use-batch-training-pipeline.md)
+- [How to deploy a training pipeline with batch endpoints](how-to-use-batch-training-pipeline.md)
 - [How to deploy a pipeline to perform batch scoring with preprocessing](how-to-use-batch-scoring-pipeline.md)
 - [Create batch endpoints from pipeline jobs](how-to-use-batch-pipeline-from-job.md)
 - [Create jobs and input data for batch endpoints](how-to-access-data-batch-endpoints-jobs.md)

articles/machine-learning/how-to-use-foundation-models.md

Lines changed: 14 additions & 12 deletions
@@ -15,21 +15,21 @@ ms.collection: ce-skilling-ai-copilot
 
 # How to use Open Source foundation models curated by Azure Machine Learning
 
-In this article, you learn how to fine tune, evaluate and deploy foundation models in the Model Catalog.
+In this article, you learn how to fine tune, evaluate, and deploy foundation models in the Model Catalog.
 
-You can quickly test out any pre-trained model using the Sample Inference form on the model card, providing your own sample input to test the result. Additionally, the model card for each model includes a brief description of the model and links to samples for code based inferencing, fine-tuning and evaluation of the model.
+You can quickly test out any pre-trained model using the Sample Inference form on the model card, providing your own sample input to test the result. Additionally, the model card for each model includes a brief description of the model and links to samples for code based inferencing, fine-tuning, and evaluation of the model.
 
 ## How to evaluate foundation models using your own test data
 
-You can evaluate a Foundation Model against your test dataset, using either the Evaluate UI form or by using the code based samples, linked from the model card.
+You can evaluate a foundation model against your test dataset, using either the Evaluate UI form or by using the code based samples, linked from the model card.
 
 ### Evaluating using the studio
 
 You can invoke the Evaluate model form by selecting the **Evaluate** button on the model card for any foundation model.
 
 :::image type="content" source="./media/how-to-use-foundation-models/evaluate-quick-wizard.png" alt-text="Screenshot showing the evaluation settings form after the user selects the evaluate button on a model card for a foundation model.":::
 
-Each model can be evaluated for the specific inference task that the model will be used for.
+You can evaluate each model for the specific inference task that you use the model for.
 
 **Test Data:**
 

@@ -46,7 +46,7 @@ Each model can be evaluated for the specific inference task that the model will
 
 ### Evaluating using code based samples
 
-To enable users to get started with model evaluation, we have published samples (both Python notebooks and CLI examples) in the [Evaluation samples in azureml-examples git repo](https://github.com/Azure/azureml-examples/tree/main/sdk/python/foundation-models/system/evaluation). Each model card also links to evaluation samples for corresponding tasks
+To enable you to get started with model evaluation, we provide samples (both Python notebooks and CLI examples) in the [Evaluation samples in azureml-examples git repo](https://github.com/Azure/azureml-examples/tree/main/sdk/python/foundation-models/system/evaluation). Each model card also links to evaluation samples for corresponding tasks
 
 ## How to fine-tune foundation models using your own training data
 

@@ -69,7 +69,7 @@ You can invoke the fine-tune settings form by selecting on the **Fine-tune** but
 
 1. Pass in the training data you would like to use to fine-tune your model. You can choose to either upload a local file (in JSONL, CSV or TSV format) or select an existing registered dataset from your workspace.
 
-1. Once you've selected the dataset, you need to map the columns from your input data, based on the schema needed for the task. For example: map the column names that correspond to the 'sentence' and 'label' keys for Text Classification
+1. Once you select the dataset, you need to map the columns from your input data, based on the schema needed for the task. For example: map the column names that correspond to the 'sentence' and 'label' keys for Text Classification
 
 :::image type="content" source="./media/how-to-use-foundation-models/finetune-map-data-columns.png" lightbox="./media/how-to-use-foundation-models/finetune-map-data-columns.png" alt-text="Screenshot showing the fine-tune map in the foundation models evaluate wizard.":::
 

@@ -90,7 +90,7 @@ Currently, Azure Machine Learning supports fine-tuning models for the following
 * Summarization
 * Translation
 
-To enable users to quickly get started with fine-tuning, we have published samples (both Python notebooks and CLI examples) for each task in the [azureml-examples git repo Finetune samples](https://github.com/Azure/azureml-examples/tree/main/sdk/python/foundation-models/system/finetune). Each model card also links to fine-tuning samples for supported fine-tuning tasks.
+To enable users to quickly get started with fine-tuning, we have published samples (both Python notebooks and CLI examples) for each task in the [azureml-examples git repo fine-tune samples](https://github.com/Azure/azureml-examples/tree/main/sdk/python/foundation-models/system/finetune). Each model card also links to fine-tuning samples for supported fine-tuning tasks.
 
 ## Deploying foundation models to endpoints for inferencing
 

@@ -117,11 +117,13 @@ If you're deploying a Llama-2, Phi, Nemotron, Mistral, Dolly or Deci-DeciLM mode
 
 ### Deploying using code based samples
 
-To enable users to quickly get started with deployment and inferencing, we have published samples in the [Inference samples in the azureml-examples git repo](https://github.com/Azure/azureml-examples/tree/main/sdk/python/foundation-models/system/inference). The published samples include Python notebooks and CLI examples. Each model card also links to Inference samples for Real time and Batch inferencing.
+To enable you to quickly get started with deployment and inferencing, we provide samples in the [Inference samples in the azureml-examples git repo](https://github.com/Azure/azureml-examples/tree/main/sdk/python/foundation-models/system/inference). The published samples include Python notebooks and CLI examples. Each model card also links to Inference samples for Real time and Batch inferencing.
 
 ## Import foundation models
 
-If you're looking to use an open source model that isn't included in the model catalog, you can import the model from Hugging Face into your Azure Machine Learning workspace. Hugging Face is an open-source library for natural language processing (NLP) that provides pre-trained models for popular NLP tasks. Currently, model import supports importing models for the following tasks, as long as the model meets the requirements listed in the Model Import Notebook:
+If you search the model catalog and don't find the open source model you need, you can import it from Hugging Face into your Azure Machine Learning workspace. The **Import** button appears in the model catalog only when your search returns no results.
+
+Hugging Face is an open-source library for natural language processing (NLP) that provides pre-trained models for popular NLP tasks. Currently, model import supports importing models for the following tasks, as long as the model meets the requirements listed in the Model Import Notebook:
 
 * fill-mask
 * token-classification

@@ -136,15 +138,15 @@ If you're looking to use an open source model that isn't included in the model c
 > [!NOTE]
 > Models from Hugging Face are subject to third-party license terms available on the Hugging Face model details page. It is your responsibility to comply with the model's license terms.
 
-You can select the **Import** button on the top-right of the model catalog to use the Model Import Notebook.
+When your search returns no results, select **Import model notebook** to use the Model Import Notebook.
 
-:::image type="content" source="./media/how-to-use-foundation-models/model-import.png" alt-text="Screenshot showing the model import button as it's displayed in the top right corner on the foundation model catalog.":::
+:::image type="content" source="./media/how-to-use-foundation-models/model-import.png" alt-text="Screenshot showing the model import button as it is displayed when search returns no results in the foundation model catalog.":::
 
 The model import notebook is also included in the azureml-examples git repo [here](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/system/import/import_model_into_registry.ipynb).
 
 In order to import the model, you need to pass in the `MODEL_ID` of the model you wish to import from Hugging Face. Browse models on Hugging Face hub and identify the model to import. Make sure the task type of the model is among the supported task types. Copy the model ID, which is available in the URI of the page or can be copied using the copy icon next to the model name. Assign it to the variable 'MODEL_ID' in the Model import notebook. For example:
 
-:::image type="content" source="./media/how-to-use-foundation-models/hugging-face-model-id.png" alt-text="Screenshot showing an example of a hugging face model ID ('bert-base-uncased') as it's displayed in the hugging face model documentation page.":::
+:::image type="content" source="./media/how-to-use-foundation-models/hugging-face-model-id.png" alt-text="Screenshot showing an example of a hugging face model ID ('bert-base-uncased') as it is displayed in the hugging face model documentation page.":::
 
 You need to provide compute for the Model import to run. Running the Model Import results in the specified model being imported from Hugging Face and registered to your Azure Machine Learning workspace. You can then fine-tune this model or deploy it to an endpoint for inferencing.
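
The `MODEL_ID` step in the section above can be sketched as follows. The `is_importable` helper is hypothetical and not code from the Model Import Notebook; only the two task types visible in this excerpt are listed, and `bert-base-uncased` is the example ID from the screenshot:

```python
# Hypothetical illustration of the MODEL_ID step; not code from the
# Model Import Notebook.

# Task types visible in this excerpt (the article lists more):
SUPPORTED_TASKS = {"fill-mask", "token-classification"}

def is_importable(task_type: str) -> bool:
    """Check a Hugging Face task type against the supported set."""
    return task_type in SUPPORTED_TASKS

# Model ID copied from the Hugging Face model page URI:
MODEL_ID = "bert-base-uncased"

print(is_importable("fill-mask"))  # True
```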

(binary image file changed: -85 KB)

articles/open-datasets/dataset-the-cancer-genome-atlas.md

Lines changed: 1 addition & 1 deletion
@@ -14,7 +14,7 @@ ms.date: 09/22/2022
 
 [!INCLUDE [Open Dataset usage notice](./includes/open-datasets-change-notice.md)]
 
-The Cancer Genome Atlas (TCGA), a landmark cancer genomics program, molecularly characterized over 20,000 primary cancer and matched normal samples spanning 33 cancer types[[1]](https://www.cancer.gov/about-nci/organization/ccg/research/structural-genomics/tcga). The TCGA cancer data made available publically are two tiers: open or controlled access.
+The Cancer Genome Atlas (TCGA), a landmark cancer genomics program, molecularly characterized over 20,000 primary cancer and matched normal samples spanning 33 cancer types[[1]](https://www.cancer.gov/about-nci/organization/ccg/research/structural-genomics/tcga). The TCGA cancer data are made publicly available in two tiers: open or controlled access.
 
 - Open access [available on Azure]: This dataset contains deindentified clinical and biospecimen data or summarized data that doesn't contain any individually identifiable information. The data types included are Gene expression, methylation beta values and protein quantification. DNA level datatype includes gene level copy number and masked copy number segment.
 - Controlled access: This dataset is the individual level sequence data and requires approval through dbGap for access.
