Skip to content

Commit 9b798df

Browse files
authored
Merge pull request #281975 from ShawnJackson/azure-ai-studio-conceptual-articles
[AQ] edit pass: Azure AI Studio conceptual articles
2 parents 905a81c + dd70531 commit 9b798df

8 files changed

+334
-284
lines changed

articles/ai-studio/concepts/concept-model-distillation.md

Lines changed: 16 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -15,28 +15,28 @@ ms.custom: references_regions
1515

1616
# Distillation in Azure AI Studio
1717

18-
In this article
19-
- [Distillation](#distillation)
20-
- [Next Steps](#next-steps)
18+
In Azure AI Studio, you can use distillation to efficiently train a student model.
2119

22-
In Azure AI Studio, you can leverage Distillation to efficiently train the student model.
20+
## What is distillation?
2321

24-
## Distillation
22+
In machine learning, distillation is a technique for transferring knowledge from a large, complex model (often called the *teacher model*) to a smaller, simpler model (the *student model*). This process helps the smaller model achieve similar performance to the larger one while being more efficient in terms of computation and memory usage.
2523

26-
In machine learning, distillation is a technique used to transfer knowledge from a large, complex model (often called the “teacher model”) to a smaller, simpler model (the “student model”). This process helps the smaller model achieve similar performance to the larger one while being more efficient in terms of computation and memory usage.
24+
## Distillation steps
2725

28-
The main steps in knowledge distillation involve:
26+
The main steps in knowledge distillation are:
2927

30-
- **Using the teacher model** to generate predictions for the dataset.
28+
1. Use the teacher model to generate predictions for the dataset.
3129

32-
- **Training the student model** using these predictions, along with the original dataset, to mimic the teacher model’s behavior.
33-
34-
You can use the sample notebook available at this [link](https://aka.ms/meta-llama-3.1-distillation) to see how to perform distillation. In this sample notebook, the teacher model used the Meta Llama 3.1 405B Instruct model, and the student model used the Meta Llama 3.1 8B Instruct.
30+
1. Train the student model by using these predictions, along with the original dataset, to mimic the teacher model's behavior.
3531

36-
We used an advanced prompt during synthetic data generation, which incorporates Chain of thought (COT) reasoning, resulting in higher accuracy data labels in the synthetic data. This further improves the accuracy of the distilled model.
32+
## Sample notebook
3733

38-
## Next steps
39-
- [What is Azure AI Studio?](../what-is-ai-studio.md)
40-
- [Learn more about deploying Meta Llama models](../how-to/deploy-models-llama.md)
34+
You can use the [sample notebook](https://aka.ms/meta-llama-3.1-distillation) to see how to perform distillation. In this sample notebook, the teacher model uses the Meta Llama 3.1 405B instruction model, and the student model uses the Meta Llama 3.1 8B instruction model.
35+
36+
We used an advanced prompt during synthetic data generation. The advanced prompt incorporates chain-of-thought (CoT) reasoning, which results in higher-accuracy data labels in the synthetic data. This labeling further improves the accuracy of the distilled model.
4137

42-
- [Azure AI FAQ article](../faq.yml)
38+
## Related content
39+
40+
- [What is Azure AI Studio?](../what-is-ai-studio.md)
41+
- [Deploy Meta Llama 3.1 models with Azure AI Studio](../how-to/deploy-models-llama.md)
42+
- [Azure AI Studio FAQ](../faq.yml)
Lines changed: 15 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
title: Synthetic data generation in AI Studio
33
titleSuffix: Azure AI Studio
4-
description: Learn how to generate Synthetic dataset in Azure AI Studio.
4+
description: Learn how to generate a synthetic dataset in Azure AI Studio.
55
manager: scottpolly
66
ms.service: azure-ai-studio
77
ms.topic: how-to
@@ -15,23 +15,26 @@ ms.custom: references_regions
1515

1616
# Synthetic data generation in Azure AI Studio
1717

18-
In Azure AI Studio, you can use synthetic data generation to efficiently produce predictions for your datasets. In this article, you're introduced to the concept of synthetic data generation and how it can be used in machine learning.
18+
In Azure AI Studio, you can use synthetic data generation to efficiently produce predictions for your datasets. This article introduces you to the concept of synthetic data generation and how you can use it in machine learning.
1919

20+
## What is synthetic data generation?
2021

21-
## Synthetic data generation
22+
Synthetic data generation involves creating artificial data that mimics the statistical properties of real-world data. This data is generated through algorithms and machine learning techniques. You can use the data in various ways, such as computer simulations or modeling real-world events.
2223

23-
Synthetic data generation involves creating artificial data that mimics the statistical properties of real-world data. This data is generated using algorithms and machine learning techniques, and it can be used in various ways, such as computer simulations or by modeling real-world events.
24+
## Benefits
2425

25-
In machine learning, synthetic data is valuable for several reasons:
26+
In machine learning, synthetic data is valuable for:
2627

27-
**Data Augmentation:** It helps in expanding the size of training datasets, which is crucial for training robust machine learning models. This is especially useful when real-world data is scarce or expensive to obtain.
28+
- **Data augmentation**: It helps in expanding the size of training datasets, which is crucial for training robust machine learning models. This expansion technique is especially useful when real-world data is scarce or expensive to obtain.
2829

29-
**Testing and Validation:** It allows for extensive testing and validation of machine learning models under various scenarios without the need for real-world data.
30+
- **Testing and validation**: It allows for extensive testing and validation of machine learning models under various scenarios without the need for real-world data.
3031

31-
You can use the sample notebook available at this [link](https://aka.ms/meta-llama-3.1-datagen) to see how to generate Synthetic data.
32+
## Sample notebook
3233

33-
## Next steps
34-
- [What is Azure AI Studio?](../what-is-ai-studio.md)
35-
- [Learn more about deploying Meta Llama models](../how-to/deploy-models-llama.md)
34+
To see how to generate synthetic data, you can use the [sample notebook](https://aka.ms/meta-llama-3.1-datagen).
35+
36+
## Related content
3637

37-
- [Azure AI FAQ article](../faq.yml)
38+
- [What is Azure AI Studio?](../what-is-ai-studio.md)
39+
- [Deploy Meta Llama 3.1 models with Azure AI Studio](../how-to/deploy-models-llama.md)
40+
- [Azure AI Studio FAQ](../faq.yml)

0 commit comments

Comments
 (0)