You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-studio/concepts/concept-model-distillation.md
+16-16Lines changed: 16 additions & 16 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,28 +15,28 @@ ms.custom: references_regions
15
15
16
16
# Distillation in Azure AI Studio
17
17
18
-
In this article
19
-
-[Distillation](#distillation)
20
-
-[Next Steps](#next-steps)
18
+
In Azure AI Studio, you can use distillation to efficiently train a student model.
21
19
22
-
In Azure AI Studio, you can leverage Distillation to efficiently train the student model.
20
+
## What is distillation?
23
21
24
-
## Distillation
22
+
In machine learning, distillation is a technique for transferring knowledge from a large, complex model (often called the *teacher model*) to a smaller, simpler model (the *student model*). This process helps the smaller model achieve similar performance to the larger one while being more efficient in terms of computation and memory usage.
25
23
26
-
In machine learning, distillation is a technique used to transfer knowledge from a large, complex model (often called the “teacher model”) to a smaller, simpler model (the “student model”). This process helps the smaller model achieve similar performance to the larger one while being more efficient in terms of computation and memory usage.
24
+
## Distillation steps
27
25
28
-
The main steps in knowledge distillation involve:
26
+
The main steps in knowledge distillation are:
29
27
30
-
-**Using the teacher model** to generate predictions for the dataset.
28
+
1. Use the teacher model to generate predictions for the dataset.
31
29
32
-
-**Training the student model** using these predictions, along with the original dataset, to mimic the teacher model’s behavior.
33
-
34
-
You can use the sample notebook available at this [link](https://aka.ms/meta-llama-3.1-distillation) to see how to perform distillation. In this sample notebook, the teacher model used the Meta Llama 3.1 405B Instruct model, and the student model used the Meta Llama 3.1 8B Instruct.
30
+
1. Train the student model by using these predictions, along with the original dataset, to mimic the teacher model's behavior.
35
31
36
-
We used an advanced prompt during synthetic data generation, which incorporates Chain of thought (COT) reasoning, resulting in higher accuracy data labels in the synthetic data. This further improves the accuracy of the distilled model.
32
+
## Sample notebook
37
33
38
-
## Next steps
39
-
-[What is Azure AI Studio?](../what-is-ai-studio.md)
40
-
-[Learn more about deploying Meta Llama models](../how-to/deploy-models-llama.md)
34
+
You can use the [sample notebook](https://aka.ms/meta-llama-3.1-distillation) to see how to perform distillation. In this sample notebook, the teacher model uses the Meta Llama 3.1 405B instruction model, and the student model uses the Meta Llama 3.1 8B instruction model.
35
+
36
+
We used an advanced prompt during synthetic data generation. The advanced prompt incorporates chain-of-thought (CoT) reasoning, which results in higher-accuracy data labels in the synthetic data. This labeling further improves the accuracy of the distilled model.
41
37
42
-
-[Azure AI FAQ article](../faq.yml)
38
+
## Related content
39
+
40
+
-[What is Azure AI Studio?](../what-is-ai-studio.md)
41
+
-[Deploy Meta Llama 3.1 models with Azure AI Studio](../how-to/deploy-models-llama.md)
description: Learn how to generate Synthetic dataset in Azure AI Studio.
4
+
description: Learn how to generate a synthetic dataset in Azure AI Studio.
5
5
manager: scottpolly
6
6
ms.service: azure-ai-studio
7
7
ms.topic: how-to
@@ -15,23 +15,26 @@ ms.custom: references_regions
15
15
16
16
# Synthetic data generation in Azure AI Studio
17
17
18
-
In Azure AI Studio, you can use synthetic data generation to efficiently produce predictions for your datasets. In this article, you're introduced to the concept of synthetic data generation and how it can be used in machine learning.
18
+
In Azure AI Studio, you can use synthetic data generation to efficiently produce predictions for your datasets. This article introduces youto the concept of synthetic data generation and how you can use it in machine learning.
19
19
20
+
## What is synthetic data generation?
20
21
21
-
## Synthetic data generation
22
+
Synthetic data generation involves creating artificial data that mimics the statistical properties of real-world data. This data is generated through algorithms and machine learning techniques. You can use the data in various ways, such as computer simulations or modeling real-world events.
22
23
23
-
Synthetic data generation involves creating artificial data that mimics the statistical properties of real-world data. This data is generated using algorithms and machine learning techniques, and it can be used in various ways, such as computer simulations or by modeling real-world events.
24
+
## Benefits
24
25
25
-
In machine learning, synthetic data is valuable for several reasons:
26
+
In machine learning, synthetic data is valuable for:
26
27
27
-
**Data Augmentation:** It helps in expanding the size of training datasets, which is crucial for training robust machine learning models. This is especially useful when real-world data is scarce or expensive to obtain.
28
+
-**Data augmentation**: It helps in expanding the size of training datasets, which is crucial for training robust machine learning models. This expansion technique is especially useful when real-world data is scarce or expensive to obtain.
28
29
29
-
**Testing and Validation:** It allows for extensive testing and validation of machine learning models under various scenarios without the need for real-world data.
30
+
-**Testing and validation**: It allows for extensive testing and validation of machine learning models under various scenarios without the need for real-world data.
30
31
31
-
You can use the sample notebook available at this [link](https://aka.ms/meta-llama-3.1-datagen) to see how to generate Synthetic data.
32
+
## Sample notebook
32
33
33
-
## Next steps
34
-
-[What is Azure AI Studio?](../what-is-ai-studio.md)
35
-
-[Learn more about deploying Meta Llama models](../how-to/deploy-models-llama.md)
34
+
To see how to generate synthetic data, you can use the [sample notebook](https://aka.ms/meta-llama-3.1-datagen).
35
+
36
+
## Related content
36
37
37
-
-[Azure AI FAQ article](../faq.yml)
38
+
-[What is Azure AI Studio?](../what-is-ai-studio.md)
39
+
-[Deploy Meta Llama 3.1 models with Azure AI Studio](../how-to/deploy-models-llama.md)
0 commit comments