Skip to content

Commit a7d8087

Browse files
committed
Updates to small and large LM article
1 parent beec9c0 commit a7d8087

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

articles/aks/concepts-ai-ml-language-models.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@ Language models are powerful machine learning models used for natural language p
1717

1818
*Conventional language models* have been used in supervised settings for research purposes where the models are trained on well-labeled text datasets for specific tasks. *Pre-trained language models* offer an accessible way to get started with AI and have become more widely used in recent years. These models are trained on large-scale text corpora from the internet using deep neural networks and can be fine-tuned on smaller datasets for specific tasks.
1919

20-
The size of a language model is determined by the its number of parameters, or *weights*, that determine how the model processes input data and generates output. Parameters are learned during the training process by adjusting the weights within layers of the model to minimize the difference between the model's predictions and the actual data. The more parameters a model has, the more complex and expressive it is, but also the more computationally expensive it is to train and use.
20+
The size of a language model is determined by its number of parameters, or *weights*, that determine how the model processes input data and generates output. Parameters are learned during the training process by adjusting the weights within layers of the model to minimize the difference between the model's predictions and the actual data. The more parameters a model has, the more complex and expressive it is, but also the more computationally expensive it is to train and use.
2121

2222
In general, **small language models** have *fewer than 10 billion parameters*, and **large language models** have *more than 10 billion parameters*. For example, the new Microsoft Phi-3 model family has three versions with different sizes: mini (3.8 billion parameters), small (7 billion parameters), and medium (14 billion parameters).
2323

@@ -37,7 +37,7 @@ Small language models are a good choice if you want models that are:
3737
Small language models are suitable for use cases that require:
3838

3939
* **Limited data or resources**, and you need a quick and simple solution.
40-
* **Well-defined or narrow tasks**, and you don't need a lot of creativity in the output.
40+
* **Well-defined or narrow tasks**, and you don't need much creativity in the output.
4141
* **High-precision and low-recall tasks**, and you value accuracy and quality over coverage and quantity.
4242
* **Sensitive or regulated tasks**, and you need to ensure the transparency and accountability of the model.
4343

0 commit comments

Comments
 (0)