Commit 80b5def

Small and large LMs on AKS
1 parent fd92d96 commit 80b5def

1 file changed

+16
-9
lines changed

articles/aks/concepts-ai-ml-language-models.md

Lines changed: 16 additions & 9 deletions
@@ -2,23 +2,22 @@
22
title: Concepts - Small and large language models
33
description: Learn about small and large language models, including when and how you can use them with your Azure Kubernetes Service (AKS) AI and machine learning workloads.
44
ms.topic: conceptual
5-
ms.date: 06/17/2024
5+
ms.date: 06/19/2024
66
author: schaffererin
77
ms.author: schaffererin
88
---
99

1010
# Concepts - Small and large language models
1111

12-
In this article, you learn about small and large language models, including when to use them and how you can use them with your Azure Kubernetes Service (AKS) AI and machine learning workloads.
13-
12+
In this article, you learn about small and large language models, including when to use them and how you can use them with your AI and machine learning workloads on Azure Kubernetes Service (AKS).
1413

1514
## What are language models?
1615

1716
Language models are powerful machine learning models used for natural language processing (NLP) tasks, such as text generation and sentiment analysis. These models represent natural language based on the probability of words or sequences of words occurring in a given context.
1817

19-
*Conventional language models* are used in supervised settings where they're trained on well-labeled text datasets for specific tasks. *Pretrained language models* are trained on large-scale text corpora from the internet using deep neural networks and can be fine-tuned on smaller datasets for specific tasks.
18+
*Conventional language models* have been used in supervised settings for research purposes, where the models are trained on well-labeled text datasets for specific tasks. *Pre-trained language models* have become more widely used in recent years. These models are trained on large-scale text corpora from the internet using deep neural networks and can be fine-tuned on smaller datasets for specific tasks.
2019

21-
The size of a language model is determined by the number of parameters it has, which is the number of weights that determine how the model processes input data and generates output. Parameters are learned during the training process by adjusting the weights to minimize the difference between the model's predictions and the actual data. The more parameters a model has, the more complex and expressive it is, but also the more computationally expensive it is to train and use.
20+
The size of a language model is determined by its number of parameters, or *weights*, which determine how the model processes input data and generates output. Parameters are learned during the training process by adjusting the weights within layers of the model to minimize the difference between the model's predictions and the actual data. The more parameters a model has, the more complex and expressive it is, but also the more computationally expensive it is to train and use.
2221

2322
In general, **small language models** have *fewer than 100 million parameters*, and **large language models** have *more than 100 million parameters*. For example, GPT-2 has four versions with different sizes: small (124 million parameters), medium (355 million parameters), large (774 million parameters), and extra-large (1.5 billion parameters).
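
As a rough illustration of the rule of thumb above, the following minimal sketch counts the parameters of a toy network and classifies models by the 100 million parameter threshold. The helper names (`dense_layer_params`, `classify`) and the toy layer sizes are hypothetical and not part of the article. Treating a model with exactly 100 million parameters as large is also an assumption, because the article doesn't define that boundary case.

```python
# Rule-of-thumb threshold from the article: 100 million parameters.
SMALL_MODEL_PARAM_LIMIT = 100_000_000

# GPT-2 parameter counts as listed in the article.
GPT2_VARIANTS = {
    "small": 124_000_000,
    "medium": 355_000_000,
    "large": 774_000_000,
    "extra-large": 1_500_000_000,
}


def dense_layer_params(inputs: int, outputs: int) -> int:
    """Count the weights and biases of one fully connected layer (hypothetical helper)."""
    return inputs * outputs + outputs


def classify(param_count: int) -> str:
    """Return 'small' below the threshold, otherwise 'large'.

    Assumption: a model with exactly 100 million parameters counts as 'large'.
    """
    return "small" if param_count < SMALL_MODEL_PARAM_LIMIT else "large"


if __name__ == "__main__":
    # A toy two-layer network (sizes chosen only for illustration).
    toy_params = dense_layer_params(512, 2048) + dense_layer_params(2048, 512)
    print(f"toy network: {toy_params:,} parameters -> {classify(toy_params)}")

    for name, count in GPT2_VARIANTS.items():
        print(f"GPT-2 {name}: {count:,} parameters -> {classify(count)}")
```

Under this threshold, every GPT-2 version listed above, including the one named "small", falls into the large language model category.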
2423

@@ -77,12 +76,20 @@ The following table lists some popular, high-performance large language models:
7776
| | | |
7877
| | | |
7978

80-
## X
79+
## Use language models with AKS
8180

82-
## Y
81+
The Kubernetes AI Toolchain Operator (KAITO) is a Kubernetes operator that automates AI and machine learning model deployments in Kubernetes clusters. The KAITO add-on for AKS simplifies the experience of running open-source (OSS) AI models on your AKS clusters. The add-on automatically provisions the necessary GPU nodes and sets up the associated inference server as an endpoint server for your AI models.
8382

84-
## Z
83+
For more information, see [Deploy an AI model on AKS with the AI toolchain operator][ai-toolchain-operator].
8584

8685
## Next steps
8786

88-
XYZ
87+
To learn more about AI and machine learning with AKS, see the following articles:
88+
89+
* [Deploy an application that uses OpenAI on AKS][openai-aks]
90+
* [Build and deploy data and machine learning pipelines with Flyte on AKS][flyte-aks]
91+
92+
<!-- LINKS -->
93+
[ai-toolchain-operator]: ./ai-toolchain-operator.md
94+
[openai-aks]: ./open-ai-quickstart.md
95+
[flyte-aks]: ./use-flyte.md
