You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/concept-model-catalog.md
+14-13Lines changed: 14 additions & 13 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -32,13 +32,13 @@ Models are organized by Collections in the Model Catalog. There are three types
32
32
33
33
For information on Azure OpenAI models, refer to [Azure OpenAI Service](../ai-services/openai/overview.md).
34
34
35
-
For models **Curated by Azure AI** and **Open models from the Hugging Face hub**, some of these can be deployed as Real-time endpoints and some of these are available to be deployed using Pay-as-you-go billing (Models as a Service). These models can be discovered, compared, evaluated, finetuned (when supported) and deployed at scale and integrated into your Generative AI applications with enterprise-grade security and data governance.
35
+
For models **Curated by Azure AI** and **Open models from the Hugging Face hub**, some of these can be deployed as Real-time endpoints and some of these are available to be deployed using Pay-as-you-go billing (Models as a Service). These models can be discovered, compared, evaluated, fine-tuned (when supported) and deployed at scale and integrated into your Generative AI applications with enterprise-grade security and data governance.
36
36
37
-
***Discover:** Review model cards, try sample inference and browse code samples to evaluate, finetune, or deploy the model.
37
+
***Discover:** Review model cards, try sample inference and browse code samples to evaluate, fine-tune, or deploy the model.
38
38
***Compare:** Compare benchmarks across models and datasets available in the industry to assess which one meets your business scenario.
39
39
***Evaluate:** Evaluate if the model is suited for your specific workload by providing your own test data. Evaluation metrics make it easy to visualize how well the selected model performed in your scenario.
40
-
***Finetune:** Customize finetunable models using your own training data and pick the best model by comparing metrics across all your finetuning jobs. Built-in optimizations speed up finetuning and reduce the memory and compute needed for finetuning.
41
-
***Deploy:** Deploy pretrained models or finetuned models seamlessly for inference. Models that can be deployed to real-time endpoints can also be downloaded.
40
+
***Fine-tune:** Customize fine-tunable models using your own training data and pick the best model by comparing metrics across all your fine-tuning jobs. Built-in optimizations speed up fine-tuning and reduce the memory and compute needed for fine-tuning.
41
+
***Deploy:** Deploy pretrained models or fine-tuned models seamlessly for inference. Models that can be deployed to real-time endpoints can also be downloaded.
42
42
43
43
## Model deployment: Real-time endpoints and Models as a Service (Pay-as-you-go)
44
44
@@ -70,15 +70,15 @@ The capability to deploy models to real-time endpoints builds on platform capabi
70
70
71
71
### How are models made available for Real-time endpoints?
72
72
73
-
The models are made available through [Azure Machine Learning registries](concept-machine-learning-registries-mlops.md) that enable ML first approach to [hosting and distributing Machine Learning assets](how-to-share-models-pipelines-across-workspaces-with-registries.md) such as model weights, container runtimes for running the models, pipelines for evaluating and finetuning the models and datasets for benchmarks and samples. These ML Registries build on top of highly scalable and enterprise ready infrastructure that:
73
+
The models are made available through [Azure Machine Learning registries](concept-machine-learning-registries-mlops.md) that enable ML first approach to [hosting and distributing Machine Learning assets](how-to-share-models-pipelines-across-workspaces-with-registries.md) such as model weights, container runtimes for running the models, pipelines for evaluating and fine-tuning the models and datasets for benchmarks and samples. These ML Registries build on top of highly scalable and enterprise ready infrastructure that:
74
74
75
75
* Delivers low latency access model artifacts to all Azure regions with built-in geo-replication.
76
76
77
77
* Supports enterprise security requirements as [limiting access to models with Azure Policy](how-to-regulate-registry-deployments.md) and [secure deployment with managed virtual networks](how-to-network-isolation-model-catalog.md).
78
78
79
-
### Evaluate and finetune models deployed as Real-time endpoints
79
+
### Evaluate and fine-tune models deployed as Real-time endpoints
80
80
81
-
You can evaluate and finetune in the “Curated by Azure AI” collection in Azure Machine Learning using Azure Machine Learning Pipelines. You can either choose to bring your own evaluation and finetuning code and just access model weights or use Azure Machine Learning components that offer built-in evaluation and finetuning capabilities. To learn more, [follow this link](how-to-use-foundation-models.md).
81
+
You can evaluate and fine-tune in the “Curated by Azure AI” collection in Azure Machine Learning using Azure Machine Learning Pipelines. You can either choose to bring your own evaluation and fine-tuning code and just access model weights or use Azure Machine Learning components that offer built-in evaluation and fine-tuning capabilities. To learn more, [follow this link](how-to-use-foundation-models.md).
82
82
83
83
### Deploy models for inference as Real-time endpoints
84
84
@@ -99,7 +99,7 @@ Prompt flow offers capabilities for prototyping, experimenting, iterating, and d
99
99
100
100
### Work with models not in the Model Catalog
101
101
102
-
For models not available in the Model Catalog, Azure Machine Learning provides an open and extensible platform for working with models of your choice. You can bring a model with any framework or runtime using Azure Machine Learning’s open and extensible platform capabilities such as [Azure Machine Learning environments](concept-environments.md) for containers that can package frameworks and runtimes and [Azure Machine Learning pipelines](concept-ml-pipelines.md) for code to evaluate or finetune the models. Refer to this notebook for sample reference to import models and work with the [built-in runtimes and pipelines](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/system/import/import_model_into_registry.ipynb).
102
+
For models not available in the Model Catalog, Azure Machine Learning provides an open and extensible platform for working with models of your choice. You can bring a model with any framework or runtime using Azure Machine Learning’s open and extensible platform capabilities such as [Azure Machine Learning environments](concept-environments.md) for containers that can package frameworks and runtimes and [Azure Machine Learning pipelines](concept-ml-pipelines.md) for code to evaluate or fine-tune the models. Refer to this notebook for sample reference to import models and work with the [built-in runtimes and pipelines](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/system/import/import_model_into_registry.ipynb).
103
103
104
104
105
105
## Models as a Service (Pay-as-you-go)
@@ -120,17 +120,17 @@ The discovery, subscription, and consumption experience for models deployed via
120
120
121
121
Deploying a model through MaaS allows users to get access to ready to use inference APIs without the need to configure infrastructure or provision GPUs, saving engineering time and resources. These APIs can be integrated with several LLM tools and usage is billed as described in the previous section.
122
122
123
-
### Finetune models through MaaS with Pay-as-you-go
123
+
### Fine-tune models through MaaS with Pay-as-you-go
124
124
125
-
For models that are available through MaaS and support finetuning, users can take advantage of hosted finetuning with pay-as-you-go billing to tailor the models using data they provide. For more information, see [finetune a Llama 2 model](../ai-studio/how-to/fine-tune-model-llama.md) in Azure AI Studio.
125
+
For models that are available through MaaS and support fine-tuning, users can take advantage of hosted fine-tuning with pay-as-you-go billing to tailor the models using data they provide. For more information, see [fine-tune a Llama 2 model](../ai-studio/how-to/fine-tune-model-llama.md) in Azure AI Studio.
126
126
127
127
### RAG with models deployed through MaaS
128
128
129
129
Azure AI Studio enables users to make use of Vector Indexes and Retrieval Augmented Generation. Models that can be deployed via MaaS can be used to generate embeddings and inferencing based on custom data to generate answers specific to their use case. For more information, see [Retrieval augmented generation and indexes](concept-retrieval-augmented-generation.md).
130
130
131
131
### Regional availability of offers and models
132
132
133
-
Pay-as-you-go deployment is available only to users whose Azure subscription belongs to a billing account in a country where the model provider has made the offer available (see “offer availability region” in the table in the next section). If the offer is available in the relevant region, the user then must have a Workspace in the Azure region where the model is available for deployment or finetuning, as applicable (see “Workspace region” columns in the table below).
133
+
Pay-as-you-go deployment is available only to users whose Azure subscription belongs to a billing account in a country where the model provider has made the offer available (see “offer availability region” in the table in the next section). If the offer is available in the relevant region, the user then must have a Workspace in the Azure region where the model is available for deployment or fine-tuning, as applicable (see “Workspace region” columns in the table below).
134
134
135
135
Model | Offer availability region | Workspace Region for Deployment | Workspace Region for Finetuning
136
136
--|--|--|--
@@ -147,6 +147,7 @@ Azure Machine Learning implements a default configuration of [Azure AI Content S
147
147
148
148
## Learn more
149
149
150
-
* Learn [how to use foundation Models in Azure Machine Learning](./how-to-use-foundation-models.md) for finetuning, evaluation, and deployment using Azure Machine Learning studio UI or code based methods.
150
+
* Learn [how to use foundation Models in Azure Machine Learning](./how-to-use-foundation-models.md) for fine-tuning, evaluation, and deployment using Azure Machine Learning studio UI or code based methods.
151
151
* Explore the [Model Catalog in Azure Machine Learning studio](https://ml.azure.com/model/catalog). You need an [Azure Machine Learning workspace](./quickstart-create-resources.md) to explore the catalog.
152
-
*[Evaluate, finetune, and deploy models](./how-to-use-foundation-models.md) curated by Azure Machine Learning.
152
+
*[Evaluate, fine-tune, and deploy models](./how-to-use-foundation-models.md) curated by Azure Machine Learning.
0 commit comments