Skip to content

Commit 30b7595

Browse files
Merge branch 'main' of https://github.com/MicrosoftDocs/azure-ai-docs-pr into us316607c
2 parents 90d166c + 84adfbd commit 30b7595

20 files changed

+266
-285
lines changed

articles/ai-studio/concepts/deployments-overview.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ ms.service: azure-ai-studio
77
ms.custom:
88
- ignite-2023
99
- build-2024
10-
ms.topic: conceptual
10+
ms.topic: concept-article
1111
ms.date: 5/21/2024
1212
ms.reviewer: fasantia
1313
ms.author: mopeakande
@@ -96,4 +96,4 @@ Optimizing LLMs requires a careful consideration of several factors, including o
9696
- [Deploy Azure OpenAI models with Azure AI Studio](../how-to/deploy-models-openai.md)
9797
- [Deploy Meta Llama 3.1 models with Azure AI Studio](../how-to/deploy-models-llama.md)
9898
- [Deploy large language models with Azure AI Studio](../how-to/deploy-models-open.md)
99-
- [Azure AI Studio FAQ](../faq.yml)
99+
- [Model catalog and collections in Azure AI Studio](../how-to/model-catalog-overview.md)

articles/machine-learning/concept-automl-forecasting-methods.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -92,7 +92,7 @@ where $H$ is the forecast horizon, $l_{\text{max}}$ is the maximum lag order, an
9292

9393
$T_{\text{CV}} = 2H + (n_{\text{CV}} - 1) n_{\text{step}} + \text{max}(l_{\text{max}}, s_{\text{window}}) + 1$,
9494

95-
where $n_{\text{CV}}$ is the number of cross-validation folds and $n_{\text{step}}$ is the CV step size, or offset between CV folds. The basic logic behind these formulas is that you should always have at least a horizon of training observations for each time series, including some padding for lags and cross-validation splits. See [forecasting model selection](./concept-automl-forecasting-sweeping.md#model-selection) for more details on cross-validation for forecasting.
95+
where $n_{\text{CV}}$ is the number of cross-validation folds and $n_{\text{step}}$ is the CV step size, or offset between CV folds. The basic logic behind these formulas is that you should always have at least a horizon of training observations for each time series, including some padding for lags and cross-validation splits. See [forecasting model selection](./concept-automl-forecasting-sweeping.md#model-selection-in-automl) for more details on cross-validation for forecasting.
9696

9797
### Missing data handling
9898
AutoML's time series models require regularly spaced observations in time. Regularly spaced, here, includes cases like monthly or yearly observations where the number of days between observations may vary. Prior to modeling, AutoML must ensure there are no missing series values _and_ that the observations are regular. Hence, there are two missing data cases:

articles/machine-learning/concept-automl-forecasting-sweeping.md

Lines changed: 1 addition & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.service: azure-machine-learning
1010
ms.subservice: automl
1111
ms.topic: concept-article
1212
ms.custom: automl, sdkv1
13-
ms.date: 09/25/2024
13+
ms.date: 10/01/2024
1414

1515
#customer intent: As a developer, I want to use AutoML in Azure Machine Learning, so I can search for (sweep) and select forecasting models.
1616
---
@@ -19,8 +19,6 @@ ms.date: 09/25/2024
1919

2020
This article describes how automated machine learning (AutoML) in Azure Machine Learning searches for and selects forecasting models. If you're interested in learning more about the forecasting methodology in AutoML, see [Overview of forecasting methods in AutoML](concept-automl-forecasting-methods.md). To explore training examples for forecasting models in AutoML, see [Set up AutoML to train a time-series forecasting model with the SDK and CLI](how-to-auto-train-forecast.md).
2121

22-
<a name="model-sweeping"></a>
23-
2422
## Model sweeping in AutoML
2523

2624
The central task for AutoML is to train and evaluate several models and choose the best one with respect to the given primary metric. The word "model" in this case refers to both the model class, such as ARIMA or Random Forest, and the specific hyper-parameter settings that distinguish models within a class. For instance, ARIMA refers to a class of models that share a mathematical template and a set of statistical assumptions. Training, or _fitting_, an ARIMA model requires a list of positive integers that specify the precise mathematical form of the model. These values are the hyper-parameters. The models ARIMA(1, 0, 1) and ARIMA(2, 1, 2) have the same class, but different hyper-parameters. These definitions can be separately fit with the training data and evaluated against each other. AutoML searches, or _sweeps_, over different model classes and within classes by varying the hyper-parameters.
@@ -41,8 +39,6 @@ For a description of the different model types, see the [Forecasting models in A
4139

4240
The amount of sweeping by AutoML depends on the forecasting job configuration. You can specify the stopping criteria as a time limit or a limit on the number of trials, or the equivalent number of models. Early termination logic can be used in both cases to stop sweeping if the primary metric isn't improving.
4341

44-
<a name="model-selection"></a>
45-
4642
## Model selection in AutoML
4743

4844
AutoML follows a three-phase process to search for and select forecasting models:

articles/machine-learning/concept-azure-machine-learning-v2.md

Lines changed: 11 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.topic: conceptual
1010
ms.author: sgilley
1111
author: sdgilley
1212
ms.reviewer: balapv
13-
ms.date: 08/21/2024
13+
ms.date: 09/30/2024
1414
#Customer intent: As a data scientist, I want to understand the big picture about how Azure Machine Learning works.
1515
---
1616

@@ -41,7 +41,7 @@ This document provides a quick overview of these resources and assets.
4141
To use the Python SDK code examples in this article:
4242

4343
1. Install the [Python SDK v2](https://aka.ms/sdk-v2-install)
44-
2. Create a connection to your Azure Machine Learning subscription. The examples all rely on `ml_client`. To create a workspace, the connection does not need a workspace name, since you may not yet have one. All other examples in this article require that the workspace name is included in the connection.
44+
2. Create a connection to your Azure Machine Learning subscription. The examples all rely on `ml_client`. To create a workspace, the connection doesn't need a workspace name, since you may not yet have one. All other examples in this article require that the workspace name is included in the connection.
4545

4646
```python
4747
# import required libraries
@@ -82,7 +82,7 @@ Sign in to [Azure Machine Learning studio](https://ml.azure.com).
8282

8383
## Workspace
8484

85-
The workspace is the top-level resource for Azure Machine Learning, providing a centralized place to work with all the artifacts you create when you use Azure Machine Learning. The workspace keeps a history of all jobs, including logs, metrics, output, and a snapshot of your scripts. The workspace stores references to resources like datastores and compute. It also holds all assets like models, environments, components and data asset.
85+
The workspace is the top-level resource for Azure Machine Learning, providing a centralized place to work with all the artifacts you create when you use Azure Machine Learning. The workspace keeps a history of all jobs, including logs, metrics, output, and a snapshot of your scripts. The workspace stores references to resources like datastores and compute. It also holds all assets like models, environments, components, and data asset.
8686

8787
### Create a workspace
8888

@@ -133,7 +133,7 @@ A compute is a designated compute resource where you run your job or host your e
133133

134134
* **Compute instance** - a fully configured and managed development environment in the cloud. You can use the instance as a training or inference compute for development and testing. It's similar to a virtual machine on the cloud.
135135
* **Compute cluster** - a managed-compute infrastructure that allows you to easily create a cluster of CPU or GPU compute nodes in the cloud.
136-
* **Serverless compute** - a compute cluster you access on the fly. When you use serverless compute, you don't need to create your own cluster. All compute lifecycle management is offloaded to Azure Machine Learning.
136+
* **Serverless compute** - a compute cluster you access on the fly. When you use serverless compute, you don't need to create your own cluster. All compute lifecycle management is offloaded to Azure Machine Learning.
137137
* **Inference cluster** - used to deploy trained machine learning models to Azure Kubernetes Service. You can create an Azure Kubernetes Service (AKS) cluster from your Azure Machine Learning workspace, or attach an existing AKS cluster.
138138
* **Attached compute** - You can attach your own compute resources to your workspace and use them for training and inference.
139139

@@ -171,11 +171,10 @@ az ml compute create --file my_compute.yml
171171
```
172172

173173
For the content of the file, see [compute YAML examples](https://github.com/Azure/azureml-examples/tree/main/cli/resources/compute).
174-
.
175174

176175
### [Studio](#tab/azure-studio)
177176

178-
1. Select a workspace if you are not already in one.
177+
1. Select a workspace if you aren't already in one.
179178
1. From the left-hand menu, select **Compute**.
180179
1. On the top, select a tab to specify the type of compute you want to create.
181180
1. Select **New** to create the new compute.
@@ -235,7 +234,7 @@ For the content of the file, see [datastore YAML examples](https://github.com/Az
235234

236235
### [Studio](#tab/azure-studio)
237236

238-
1. Select a workspace if you are not already in one.
237+
1. Select a workspace if you aren't already in one.
239238
1. From the left-hand menu, select **Data**.
240239
1. On the top, select **Datastores**.
241240
1. Select **Create** to create a new datastore.
@@ -246,7 +245,7 @@ To learn more about using a datastore, see [Create and manage data assets](how-t
246245

247246
## Model
248247

249-
Azure Machine Learning models consist of the binary file(s) that represent a machine learning model and any corresponding metadata. Models can be created from a local or remote file or directory. For remote locations `https`, `wasbs` and `azureml` locations are supported. The created model will be tracked in the workspace under the specified name and version. Azure Machine Learning supports three types of storage format for models:
248+
Azure Machine Learning models consist of one or more binary files that represent a machine learning model and any corresponding metadata. Models can be created from a local or remote file or directory. For remote locations `https`, `wasbs` and `azureml` locations are supported. The created model is tracked in the workspace under the specified name and version. Azure Machine Learning supports three types of storage format for models:
250249

251250
* `custom_model`
252251
* `mlflow_model`
@@ -260,13 +259,13 @@ For more information on how to create models in the registry, see [Work with mod
260259

261260
## Environment
262261

263-
Azure Machine Learning environments are an encapsulation of the environment where your machine learning task happens. They specify the software packages, environment variables, and software settings around your training and scoring scripts. The environments are managed and versioned entities within your Machine Learning workspace. Environments enable reproducible, auditable, and portable machine learning workflows across a variety of computes.
262+
Azure Machine Learning environments are an encapsulation of the environment where your machine learning task happens. They specify the software packages, environment variables, and software settings around your training and scoring scripts. The environments are managed and versioned entities within your Machine Learning workspace. Environments enable reproducible, auditable, and portable machine learning workflows across various computes.
264263

265264
### Types of environment
266265

267266
Azure Machine Learning supports two types of environments: curated and custom.
268267

269-
Curated environments are provided by Azure Machine Learning and are available in your workspace by default. Intended to be used as is, they contain collections of Python packages and settings to help you get started with various machine learning frameworks. These pre-created environments also allow for faster deployment time. For a full list, see the [curated environments article](resource-curated-environments.md).
268+
Curated environments are provided by Azure Machine Learning and are available in your workspace by default. Intended to be used as is, they contain collections of Python packages and settings to help you get started with various machine learning frameworks. These precreated environments also allow for faster deployment time. For a full list, see the [curated environments article](resource-curated-environments.md).
270269

271270
In custom environments, you're responsible for setting up your environment and installing packages or any other dependencies that your training or scoring script needs on the compute. Azure Machine Learning allows you to create your own environment using
272271

@@ -291,7 +290,7 @@ For more information, see [environment YAML schema](reference-yaml-environment.m
291290

292291
### [Studio](#tab/azure-studio)
293292

294-
1. Select a workspace if you are not already in one.
293+
1. Select a workspace if you aren't already in one.
295294
1. From the left-hand menu, select **Environments**.
296295
1. On the top, select **Custom environments**.
297296
1. Select **Create** to create a new custom environment.
@@ -314,7 +313,7 @@ Azure Machine Learning allows you to work with different types of data:
314313
* `boolean`
315314
* `number`
316315

317-
For most scenarios, you'll use URIs (`uri_folder` and `uri_file`) - a location in storage that can be easily mapped to the filesystem of a compute node in a job by either mounting or downloading the storage to the node.
316+
For most scenarios, you use URIs (`uri_folder` and `uri_file`) - a location in storage that can be easily mapped to the filesystem of a compute node in a job by either mounting or downloading the storage to the node.
318317

319318
`mltable` is an abstraction for tabular data that is to be used for AutoML Jobs, Parallel Jobs, and some advanced scenarios. If you're just starting to use Azure Machine Learning and aren't using AutoML, we strongly encourage you to begin with URIs.
320319

articles/machine-learning/concept-component.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ ms.subservice: core
88
ms.author: lagayhar
99
author: lgayhardt
1010
ms.reviewer: keli19
11-
ms.date: 01/19/2024
11+
ms.date: 09/30/2024
1212
ms.topic: conceptual
1313
---
1414
# What is an Azure Machine Learning component?

articles/machine-learning/concept-fairness-ml.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ ms.topic: conceptual
99
ms.author: lagayhar
1010
author: lgayhardt
1111
ms.reviewer: mesameki
12-
ms.date: 02/21/2024
12+
ms.date: 09/30/2024
1313
ms.custom: responsible-ml, devx-track-python
1414
#Customer intent: As a data scientist, I want to learn about machine learning fairness and how to assess and mitigate unfairness in machine learning models.
1515
---

0 commit comments

Comments
 (0)