Skip to content

Commit be3ff7a

Browse files
Merge pull request #212119 from sdgilley/sdg-v2-glossary
Add glossary
2 parents f1309b7 + 58fb746 commit be3ff7a

File tree

2 files changed

+96
-0
lines changed

2 files changed

+96
-0
lines changed
Lines changed: 94 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,94 @@
1+
---
2+
title: Azure Machine Learning glossary
3+
description: Glossary of terms for the Azure Machine Learning platform.
4+
services: machine-learning
5+
ms.service: machine-learning
6+
ms.subservice: core
7+
ms.topic: overview
8+
author: frogglew
9+
ms.author: saoh
10+
ms.reviewer: sgilley
11+
ms.date: 09/21/2022
12+
---
13+
14+
# Azure Machine Learning glossary
15+
16+
The Azure Machine Learning glossary is a short dictionary of terminology for the Azure Machine Learning platform. For the general Azure terminology, see also:
17+
18+
* [Microsoft Azure glossary: A dictionary of cloud terminology on the Azure platform](../azure-glossary-cloud-terminology.md)
19+
* [Cloud computing terms](https://azure.microsoft.com/overview/cloud-computing-dictionary/) - General industry cloud terms.
20+
* [Azure fundamental concepts](/azure/cloud-adoption-framework/ready/considerations/fundamental-concepts) - Microsoft Cloud Adoption Framework for Azure.
21+
22+
## Component
23+
24+
An Azure Machine Learning [component](concept-component.md) is a self-contained piece of code that does one step in a machine learning pipeline. Components are the building blocks of advanced machine learning pipelines. Components can do tasks such as data processing, model training, model scoring, and so on. A component is analogous to a function - it has a name, parameters, expects input, and returns output.
25+
26+
27+
## Compute
28+
29+
A compute is a designated compute resource where you run your job or host your endpoint. Azure Machine learning supports the following types of compute:
30+
31+
* **Compute cluster** - a managed-compute infrastructure that allows you to easily create a cluster of CPU or GPU compute nodes in the cloud.
32+
* **Compute instance** - a fully configured and managed development environment in the cloud. You can use the instance as a training or inference compute for development and testing. It's similar to a virtual machine on the cloud.
33+
* **Inference cluster** - used to deploy trained machine learning models to Azure Kubernetes Service. You can create an Azure Kubernetes Service (AKS) cluster from your Azure ML workspace, or attach an existing AKS cluster.
34+
* **Attached compute** - You can attach your own compute resources to your workspace and use them for training and inference.
35+
36+
## Data
37+
38+
Azure Machine Learning allows you to work with different types of data:
39+
40+
* URIs (a location in local/cloud storage)
41+
* `uri_folder`
42+
* `uri_file`
43+
* Tables (a tabular data abstraction)
44+
* `mltable`
45+
* Primitives
46+
* `string`
47+
* `boolean`
48+
* `number`
49+
50+
For most scenarios, you'll use URIs (`uri_folder` and `uri_file`) - a location in storage that can be easily mapped to the filesystem of a compute node in a job by either mounting or downloading the storage to the node.
51+
52+
`mltable` is an abstraction for tabular data that is to be used for AutoML Jobs, Parallel Jobs, and some advanced scenarios. If you're just starting to use Azure Machine Learning and aren't using AutoML, we strongly encourage you to begin with URIs.
53+
54+
55+
## Datastore
56+
57+
Azure Machine Learning datastores securely keep the connection information to your data storage on Azure, so you don't have to code it in your scripts. You can register and create a datastore to easily connect to your storage account, and access the data in your underlying storage service. The CLI v2 and SDK v2 support the following types of cloud-based storage services:
58+
59+
* Azure Blob Container
60+
* Azure File Share
61+
* Azure Data Lake
62+
* Azure Data Lake Gen2
63+
64+
## Environment
65+
66+
Azure Machine Learning environments are an encapsulation of the environment where your machine learning task happens. They specify the software packages, environment variables, and software settings around your training and scoring scripts. The environments are managed and versioned entities within your Machine Learning workspace. Environments enable reproducible, auditable, and portable machine learning workflows across various computes.
67+
68+
### Types of environment
69+
70+
Azure ML supports two types of environments: curated and custom.
71+
72+
Curated environments are provided by Azure Machine Learning and are available in your workspace by default. Intended to be used as is, they contain collections of Python packages and settings to help you get started with various machine learning frameworks. These pre-created environments also allow for faster deployment time. For a full list, see the [curated environments article](resource-curated-environments.md).
73+
74+
In custom environments, you're responsible for setting up your environment. Make sure to install the packages and any other dependencies that your training or scoring script needs on the compute. Azure ML allows you to create your own environment using
75+
76+
* A docker image
77+
* A base docker image with a conda YAML to customize further
78+
* A docker build context
79+
80+
## Model
81+
82+
Azure machine learning models consist of the binary file(s) that represent a machine learning model and any corresponding metadata. Models can be created from a local or remote file or directory. For remote locations `https`, `wasbs` and `azureml` locations are supported. The created model will be tracked in the workspace under the specified name and version. Azure ML supports three types of storage format for models:
83+
84+
* `custom_model`
85+
* `mlflow_model`
86+
* `triton_model`
87+
88+
## Workspace
89+
90+
The workspace is the top-level resource for Azure Machine Learning, providing a centralized place to work with all the artifacts you create when you use Azure Machine Learning. The workspace keeps a history of all jobs, including logs, metrics, output, and a snapshot of your scripts. The workspace stores references to resources like datastores and compute. It also holds all assets like models, environments, components and data asset.
91+
92+
## Next steps
93+
94+
[What is Azure Machine Learning?](overview-what-is-azure-machine-learning.md)

articles/machine-learning/toc.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -7,6 +7,8 @@
77
- name: What is Azure Machine Learning?
88
displayName: AML, services, overview, introduction
99
href: overview-what-is-azure-machine-learning.md
10+
- name: Glossary
11+
href: azure-machine-learning-glossary.md
1012
- name: What is Azure Machine Learning studio?
1113
displayName: AML, studio, designer
1214
href: overview-what-is-machine-learning-studio.md

0 commit comments

Comments
 (0)