Skip to content

Commit bcf4b3a

Browse files
authored
Merge pull request #104311 from lobrien/1486482-Retrain-Models
When and how to retrain models
2 parents 78c7bbd + 19d60e7 commit bcf4b3a

File tree

1 file changed

+13
-0
lines changed

1 file changed

+13
-0
lines changed

articles/machine-learning/concept-model-management-and-deployment.md

Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -134,6 +134,19 @@ This information helps you understand how your model is being used. The collecte
134134

135135
For more information, see [How to enable model data collection](how-to-enable-data-collection.md).
136136

137+
## Retrain your model on new data
138+
139+
Often, you'll want to update your model, or even retrain it from scratch, as you receive new information. Sometimes, receiving new data is an expected part of the domain. Other times, as discussed in [Detect data drift (preview) on datasets](how-to-monitor-datasets.md), model performance can degrade in the face of such things as changes to a particular sensor, natural data changes such as seasonal effects, or features shifting in their relation to other features.
140+
141+
There is no universal answer to "How do I know if I should retrain?" but Azure ML event and monitoring tools previously discussed are good starting points for automation. Once you have decided to retrain, you should:
142+
143+
- Preprocess your data using a repeatable, automated process
144+
- Train your new model
145+
- Compare the outputs of your new model to those of your old model
146+
- Use predefined criteria to choose whether to replace your old model
147+
148+
A theme of the above steps is that your retraining should be automated, not ad hoc. [Azure Machine Learning pipelines](concept-ml-pipelines.md) are a good answer for creating workflows relating to data preparation, training, validation, and deployment. Read [Retrain models with Azure Machine Learning designer (preview)](how-to-retrain-designer.md) to see how pipelines and the Azure Machine Learning designer fit into a retraining scenario.
149+
137150
## Automate the ML lifecycle
138151

139152
You can use GitHub and Azure Pipelines to create a continuous integration process that trains a model. In a typical scenario, when a Data Scientist checks a change into the Git repo for a project, the Azure Pipeline will start a training run. The results of the run can then be inspected to see the performance characteristics of the trained model. You can also create a pipeline that deploys the model as a web service.

0 commit comments

Comments
 (0)