Merge pull request #204584 from santiagxf/santiagxf/aml-mlflow-runs-fix

v-dirichards · web-flow · commit 2b2a8220fd0b · 2022-07-13T15:00:39.000-05:00
AML mlflow runs fix
diff --git a/articles/machine-learning/concept-mlflow-models.md b/articles/machine-learning/concept-mlflow-models.md
@@ -157,10 +157,19 @@ name: mlflow-env
 
 ### Model's predict function
 
-All MLflow models contain a `predict` function. This function is the one that is called when a model is deployed using a no-code-deployment experience. What the `predict` function returns (classes, probabilities, a forecast, etc.) depend on the framework (i.e. flavor) used for training. Read the documentation of each flavor to know what they return.
+All MLflow models contain a `predict` function. **This function is the one that is called when a model is deployed using a no-code-deployment experience**. What the `predict` function returns (classes, probabilities, a forecast, etc.) depend on the framework (i.e. flavor) used for training. Read the documentation of each flavor to know what they return.
 
 In same cases, you may need to customize this function to change the way inference is executed. On those cases, you will need to [log models with a different behavior in the predict method](how-to-log-mlflow-models.md#logging-models-with-a-different-behavior-in-the-predict-method) or [log a custom model's flavor](how-to-log-mlflow-models.md#logging-custom-models).
 
+## Loading MLflow models back
+
+Models created as MLflow models can be loaded back directly from the run where they were logged, from the file system where they are saved or from the model registry where they are registered. MLflow provides a consistent way to load those models regardless of the location.
+
+There are two workflows available for loading models:
+
+* **Loading back the same object and types that were logged:**: You can load models using MLflow SDK and obtain an instance of the model with types belonging to the training library. For instance, an ONNX model will return a `ModelProto` while a decision tree trained with Scikit-Learn model will return a `DecisionTreeClassifier` object. Use `mlflow.<flavor>.load_model()` to do so.
+* **Loading back a model for running inference:** You can load models using MLflow SDK and obtain a wrapper where MLflow warranties there will be a `predict` function. It doesn't matter which flavor you are using, every MLflow model needs to implement this contract. Furthermore, MLflow warranties that this function can be called using arguments of type `pandas.DataFrame`, `numpy.ndarray` or `dict[strin, numpyndarray]` (depending on the signature of the model). MLflow handles the type conversion to the input type the model actually expects. Use `mlflow.pyfunc.load_model()` to do so.
+
 ## Start logging models
 
 We recommend starting taking advantage of MLflow models in Azure Machine Learning. There are different ways to start using the model's concept with MLflow. Read [How to log MLFlow models](how-to-log-mlflow-models.md) to a comprehensive guide.
diff --git a/articles/machine-learning/how-to-log-mlflow-models.md b/articles/machine-learning/how-to-log-mlflow-models.md
@@ -42,14 +42,13 @@ import mlflow
 from xgboost import XGBClassifier
 from sklearn.metrics import accuracy_score
 
-with mlflow.start_run():
-    mlflow.autolog()
+mlflow.autolog()
 
-    model = XGBClassifier(use_label_encoder=False, eval_metric="logloss")
-    model.fit(X_train, y_train, eval_set=[(X_test, y_test)], verbose=False)
-    y_pred = model.predict(X_test)
+model = XGBClassifier(use_label_encoder=False, eval_metric="logloss")
+model.fit(X_train, y_train, eval_set=[(X_test, y_test)], verbose=False)
 
-    accuracy = accuracy_score(y_test, y_pred)
+y_pred = model.predict(X_test)
+accuracy = accuracy_score(y_test, y_pred)
 ```
 
 > [!TIP]
@@ -76,34 +75,33 @@ from sklearn.metrics import accuracy_score
 from mlflow.models import infer_signature
 from mlflow.utils.environment import _mlflow_conda_env
 
-with mlflow.start_run():
-    mlflow.autolog(log_models=False)
-
-    model = XGBClassifier(use_label_encoder=False, eval_metric="logloss")
-    model.fit(X_train, y_train, eval_set=[(X_test, y_test)], verbose=False)
-    y_pred = model.predict(X_test)
-
-    accuracy = accuracy_score(y_test, y_pred)
-    
-    # Signature
-    signature = infer_signature(X_test, y_test)
-    
-    # Conda environment
-    custom_env =_mlflow_conda_env(
-        additional_conda_deps=None,
-        additional_pip_deps=["xgboost==1.5.2"],
-        additional_conda_channels=None,
-    )
-    
-    # Sample
-    input_example = X_train.sample(n=1)
-
-    # Log the model manually
-    mlflow.xgboost.log_model(model, 
-                             artifact_path="classifier", 
-                             conda_env=custom_env,
-                             signature=signature,
-                             input_example=input_example)
+mlflow.autolog(log_models=False)
+
+model = XGBClassifier(use_label_encoder=False, eval_metric="logloss")
+model.fit(X_train, y_train, eval_set=[(X_test, y_test)], verbose=False)
+y_pred = model.predict(X_test)
+
+accuracy = accuracy_score(y_test, y_pred)
+
+# Signature
+signature = infer_signature(X_test, y_test)
+
+# Conda environment
+custom_env =_mlflow_conda_env(
+    additional_conda_deps=None,
+    additional_pip_deps=["xgboost==1.5.2"],
+    additional_conda_channels=None,
+)
+
+# Sample
+input_example = X_train.sample(n=1)
+
+# Log the model manually
+mlflow.xgboost.log_model(model, 
+                         artifact_path="classifier", 
+                         conda_env=custom_env,
+                         signature=signature,
+                         input_example=input_example)
 ```
 
 > [!NOTE]
@@ -166,20 +164,19 @@ from xgboost import XGBClassifier
 from sklearn.metrics import accuracy_score
 from mlflow.models import infer_signature
 
-with mlflow.start_run():
-    mlflow.xgboost.autolog(log_models=False)
+mlflow.xgboost.autolog(log_models=False)
 
-    model = XGBClassifier(use_label_encoder=False, eval_metric="logloss")
-    model.fit(X_train, y_train, eval_set=[(X_test, y_test)], verbose=False)
-    y_probs = model.predict_proba(X_test)
+model = XGBClassifier(use_label_encoder=False, eval_metric="logloss")
+model.fit(X_train, y_train, eval_set=[(X_test, y_test)], verbose=False)
+y_probs = model.predict_proba(X_test)
 
-    accuracy = accuracy_score(y_test, y_probs.argmax(axis=1))
-    mlflow.log_metric("accuracy", accuracy)
+accuracy = accuracy_score(y_test, y_probs.argmax(axis=1))
+mlflow.log_metric("accuracy", accuracy)
 
-    signature = infer_signature(X_test, y_probs)
-    mlflow.pyfunc.log_model("classifier", 
-                            python_model=ModelWrapper(model),
-                            signature=signature)
+signature = infer_signature(X_test, y_probs)
+mlflow.pyfunc.log_model("classifier", 
+                        python_model=ModelWrapper(model),
+                        signature=signature)
 ```
 
 > [!TIP]
@@ -248,33 +245,32 @@ from sklearn.preprocessing import OrdinalEncoder
 from sklearn.metrics import accuracy_score
 from mlflow.models import infer_signature
 
-with mlflow.start_run():
-    mlflow.xgboost.autolog(log_models=False)
-    
-    encoder = OrdinalEncoder(handle_unknown='ignore')
-    X_train['thal'] = enc.fit_transform(X_train['thal'])
-    X_test['thal'] = enc.transform(X_test['thal'])
-    
-    model = XGBClassifier(use_label_encoder=False, eval_metric="logloss")
-    model.fit(X_train, y_train, eval_set=[(X_test, y_test)], verbose=False)
-    y_probs = model.predict_proba(X_test)
-
-    accuracy = accuracy_score(y_test, y_probs.argmax(axis=1))
-    mlflow.log_metric("accuracy", accuracy)
-
-    encoder_path = 'encoder.pkl'
-    joblib.dump(encoder, encoder_path)
-    model_path = "xgb.model"
-    model.save_model(model_path)
-
-    signature = infer_signature(X, y_probs)
-    mlflow.pyfunc.log_model("classifier", 
-                            python_model=ModelWrapper(),
-                            artifacts={ 
-                                'encoder': encoder_path,
-                                'model': model_path 
-                            },
-                            signature=signature)
+mlflow.xgboost.autolog(log_models=False)
+
+encoder = OrdinalEncoder(handle_unknown='ignore')
+X_train['thal'] = enc.fit_transform(X_train['thal'])
+X_test['thal'] = enc.transform(X_test['thal'])
+
+model = XGBClassifier(use_label_encoder=False, eval_metric="logloss")
+model.fit(X_train, y_train, eval_set=[(X_test, y_test)], verbose=False)
+y_probs = model.predict_proba(X_test)
+
+accuracy = accuracy_score(y_test, y_probs.argmax(axis=1))
+mlflow.log_metric("accuracy", accuracy)
+
+encoder_path = 'encoder.pkl'
+joblib.dump(encoder, encoder_path)
+model_path = "xgb.model"
+model.save_model(model_path)
+
+signature = infer_signature(X, y_probs)
+mlflow.pyfunc.log_model("classifier", 
+                        python_model=ModelWrapper(),
+                        artifacts={ 
+                            'encoder': encoder_path,
+                            'model': model_path 
+                        },
+                        signature=signature)
 ```
 
 # [Using a model loader](#tab/loader)
@@ -340,25 +336,24 @@ from xgboost import XGBClassifier
 from sklearn.metrics import accuracy_score
 from mlflow.models import infer_signature
 
-with mlflow.start_run():
-    mlflow.xgboost.autolog(log_models=False)
+mlflow.xgboost.autolog(log_models=False)
 
-    model = XGBClassifier(use_label_encoder=False, eval_metric="logloss")
-    model.fit(X_train, y_train, eval_set=[(X_test, y_test)], verbose=False)
-    y_probs = model.predict_proba(X_test)
+model = XGBClassifier(use_label_encoder=False, eval_metric="logloss")
+model.fit(X_train, y_train, eval_set=[(X_test, y_test)], verbose=False)
+y_probs = model.predict_proba(X_test)
 
-    accuracy = accuracy_score(y_test, y_probs.argmax(axis=1))
-    mlflow.log_metric("accuracy", accuracy)
+accuracy = accuracy_score(y_test, y_probs.argmax(axis=1))
+mlflow.log_metric("accuracy", accuracy)
 
-    model_path = "xgb.model"
-    model.save_model(model_path)
+model_path = "xgb.model"
+model.save_model(model_path)
 
-    signature = infer_signature(X_test, y_probs)
-    mlflow.pyfunc.log_model("classifier",
-                            data_path=model_path,
-                            code_path=["loader_module.py"],
-                            loader_module="loader_module",
-                            signature=signature)
+signature = infer_signature(X_test, y_probs)
+mlflow.pyfunc.log_model("classifier",
+                        data_path=model_path,
+                        code_path=["loader_module.py"],
+                        loader_module="loader_module",
+                        signature=signature)
 ```
 
 ---
diff --git a/articles/machine-learning/how-to-manage-models-mlflow.md b/articles/machine-learning/how-to-manage-models-mlflow.md
@@ -90,7 +90,7 @@ mlflow.register_model(f"file://{model_local_path}", "local-model-test")
 > [!NOTE]
 > Notice how the model URI schema `file:/` requires absolute paths.
 
-## Querying models
+## Querying model registries
 
 ### Querying all the models in the registry
 
@@ -123,6 +123,17 @@ If you need a specific version of the model, you can indicate so:
 client.get_model_version(model_name, version=2)
 ```
 
+## Loading models from registry
+
+You can load models directly from the registry to restore the models objects that were logged. Use the functions `mlflow.<flavor>.load_model()` or `mlflow.pyfunc.load_model()` indicating the URI of the model you want to load using the following syntax:
+
+* `models:/<model-name>/latest`, to load the last version of the model.
+* `models:/<model-name>/<version-number>`, to load a specific version of the model.
+* `models:/<model-name>/<stage-name>`, to load a specific version in a given stage for a model. View [Model stages](#model-stages) for details.
+
+> [!TIP]
+> For learning about the difference between `mlflow.<flavor>.load_model()` and `mlflow.pyfunc.load_model()`, view [Loading MLflow models back](concept-mlflow-models.md#loading-mlflow-models-back) article.
+
 ## Model stages
 
 MLflow supports model's stages to manage model's lifecycle. Model's version can transition from one stage to another. Stages are assigned to a model's version (instead of models) which means that a given model can have multiple versions on different stages.
diff --git a/articles/machine-learning/how-to-track-experiments-mlflow.md b/articles/machine-learning/how-to-track-experiments-mlflow.md
@@ -251,6 +251,9 @@ MLflow also allows you to both operations at once and download and load the mode
   model = mlflow.xgboost.load_model(f"runs:/{last_run.info.run_id}/{artifact_path}")
   ```
 
+> [!TIP]
+> You can also load models from the registry using MLflow. View [loading MLflow models with MLflow](how-to-manage-models-mlflow.md#loading-models-from-registry) for details.
+
 ## Getting child (nested) runs
 
 MLflow supports the concept of child (nested) runs. They are useful when you need to spin off training routines requiring being tracked independently from the main training process. This is the typical case of hyper-parameter tuning for instance. You can query all the child runs of a specific run using the property tag `mlflow.parentRunId`, which contains the run ID of the parent run.
diff --git a/articles/machine-learning/toc.yml b/articles/machine-learning/toc.yml
@@ -414,28 +414,28 @@
             href: how-to-migrate-from-estimators-to-scriptrunconfig.md
           - name: Reinforcement learning
             href: how-to-use-reinforcement-learning.md
-          - name: Track & monitor training
-            items:
-              - name: Monitor training jobs
-                displayName: cancel, fail, status, child run
-                href: how-to-track-monitor-analyze-runs.md
-              - name: Log & view metrics, parameters and files
-                displayName: troubleshoot, log, files, tracing, metrics
-                href: how-to-log-view-metrics.md
-              - name: Log MLflow models
-                href: how-to-log-mlflow-models.md
-              - name: Visualize runs with TensorBoard
-                displayName: log, monitor, metrics
-                href: how-to-monitor-tensorboard.md
-              - name: Migrate from SDK v1 logging to MLflow
-                href: reference-migrate-sdk-v1-mlflow-tracking.md
           - name: Use Key Vault when training
             displayName: secrets keyvault
             href: how-to-use-secrets-in-runs.md
         - name: Train with the CLI v2
           href: how-to-train-cli.md
         - name: Train with the REST API
           href: how-to-train-with-rest.md
+        - name: Track & monitor training
+          items:
+            - name: Monitor training jobs
+              displayName: cancel, fail, status, child run
+              href: how-to-track-monitor-analyze-runs.md
+            - name: Log & view metrics, parameters and files
+              displayName: troubleshoot, log, files, tracing, metrics
+              href: how-to-log-view-metrics.md
+            - name: Log MLflow models
+              href: how-to-log-mlflow-models.md
+            - name: Visualize runs with TensorBoard
+              displayName: log, monitor, metrics
+              href: how-to-monitor-tensorboard.md
+            - name: Migrate from SDK v1 logging to MLflow
+              href: reference-migrate-sdk-v1-mlflow-tracking.md
         - name: Train and track with MLflow
           items:
             - name: Track experiments with MLflow