You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/monitor-azure-machine-learning.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -22,7 +22,7 @@ When you have critical applications and business processes relying on Azure reso
22
22
> *[Start, monitor, and cancel training runs](how-to-track-monitor-analyze-runs.md)
23
23
> *[Log metrics for training runs](how-to-log-view-metrics.md)
24
24
> *[Track experiments with MLflow](how-to-use-mlflow.md)
25
-
> *[Visualize runs with TensorBoard](how-to-monitor-tensorboard.md)
25
+
> *[Visualize runs with TensorBoard](v1/how-to-monitor-tensorboard.md)
26
26
>
27
27
> If you want to monitor information generated by models deployed to online endpoints, see [Monitor online endpoints](how-to-monitor-online-endpoints.md).
Copy file name to clipboardExpand all lines: articles/machine-learning/v1/concept-azure-machine-learning-architecture.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -253,7 +253,7 @@ Azure Machine Learning provides the following monitoring and logging capabilitie
253
253
*[Start, monitor, and cancel training runs](../how-to-track-monitor-analyze-runs.md)
254
254
*[Log metrics for training runs](../how-to-log-view-metrics.md)
255
255
*[Track experiments with MLflow](../how-to-use-mlflow.md)
256
-
*[Visualize runs with TensorBoard](../how-to-monitor-tensorboard.md)
256
+
*[Visualize runs with TensorBoard](how-to-monitor-tensorboard.md)
257
257
* For **Administrators**, you can monitor information about the workspace, related Azure resources, and events such as resource creation and deletion by using Azure Monitor. For more information, see [How to monitor Azure Machine Learning](../monitor-azure-machine-learning.md).
258
258
* For **DevOps** or **MLOps**, you can monitor information generated by models deployed as web services to identify problems with the deployments and gather data submitted to the service. For more information, see [Collect model data](how-to-enable-data-collection.md) and [Monitor with Application Insights](../how-to-enable-app-insights.md).
In this article, you learn how to view your experiment jobs and metrics in TensorBoard using [the `tensorboard` package](/python/api/azureml-tensorboard/) in the main Azure Machine Learning SDK. Once you've inspected your experiment jobs, you can better tune and retrain your machine learning models.
23
23
@@ -26,28 +26,28 @@ In this article, you learn how to view your experiment jobs and metrics in Tenso
26
26
How you launch TensorBoard with Azure Machine Learning experiments depends on the type of experiment:
27
27
+ If your experiment natively outputs log files that are consumable by TensorBoard, such as PyTorch, Chainer and TensorFlow experiments, then you can [launch TensorBoard directly](#launch-tensorboard) from experiment's job history.
28
28
29
-
+ For experiments that don't natively output TensorBoard consumable files, such as like Scikit-learn or Azure Machine Learning experiments, use [the `export_to_tensorboard()` method](#export) to export the job histories as TensorBoard logs and launch TensorBoard from there.
29
+
+ For experiments that don't natively output TensorBoard consumable files, such as like Scikit-learn or Azure Machine Learning experiments, use [the `export_to_tensorboard()` method](#option-2-export-history-as-log-to-view-in-tensorboard) to export the job histories as TensorBoard logs and launch TensorBoard from there.
30
30
31
31
> [!TIP]
32
-
> The information in this document is primarily for data scientists and developers who want to monitor the model training process. If you are an administrator interested in monitoring resource usage and events from Azure Machine learning, such as quotas, completed training jobs, or completed model deployments, see [Monitoring Azure Machine Learning](monitor-azure-machine-learning.md).
32
+
> The information in this document is primarily for data scientists and developers who want to monitor the model training process. If you are an administrator interested in monitoring resource usage and events from Azure Machine learning, such as quotas, completed training jobs, or completed model deployments, see [Monitoring Azure Machine Learning](../monitor-azure-machine-learning.md).
33
33
34
34
## Prerequisites
35
35
36
36
* To launch TensorBoard and view your experiment job histories, your experiments need to have previously enabled logging to track its metrics and performance.
37
37
* The code in this document can be run in either of the following environments:
38
38
* Azure Machine Learning compute instance - no downloads or installation necessary
39
-
* Complete the [Quickstart: Get started with Azure Machine Learning](quickstart-create-resources.md) to create a dedicated notebook server pre-loaded with the SDK and the sample repository.
39
+
* Complete the [Quickstart: Get started with Azure Machine Learning](../quickstart-create-resources.md) to create a dedicated notebook server pre-loaded with the SDK and the sample repository.
40
40
* In the samples folder on the notebook server, find two completed and expanded notebooks by navigating to these directories:
*[Install the Azure Machine Learning SDK](/python/api/overview/azure/ml/install) with the `tensorboard` extra
45
-
*[Create an Azure Machine Learning workspace](quickstart-create-resources.md).
46
-
*[Create a workspace configuration file](./v1/how-to-configure-environment-v1.md).
45
+
*[Create an Azure Machine Learning workspace](../quickstart-create-resources.md).
46
+
*[Create a workspace configuration file](how-to-configure-environment-v1.md).
47
47
48
48
## Option 1: Directly view job history in TensorBoard
49
49
50
-
This option works for experiments that natively outputs log files consumable by TensorBoard, such as PyTorch, Chainer, and TensorFlow experiments. If that is not the case of your experiment, use [the `export_to_tensorboard()` method](#export) instead.
50
+
This option works for experiments that natively outputs log files consumable by TensorBoard, such as PyTorch, Chainer, and TensorFlow experiments. If that is not the case of your experiment, use [the `export_to_tensorboard()` method](#option-2-export-history-as-log-to-view-in-tensorboard) instead.
51
51
52
52
The following example code uses the [MNIST demo experiment](https://raw.githubusercontent.com/tensorflow/tensorflow/r1.8/tensorflow/examples/tutorials/mnist/mnist_with_summaries.py) from TensorFlow's repository in a remote compute target, Azure Machine Learning Compute. Next, we will configure and start a job for training the TensorFlow model, and then
53
53
start TensorBoard against this TensorFlow experiment.
> While this example used TensorFlow, TensorBoard can be used as easily with PyTorch or Chainer. TensorFlow must be available on the machine running TensorBoard, but is not necessary on the machine doing PyTorch or Chainer computations.
185
185
186
186
187
-
<aname="export"></a>
188
-
189
187
## Option 2: Export history as log to view in TensorBoard
190
188
191
189
The following code sets up a sample experiment, begins the logging process using the Azure Machine Learning job history APIs, and exports the experiment job history into logs consumable by TensorBoard for visualization.
@@ -295,5 +293,5 @@ tb.stop()
295
293
296
294
In this how-to you, created two experiments and learned how to launch TensorBoard against their job histories to identify areas for potential tuning and retraining.
297
295
298
-
* If you are satisfied with your model, head over to our [How to deploy a model](./v1/how-to-deploy-and-where.md) article.
299
-
* Learn more about [hyperparameter tuning](how-to-tune-hyperparameters.md).
296
+
* If you are satisfied with your model, head over to our [How to deploy a model](how-to-deploy-and-where.md) article.
297
+
* Learn more about [hyperparameter tuning](../how-to-tune-hyperparameters.md).
0 commit comments