You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Automated machine learning supports data that resides on your local desktop or in the cloud such as Azure Blob Storage. The data can be read into scikit-learn supported data formats. You can read the data into:
68
+
Automated machine learning supports data that resides on your local desktop or in the cloud such as Azure Blob Storage. The data can be read into a Pandas DataFrame or an Azure Machine Learning dataset. The following code examples demonstrate how to store the data in these formats. [Learn more about datatsets](https://github.com/MicrosoftDocs/azure-docs-pr/pull/how-to-create-register-datasets.md).
69
69
70
-
*Numpy arrays X (features) and y (target variable, also known as label)
70
+
*TabularDataset
71
71
* Pandas dataframe
72
72
73
73
>[!Important]
@@ -77,13 +77,14 @@ Automated machine learning supports data that resides on your local desktop or i
## Fetch data for running experiment on remote compute
@@ -145,14 +145,14 @@ Some examples include:
145
145
1. Classification experiment using AUC weighted as the primary metric with a max time of 12,000 seconds per iteration, with the experiment to end after 50 iterations and 2 cross-validation folds.
146
146
147
147
```python
148
-
automl_classifier = AutoMLConfig(
148
+
automl_classifier=AutoMLConfig(
149
149
task='classification',
150
150
primary_metric='AUC_weighted',
151
151
max_time_sec=12000,
152
152
iterations=50,
153
153
blacklist_models='XGBoostClassifier',
154
-
X=X,
155
-
y=y,
154
+
training_data=train_data,
155
+
label_column_name=label,
156
156
n_cross_validations=2)
157
157
```
158
158
2. Below is an example of a regression experiment set to end after 100 iterations, with each iteration lasting up to 600 seconds with 5 validation cross folds.
Automated machine learning allows you to understand feature importance. During the training process, you can get global feature importance for the model. For classification scenarios, you can also get class-level feature importance. You must provide a validation dataset (X_valid) to get feature importance.
472
+
Automated machine learning allows you to understand feature importance. During the training process, you can get global feature importance for the model. For classification scenarios, you can also get class-level feature importance. You must provide a validation dataset (validation_data) to get feature importance.
473
473
474
474
There are two ways to generate feature importance.
475
475
@@ -479,7 +479,7 @@ There are two ways to generate feature importance.
479
479
from azureml.train.automl.automlexplainer import explain_model
0 commit comments