Skip to content

Commit 404e59b

Browse files
authored
Merge pull request #112882 from harneetvirk/master
AzureML Python SDK 2020-04-27 Release Notes
2 parents fcda222 + a831aab commit 404e59b

File tree

1 file changed

+43
-16
lines changed

1 file changed

+43
-16
lines changed

articles/machine-learning/azure-machine-learning-release-notes.md

Lines changed: 43 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -17,6 +17,33 @@ In this article, learn about Azure Machine Learning releases. For the full SDK
1717

1818
See [the list of known issues](resource-known-issues.md) to learn about known bugs and workarounds.
1919

20+
## 2020-04-27
21+
22+
### Azure Machine Learning SDK for Python v1.4.0
23+
24+
+ **New features**
25+
+ AmlCompute clusters now support setting up a managed identity on the cluster at the time of provisioning. Just specify whether you would like to use a system-assigned identity or a user-assigned identity, and pass an identityId in the case of the latter. You can then setup permissiosn to access various resources like Storage or ACR in a way that the identity of the compute gets used to securely access the data, instead of a token based approach that AmlCompute employs today. Check out our SDK reference for more information on the parameters.
26+
27+
28+
+ **Breaking changes**
29+
+ AmlCompute clusters supported a Preview feature around run-based creation, that we are planning on deprecating in two weeks. You can continue to create persistent compute targets as always by using the Amlcompute class, but the specific approach of specifying the identifier "amlcompute" as the compute target in run config will not be supported in the near future.
30+
31+
+ **Bug fixes and improvements**
32+
+ **azureml-automl-runtime**
33+
+ Enable support for unhashable type when calculating number of unique values in a column.
34+
+ **azureml-core**
35+
+ Improved stability when reading from Azure Blob Storage using a TabularDataset.
36+
+ Improved documentation for the `grant_workspace_msi` parameter for `Datastore.register_azure_blob_store`.
37+
+ Fixed bug with `datastore.upload` where if the `src_dir` argument ended with a `/` or `\`, we will fail to upload.
38+
+ Added actionable error message when trying to upload to an Azure Blob Storage datastore that does not have an access key or SAS token.
39+
+ **azureml-interpret**
40+
+ Added upper bound to file size for the visualization data on uploaded explanations.
41+
+ **azureml-train-automl-client**
42+
+ Explicitly checking for label_column_name & weight_column_name parameters for AutoMLConfig to be of type string.
43+
+ **azureml-contrib-pipeline-steps**
44+
+ ParallelRunStep now supports dataset as pipeline parameter. User can construct pipeline with sample dataset and can change input dataset of same type (file or tabular) for new pipeline run.
45+
46+
2047
## 2020-04-13
2148

2249
### Azure Machine Learning SDK for Python v1.3.0
@@ -40,13 +67,12 @@ See [the list of known issues](resource-known-issues.md) to learn about known bu
4067
+ Added Environment.clone(new_name) API to create a copy of Environment object
4168
+ Environment.docker.base_dockerfile accepts filepath. If able to resolve a file, the content will be read into base_dockerfile environment property
4269
+ Automatically reset mutually exclusive values for base_image and base_dockerfile when user manually sets a value in Environment.docker
43-
+ Dataset: fixed dataset download failure if data path containing unicode characters
44-
+ Dataset: improved dataset mount caching mechanism to respect the minimum disk space requirement in Azure Machine Learning Compute, which avoids making the node unusable and causing the job to be canceled
4570
+ Added user_managed flag in RSection which indicates whether the environment is managed by user or by AzureML.
46-
+ Dataset: we add an index for the timeseries column when you access a timeseries dataset as a pandas dataframes, which is used to speed up access to timeseries based data access. Previously, the index was given the same name as the timestamp column, confusing users about which is the actual timestamp column and which is the index. We now don't give any specific name to the index since it should not be used as a column.
47-
+ **azureml-dataprep**
48-
+ Fixed dataset authentication issue in sovereign cloud
49-
+ Fixed `Dataset.to_spark_dataframe` failure for datasets created from Azure PostgreSQL datastores
71+
+ Dataset: Fixed dataset download failure if data path containing unicode characters.
72+
+ Dataset: Improved dataset mount caching mechanism to respect the minimum disk space requirement in Azure Machine Learning Compute, which avoids making the node unusable and causing the job to be canceled.
73+
+ Dataset: We add an index for the timeseries column when you access a timeseries dataset as a pandas dataframes, which is used to speed up access to timeseries based data access. Previously, the index was given the same name as the timestamp column, confusing users about which is the actual timestamp column and which is the index. We now don't give any specific name to the index since it should not be used as a column.
74+
+ Dataset: Fixed dataset authentication issue in sovereign cloud.
75+
+ Dataset: Fixed `Dataset.to_spark_dataframe` failure for datasets created from Azure PostgreSQL datastores.
5076
+ **azureml-interpret**
5177
+ Added global scores to visualization if local importance values are sparse
5278
+ Updated azureml-interpret to use interpret-community 0.9.*
@@ -65,6 +91,7 @@ See [the list of known issues](resource-known-issues.md) to learn about known bu
6591
+ **azureml-opendatasets**
6692
+ Added additional telemetry for service monitor.
6793
+ Enable frontdoor for blob to increase stability
94+
6895
## 2020-03-23
6996

7097
### Azure Machine Learning SDK for Python v1.2.0
@@ -85,7 +112,7 @@ See [the list of known issues](resource-known-issues.md) to learn about known bu
85112
+ Added better error messages if time column has incorrect format.
86113
+ Enabled customized imputation with constant value for both X and y data forecasting tasks.
87114
+ **azureml-core**
88-
+ Add support for loading ServicePrincipal from environment variables: AZUREML_SERVICE_PRINCIPAL_ID, AZUREML_SERVICE_PRINCIPAL_TENANT_ID, and AZUREML_SERVICE_PRINCIPAL_PASSWORD
115+
+ Added support for loading ServicePrincipal from environment variables: AZUREML_SERVICE_PRINCIPAL_ID, AZUREML_SERVICE_PRINCIPAL_TENANT_ID, and AZUREML_SERVICE_PRINCIPAL_PASSWORD
89116
+ Introduced a new parameter `support_multi_line` to `Dataset.Tabular.from_delimited_files`: By default (`support_multi_line=False`), all line breaks, including those in quoted field values, will be interpreted as a record break. Reading data this way is faster and more optimized for parallel execution on multiple CPU cores. However, it may result in silently producing more records with misaligned field values. This should be set to `True` when the delimited files are known to contain quoted line breaks.
90117
+ Added the ability to register ADLS Gen2 in the Azure Machine Learning CLI
91118
+ Renamed parameter 'fine_grain_timestamp' to 'timestamp' and parameter 'coarse_grain_timestamp' to 'partition_timestamp' for the with_timestamp_columns() method in TabularDataset to better reflect the usage of the parameters.
@@ -145,19 +172,19 @@ See [the list of known issues](resource-known-issues.md) to learn about known bu
145172
+ **azureml-core**
146173
+ Deployed AzureML Webservices will now default to `INFO` logging. This can be controlled by setting the `AZUREML_LOG_LEVEL` environment variable in the deployed service.
147174
+ Python sdk uses discovery service to use 'api' endpoint instead of 'pipelines'.
148-
+ Swap to the new routes in all SDK calls
149-
+ Changes routing of calls to the ModelManagementService to a new unified structure
175+
+ Swap to the new routes in all SDK calls.
176+
+ Changed routing of calls to the ModelManagementService to a new unified structure.
150177
+ Made workspace update method publicly available.
151-
+ Added image_build_compute parameter in workspace update method to allow user updating the compute for image build
152-
+ Added deprecation messages to the old profiling workflow. Fixed profiling cpu and memory limits
153-
+ Added RSection as part of Environment to run R jobs
154-
+ Added validation to `Dataset.mount` to raise error when source of the dataset is not accessible or does not contain any data.
155-
+ Added `--grant-workspace-msi-access` as an additional parameter for the Datastore CLI for registering Azure Blob Container which will allow you to register Blob Container that is behind a VNet
178+
+ Added image_build_compute parameter in workspace update method to allow user updating the compute for image build.
179+
+ Added deprecation messages to the old profiling workflow. Fixed profiling cpu and memory limits.
180+
+ Added RSection as part of Environment to run R jobs.
181+
+ Added validation to `Dataset.mount` to raise error when source of the dataset is not accessible or does not contain any data.
182+
+ Added `--grant-workspace-msi-access` as an additional parameter for the Datastore CLI for registering Azure Blob Container which will allow you to register Blob Container that is behind a VNet.
156183
+ Single instance profiling was fixed to produce a recommendation and was made available in core sdk.
157-
+ Fixed the issue in aks.py _deploy
184+
+ Fixed the issue in aks.py _deploy.
158185
+ Validates the integrity of models being uploaded to avoid silent storage failures.
159186
+ User may now specify a value for the auth key when regenerating keys for webservices.
160-
+ Fixed bug where uppercase letters cannot be used as dataset's input name
187+
+ Fixed bug where uppercase letters cannot be used as dataset's input name.
161188
+ **azureml-defaults**
162189
+ `azureml-dataprep` will now be installed as part of `azureml-defaults`. It is no longer required to install dataprep[fuse] manually on compute targets to mount datasets.
163190
+ **azureml-interpret**

0 commit comments

Comments
 (0)