Bug fixes and requested edits.

fbsolo-ms1 · fbsolo-ms1 · commit 7e26aa12c1fb · 2023-04-06T15:38:41.000-07:00
diff --git a/articles/machine-learning/how-to-connection.md b/articles/machine-learning/how-to-connection.md
@@ -33,12 +33,12 @@ In this article, learn how to connect to data sources located outside of Azure,
 - An Azure Machine Learning workspace.
 
 > [!NOTE]
-> An Azure Machine Learning connection stores the credentials passed during connection creation in the Workspace Azure Key Vault. A connection references the credentials from that location for further use. The YAML cna pass the credentials. A CLI command or SDK can override them. We recommend that you **avoid** credential storage in YAML files.
+> An Azure Machine Learning connection securely stores the credentials passed during connection creation in the Workspace Azure Key Vault. A connection references the credentials from the key vault storage location for further use. You won't need to directly deal with the credentials after they are stored in the key vault. You have the option to store the credentials in the YAML file. A CLI command or SDK can override them. We recommend that you **avoid** credential storage in a YAML file, because a security breach could lead to a credential leak.
 
 ## Create a Snowflake DB connection
 
 # [CLI: Username/password](#tab/cli-username-password)
-This YAML script creates a Snowflake DB connection. Be sure to update the appropriate values:
+This YAML file creates a Snowflake DB connection. Be sure to update the appropriate values:
 
 ```yaml
 # my_snowflakedb_connection.yaml
@@ -56,7 +56,7 @@ credentials:
 
 Create the Azure Machine Learning datastore in the CLI:
 
-### Option 1: Use the username and password in a YAML script
+### Option 1: Use the username and password in YAML file
 
 ```azurecli
 az ml connection create --file my_snowflakedb_connection.yaml
@@ -70,7 +70,7 @@ az ml connection create --file my_snowflakedb_connection.yaml --set credentials.
 
 # [Python SDK: username/ password](#tab/sdk-username-password)
 
-### Option 1: Load the connection in a YAML script
+### Option 1: Load connection from YAML file
 
 ```python
 from azure.ai.ml import MLClient, load_workspace_connection
@@ -83,7 +83,6 @@ wps_connection.credentials.password="XXXXXXXX"
 ml_client.connections.create_or_update(workspace_connection=wps_connection)
 
 ```
----
 
 ### Option 2: Use WorkspaceConnection() in a Python script
 
@@ -104,6 +103,8 @@ ml_client.connections.create_or_update(workspace_connection=wps_connection)
 
 ```
 
+---
+
 ## Create an Azure SQL DB connection
 
 # [CLI: Username/password](#tab/cli-sql-username-password)
@@ -125,23 +126,23 @@ credentials:
     password: <password> # add the sql database password here or leave this blank and type in CLI command line
 ```
 
-Create the Azure Machine Learning datastore in the CLI:
+Create the Azure Machine Learning connection in the CLI:
 
-### Option 1: Use the username/ password in a YAML script
+### Option 1: Use the username / password from YAML file
 
 ```azurecli
 az ml connection create --file my_sqldb_connection.yaml
 ```
 
-### Option 2: Override the username and password in the YAML file
+### Option 2: Override the username and password in YAML file
 
 ```azurecli
 az ml connection create --file my_sqldb_connection.yaml --set credentials.username="XXXXX" credentials.password="XXXXX" 
 ```
 
 # [Python SDK: username/ password](#tab/sdk-sql-username-password)
 
-### Option 1: Load the connection in a YAML script
+### Option 1: Load connection from YAML file
 
 ```python
 from azure.ai.ml import MLClient, load_workspace_connection
@@ -154,7 +155,6 @@ wps_connection.credentials.password="XXXXXxXXX"
 ml_client.connections.create_or_update(workspace_connection=wps_connection)
 
 ```
----
 
 ### Option 2: Using WorkspaceConnection()
 
@@ -175,6 +175,8 @@ ml_client.connections.create_or_update(workspace_connection=wps_connection)
 
 ```
 
+---
+
 ## Create Amazon S3 connection
 
 # [CLI: Access key](#tab/cli-s3-access-key)
@@ -195,15 +197,15 @@ credentials:
     secret_access_key: XxXxXxXXXXXXXxXxXxxXxxXXXXXXXXxXxxXXxXXXXXXXxxxXxXXxXXXXXxXXxXXXxXxXxxxXXxXXxXXXXXxXxxXX # add access key secret
 ```
 
-Create the Azure Machine Learning datastore in the CLI:
+Create the Azure Machine Learning connection in the CLI:
 
 ```azurecli
 az ml connection create --file my_s3_connection.yaml
 ```
 
 # [Python SDK: Access key](#tab/sdk-s3-access-key)
 
-### Option 1: Load the connection in a YAML script
+### Option 1: Load connection from YAML file
 
 ```python
 from azure.ai.ml import MLClient, load_workspace_connection
diff --git a/articles/machine-learning/how-to-import-data-assets.md b/articles/machine-learning/how-to-import-data-assets.md
@@ -21,11 +21,11 @@ ms.custom: data4ml
 
 In this article, learn how to import data into the Azure Machine Learning platform from external sources. A successful import automatically creates and registers an Azure Machine Learning data asset with the name provided during the import. An Azure Machine Learning data asset resembles a web browser bookmark (favorites). You don't need to remember long storage paths (URIs) that point to your most-frequently used data. Instead, you can create a data asset, and then access that asset with a friendly name.
 
-A data import creates a *cache* of the source data, along with metadata, for faster, reliable data access in Azure Machine Learning training jobs. The data import avoids network and connection constraints. The cached data is versioned to support reproducibility, and to provide data lineage, even for data imported from SQL Server sources. A data import uses ADF (Azure Data Factory pipelines) behind the scenes, and users can avoid ADF interactions as a result. To optimize data transfer parallelization, Azure Machine Learning handles ADF compute resource provisioning and tear-down.
+A data import creates a cache of the source data, along with metadata, for faster and reliable data access in Azure Machine Learning training jobs. The data cache avoids network and connection constraints. The cached data is versioned to support reproducibility (which provides versioning capabilities for data imported from SQL Server sources). Additionally, the cached data provides data lineage for auditability. A data import uses ADF (Azure Data Factory pipelines) behind the scenes, which means that users can avoid complex interactions with ADF. Behind the scenes, Azure Machine Learning also handles management of ADF compute resource pool size, compute resource provisioning, and tear-down to optimize data transfer by determining proper parallelization.
 
-The transferred data is partitioned and securely stored in Azure storage, in parquet format. ADF compute and storage costs only involve the time that the data cached, because the cache is a copy of the data hosted in Azure storage. ADF compute facilitated the data transfer.
+The transferred data is partitioned and securely stored as parquet files in Azure storage. This enables faster processing during training. ADF compute costs only involve the time used for data transfers. Storage costs only involve the time needed to cache the data, because cached data is a copy of the data imported from an external source. That external source is hosted in Azure storage.
 
-The cached parquet-format data is readily available for Azure Machine Learning training job consumption, in a fast and efficient manner. This increases training run speeds, and it helps protect against connection timeouts for large data set training. It reduces recurring training compute costs, in comparison to direct connections to external source data while training.
+The caching feature involves upfront compute and storage costs. However, it pays for itself, and can save money, because it reduces recurring training compute costs compared to direct connections to external source data during training. It caches data as parquet files, which makes job training faster and more reliable against connection timeouts for larger data sets. This leads to fewer reruns, and fewer training failures.
 
 You can now import data from Snowflake, Amazon S3 and Azure SQL.
 
@@ -43,7 +43,7 @@ To create and work with data assets, you need:
 
 ## Importing from external database sources / import from external sources to create a meltable data asset
 
->__NOTE:__ The external databases can have Snowflake, Azure SQL, etc. formats.
+>NOTE: The external databases can have Snowflake, Azure SQL, etc. formats.
 
 The following code samples can import data from external databases. The `connection` that handles the import action determines the external database data source metadata. In this sample, the code imports data from a Snowflake resource. The connection points to a Snowflake source. With a little modification, the connection can point to an Azure SQL database source and an Azure SQL database source. The imported asset `type` from an external database source is `mltable`.
 
@@ -160,7 +160,7 @@ ml_client.data.import_data(data_import=data_import)
 
 ## Check the import status of external data sources
 
-The data import action is an asynchronous action. It can take a long time. After submission of an import data action via the CLI or SDK, the Azure Machine Learning service might need several minutes to connect to the external data source. Then the service would start the data import and handle data caching and registration. The time required for a data import also depends on the size of the source data set.
+The data import action is an asynchronous action. It can take a long time. After submission of an import data action via the CLI or SDK, the Azure Machine Learning service might need several minutes to connect to the external data source. Then the service would start the data import and handle data caching and registration. The time needed for a data import also depends on the size of the source data set.
 
 The next example returns the status of the submitted data import activity. The command or method uses the "data asset" name as the input to determine the status of the data materialization.
 
@@ -183,6 +183,8 @@ ml_client.data.show_materialization_status(name="<name>")
 
 ```
 
+---
+
 ## Next steps
 
 - [Read data in a job](how-to-read-write-data-v2.md#read-data-in-a-job)
diff --git a/articles/machine-learning/toc.yml b/articles/machine-learning/toc.yml
@@ -630,6 +630,14 @@
         - name: Data administration and authentication
           displayName: Data administration and authentication
           href: how-to-administrate-data-authentication.md
+        - name: Access data
+          items:
+            - name: Use Connections
+              displayName: Use Connections
+              href: how-to-connection.md
+            - name: Import Data
+              displayName: Import Data
+              href: how-to-import-data-assets.md
         # v1
         - name: Access data
           items: