Commit 3a134b9

Update how-to-sqldb-to-cosmosdb.md
1 parent 8a99638 commit 3a134b9

1 file changed: 26 additions, 3 deletions


articles/data-factory/how-to-sqldb-to-cosmosdb.md

````diff
@@ -1,6 +1,6 @@
 ---
 title: Migrate Azure SQL Database tables to Azure CosmosDB with Azure Data Factory
-description: Take an existing normalized database schema from Azure SQL Database and migrate to CosmosDB denormalized collection with Azure Data Factory.
+description: Take an existing normalized database schema from Azure SQL Database and migrate to an Azure CosmosDB denormalized container with Azure Data Factory.
 services: data-factory
 author: kromerm
@@ -12,9 +12,32 @@ ms.date: 04/29/2020
 ms.author: makromer
 ---

-# Migrate normalized database schema from Azure SQL Database to Azure CosmosDB denormalized collection
+# Migrate normalized database schema from Azure SQL Database to Azure CosmosDB denormalized container

-By using mapping data flows in Microsoft Azure Data Factory, you can transform data from fixed-width text files. In the following task, we'll define a dataset for a text file without a delimiter and then set up substring splits based on ordinal position.
+This guide explains how to take an existing normalized database schema in Azure SQL Database and convert it into a denormalized schema in Azure CosmosDB. SQL schemas are typically modeled in third normal form, which provides high data integrity and little duplicated data, with queries joining entities across tables. CosmosDB is optimized for fast transactions and queries within a collection or container, using denormalized schemas in which related data is self-contained inside a single document.
+
+Using Azure Data Factory, we'll build a pipeline that uses a single Mapping Data Flow to read from two normalized Azure SQL Database tables related by primary and foreign keys. ADF will join those tables into a single stream with the data flow Spark engine, collect the joined rows into arrays, and produce individual cleansed documents to insert into a new Azure CosmosDB container.
+
+This guide builds a new container on the fly called "Orders" from the `SalesOrderHeader` and `SalesOrderDetail` tables in the standard SQL Server AdventureWorks sample database. Those tables represent sales transactions joined by `SalesOrderID`. Each unique detail record has its own primary key, `SalesOrderDetailID`, and the relationship between header and detail is `1:M`. We'll join on `SalesOrderID` in ADF and then roll each related detail record into an array called "detail".
+
+The representative SQL query for this guide is:
+
+```sql
+SELECT
+  o.SalesOrderID,
+  o.OrderDate,
+  o.Status,
+  o.ShipDate,
+  o.SalesOrderNumber,
+  o.ShipMethod,
+  o.SubTotal,
+  (SELECT SalesOrderDetailID, UnitPrice, OrderQty FROM SalesLT.SalesOrderDetail od WHERE od.SalesOrderID = o.SalesOrderID FOR JSON AUTO) AS OrderDetails
+FROM SalesLT.SalesOrderHeader o;
+```
+
+The resulting CosmosDB container embeds the result of the inner query in a single document and looks like this:
+
+![Collection](media/data-flow/cosmosb3.png)

 ## Create a pipeline

````
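The 1:M roll-up the added text describes can be sketched outside ADF in a few lines of Python. This is a minimal illustration of the same shape transformation, not ADF's implementation; the sample rows are hypothetical stand-ins for AdventureWorks data.

```python
# Sketch: roll 1:M SalesOrderDetail rows into a "detail" array on each
# SalesOrderHeader document, keyed by SalesOrderID -- the same shape the
# mapping data flow produces for the "Orders" container.
# Sample rows below are hypothetical, not real AdventureWorks data.
from collections import defaultdict

headers = [
    {"SalesOrderID": 71774, "SalesOrderNumber": "SO71774", "SubTotal": 880.35},
    {"SalesOrderID": 71776, "SalesOrderNumber": "SO71776", "SubTotal": 78.81},
]
details = [
    {"SalesOrderID": 71774, "SalesOrderDetailID": 110562, "UnitPrice": 356.90, "OrderQty": 1},
    {"SalesOrderID": 71774, "SalesOrderDetailID": 110563, "UnitPrice": 356.90, "OrderQty": 1},
    {"SalesOrderID": 71776, "SalesOrderDetailID": 110567, "UnitPrice": 63.90, "OrderQty": 1},
]

# Group detail rows into arrays by the join key, dropping the now-redundant
# foreign key from each embedded detail record.
details_by_order = defaultdict(list)
for d in details:
    details_by_order[d["SalesOrderID"]].append(
        {k: v for k, v in d.items() if k != "SalesOrderID"})

# One self-contained document per order, ready for insert into "Orders".
documents = [{**h, "detail": details_by_order[h["SalesOrderID"]]} for h in headers]
```

Each resulting document is self-contained: querying one order no longer requires a join, which is the trade-off the denormalized container is designed around.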
