
Commit 861fd81

Merge pull request #101162 from ronortloff/master
Update sql-data-warehouse-workload-management-portal-monitor.md
2 parents a8bf39b + 57d6fb1 commit 861fd81

File tree

2 files changed: +30, -28 lines changed


articles/sql-data-warehouse/sql-data-warehouse-concept-resource-utilization-query-activity.md

Lines changed: 20 additions & 18 deletions
@@ -7,7 +7,7 @@ manager: craigg-msft
 ms.service: sql-data-warehouse
 ms.topic: conceptual
 ms.subservice: manage
-ms.date: 08/09/2019
+ms.date: 01/14/2020
 ms.author: kevin
 ms.reviewer: igorstan
 ms.custom: seo-lt-2019
@@ -22,23 +22,25 @@ The following metrics are available in the Azure portal for SQL Data Warehouse.
 
 | Metric Name | Description | Aggregation Type |
 | ----------------------- | ------------------------------------------------------------ | ---------------- |
-| CPU percentage | CPU utilization across all nodes for the data warehouse | Maximum |
-| Data IO percentage | IO utilization across all nodes for the data warehouse | Maximum |
-| Memory percentage | Memory utilization (SQL Server) across all nodes for the data warehouse | Maximum |
-| Successful Connections | Number of successful connections to the data warehouse | Total |
-| Failed Connections | Number of failed connections to the data warehouse | Total |
-| Blocked by Firewall | Number of logins to the data warehouse that were blocked | Total |
-| DWU limit | Service level objective of the data warehouse | Maximum |
-| DWU percentage | Maximum between CPU percentage and Data IO percentage | Maximum |
-| DWU used | DWU limit * DWU percentage | Maximum |
-| Cache hit percentage | (cache hits / cache miss) * 100, where cache hits is the sum of all columnstore segment hits in the local SSD cache and cache miss is the sum of all columnstore segment misses in the local SSD cache across all nodes | Maximum |
-| Cache used percentage | (cache used / cache capacity) * 100, where cache used is the sum of all bytes in the local SSD cache across all nodes and cache capacity is the sum of the storage capacity of the local SSD cache across all nodes | Maximum |
-| Local tempdb percentage | Local tempdb utilization across all compute nodes - values are emitted every five minutes | Maximum |
-
-> Things to consider when viewing metrics and setting alerts:
->
-> - Failed and successful connections are reported for a particular data warehouse - not for the logical server
-> - Memory percentage reflects utilization even if the data warehouse is in an idle state - it does not reflect active workload memory consumption. Use and track this metric along with others (tempdb, Gen2 cache) to make a holistic decision on whether scaling for additional cache capacity will increase workload performance to meet your requirements.
+| CPU percentage | CPU utilization across all nodes for the data warehouse | Avg, Min, Max |
+| Data IO percentage | IO utilization across all nodes for the data warehouse | Avg, Min, Max |
+| Memory percentage | Memory utilization (SQL Server) across all nodes for the data warehouse | Avg, Min, Max |
+| Active Queries | Number of active queries executing on the system | Sum |
+| Queued Queries | Number of queued queries waiting to start executing | Sum |
+| Successful Connections | Number of successful connections to the data warehouse | Sum, Count |
+| Failed Connections | Number of failed connections to the data warehouse | Sum, Count |
+| Blocked by Firewall | Number of logins to the data warehouse that were blocked | Sum, Count |
+| DWU limit | Service level objective of the data warehouse | Avg, Min, Max |
+| DWU percentage | Maximum between CPU percentage and Data IO percentage | Avg, Min, Max |
+| DWU used | DWU limit * DWU percentage | Avg, Min, Max |
+| Cache hit percentage | (cache hits / cache miss) * 100, where cache hits is the sum of all columnstore segment hits in the local SSD cache and cache miss is the sum of all columnstore segment misses in the local SSD cache across all nodes | Avg, Min, Max |
+| Cache used percentage | (cache used / cache capacity) * 100, where cache used is the sum of all bytes in the local SSD cache across all nodes and cache capacity is the sum of the storage capacity of the local SSD cache across all nodes | Avg, Min, Max |
+| Local tempdb percentage | Local tempdb utilization across all compute nodes - values are emitted every five minutes | Avg, Min, Max |
+
+Things to consider when viewing metrics and setting alerts:
+
+- Failed and successful connections are reported for a particular data warehouse - not for the logical server
+- Memory percentage reflects utilization even if the data warehouse is in an idle state - it does not reflect active workload memory consumption. Use and track this metric along with others (tempdb, Gen2 cache) to make a holistic decision on whether scaling for additional cache capacity will increase workload performance to meet your requirements.
 
 
 ## Query activity
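As a worked example of the DWU and cache arithmetic defined in the metrics table above, the sketch below computes *DWU percentage*, *DWU used*, and *Cache used percentage* from their stated formulas. The function names and the sample values are illustrative only (the portal reports these metrics directly); this is an editor-added sketch, not part of the commit.

```python
# Illustrative sketch of the metric formulas from the table above.
# Names and sample values are hypothetical; Azure Monitor emits these
# metrics directly, so this is only a worked example of the arithmetic.

def dwu_percentage(cpu_pct: float, data_io_pct: float) -> float:
    """DWU percentage = maximum between CPU percentage and Data IO percentage."""
    return max(cpu_pct, data_io_pct)

def dwu_used(dwu_limit: float, dwu_pct: float) -> float:
    """DWU used = DWU limit * DWU percentage (expressed as a fraction)."""
    return dwu_limit * dwu_pct / 100

def cache_used_percentage(cache_used_bytes: int, cache_capacity_bytes: int) -> float:
    """(cache used / cache capacity) * 100, both summed across all nodes."""
    return cache_used_bytes / cache_capacity_bytes * 100

# A hypothetical DWU limit of 1000 at 75% CPU and 40% Data IO:
pct = dwu_percentage(75, 40)   # 75
used = dwu_used(1000, pct)     # 750.0 DWU
print(pct, used)
```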

articles/sql-data-warehouse/sql-data-warehouse-workload-management-portal-monitor.md

Lines changed: 10 additions & 10 deletions
@@ -7,7 +7,7 @@ manager: craigg
 ms.service: sql-data-warehouse
 ms.topic: conceptual
 ms.subservice: workload-management
-ms.date: 01/13/2020
+ms.date: 01/14/2020
 ms.author: rortloff
 ms.reviewer: jrasnick
 ms.custom: seo-lt-2019
@@ -45,10 +45,10 @@ CREATE WORKLOAD CLASSIFIER wcCEOPriority
 WITH ( WORKLOAD_GROUP = 'wgPriority'
       ,MEMBERNAME = 'TheCEO');
 ```
-The below chart is configured as follows:
-Metric 1: *Effective min resource percent* (Avg aggregation, `blue line`)
-Metric 2: *Workload group allocation by system percent* (Avg aggregation, `purple line`)
-Filter: [Workload Group] = `wgPriority`
+The below chart is configured as follows:<br>
+Metric 1: *Effective min resource percent* (Avg aggregation, `blue line`)<br>
+Metric 2: *Workload group allocation by system percent* (Avg aggregation, `purple line`)<br>
+Filter: [Workload Group] = `wgPriority`<br>
 ![underutilized-wg.png](media/sql-data-warehouse-workload-management-portal-monitor/underutilized-wg.png)
 The chart shows that with 25% workload isolation, only 10% is being used on average. In this case, the `MIN_PERCENTAGE_RESOURCE` parameter value could be lowered to between 10 and 15, allowing other workloads on the system to consume the resources.
 
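The tuning rule in the hunk above (lower `MIN_PERCENTAGE_RESOURCE` when a group's average allocation sits well below its effective minimum) can be sketched as a small helper. The function name and the thresholds are illustrative assumptions by the editor, not part of the commit or the service:

```python
from typing import Optional

def suggest_min_resource_percent(effective_min_pct: float,
                                 avg_allocation_pct: float) -> Optional[float]:
    """If a workload group's average allocation is well below its
    effective minimum (resources isolated but unused), suggest a lower
    MIN_PERCENTAGE_RESOURCE; otherwise return None.
    The 0.5 and 1.5 factors are illustrative heuristics only."""
    if avg_allocation_pct < 0.5 * effective_min_pct:
        # Leave some headroom above the observed average usage.
        return min(effective_min_pct, avg_allocation_pct * 1.5)
    return None

# The chart's scenario: 25% isolation with ~10% average use
# suggests lowering the minimum toward the 10-15 range.
print(suggest_min_resource_percent(25, 10))  # 15.0
```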
@@ -65,11 +65,11 @@ CREATE WORKLOAD CLASSIFIER wcDataAnalyst
 WITH ( WORKLOAD_GROUP = 'wgDataAnalyst'
       ,MEMBERNAME = 'DataAnalyst');
 ```
-The below chart is configured as follows:
-Metric 1: *Effective cap resource percent* (Avg aggregation, `blue line`)
-Metric 2: *Workload group allocation by max resource percent* (Avg aggregation, `purple line`)
-Metric 3: *Workload group queued queries* (Sum aggregation, `turquoise line`)
-Filter: [Workload Group] = `wgDataAnalyst`
+The below chart is configured as follows:<br>
+Metric 1: *Effective cap resource percent* (Avg aggregation, `blue line`)<br>
+Metric 2: *Workload group allocation by max resource percent* (Avg aggregation, `purple line`)<br>
+Metric 3: *Workload group queued queries* (Sum aggregation, `turquoise line`)<br>
+Filter: [Workload Group] = `wgDataAnalyst`<br>
 ![bottle-necked-wg](media/sql-data-warehouse-workload-management-portal-monitor/bottle-necked-wg.png)
 The chart shows that with a 9% cap on resources, the workload group is 90%+ utilized (from the *Workload group allocation by max resource percent* metric). There is a steady queuing of queries, as shown by the *Workload group queued queries* metric. In this case, increasing `CAP_PERCENTAGE_RESOURCE` to a value higher than 9% allows more queries to execute concurrently. Increasing `CAP_PERCENTAGE_RESOURCE` assumes that enough resources are available and not isolated by other workload groups. Verify that the cap increased by checking the *Effective cap resource percent* metric. If more throughput is desired, also consider increasing `REQUEST_MIN_RESOURCE_GRANT_PERCENT` to a value greater than 3, which could allow queries to run faster.
 
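The reasoning in the final paragraph of the diff (raise `CAP_PERCENTAGE_RESOURCE` when a group runs near its cap while queries queue) can be sketched the same way. Again, the function name and the 90% threshold are illustrative assumptions by the editor, not values defined by the service:

```python
def should_raise_cap(allocation_vs_cap_pct: float, queued_queries: int) -> bool:
    """Return True when a workload group is running near its resource cap
    (here, >= 90% of the max allocation) while queries are queuing -
    the situation in which raising CAP_PERCENTAGE_RESOURCE may allow
    more queries to execute concurrently. The 90% threshold is an
    illustrative heuristic only."""
    return allocation_vs_cap_pct >= 90 and queued_queries > 0

# The chart's scenario: 90%+ utilized with a steady queue of queries.
print(should_raise_cap(92, 5))  # True
```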