Skip to content

Commit 16faf10

Browse files
committed
updates
1 parent b2b0ea1 commit 16faf10

File tree

3 files changed

+21
-13
lines changed

3 files changed

+21
-13
lines changed

articles/synapse-analytics/breadcrumb/toc.yml

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -46,4 +46,12 @@
4646
tocHref: /azure/active-directory
4747
topicHref: /azure/synapse-analytics/index
4848

49+
- name: Azure
50+
tocHref: /azure/
51+
topicHref: /azure/index
52+
items:
53+
- name: Synapse Analytics
54+
tocHref: /azure/cosmos-db
55+
topicHref: /azure/synapse-analytics/index
56+
4957

articles/synapse-analytics/sql-data-warehouse/design-guidance-for-replicated-tables.md

Lines changed: 12 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
22
title: Design guidance for replicated tables
3-
description: Recommendations for designing replicated tables in Synapse SQL
3+
description: Recommendations for designing replicated tables in Synapse SQL pool
44
services: synapse-analytics
55
author: XiaoyuMSFT
66
manager: craigg
@@ -13,27 +13,27 @@ ms.reviewer: igorstan
1313
ms.custom: seo-lt-2019, azure-synapse
1414
---
1515

16-
# Design guidance for using replicated tables in SQL Analytics
16+
# Design guidance for using replicated tables in Synapse SQL pool
1717

18-
This article gives recommendations for designing replicated tables in your SQL Analytics schema. Use these recommendations to improve query performance by reducing data movement and query complexity.
18+
This article gives recommendations for designing replicated tables in your Synapse SQL pool schema. Use these recommendations to improve query performance by reducing data movement and query complexity.
1919

2020
> [!VIDEO https://www.youtube.com/embed/1VS_F37GI9U]
2121
2222
## Prerequisites
2323

24-
This article assumes you are familiar with data distribution and data movement concepts in SQL Analytics.  For more information, see the [architecture](massively-parallel-processing-mpp-architecture.md) article.
24+
This article assumes you are familiar with data distribution and data movement concepts in SQL pool.  For more information, see the [architecture](massively-parallel-processing-mpp-architecture.md) article.
2525

2626
As part of table design, understand as much as possible about your data and how the data is queried.  For example, consider these questions:
2727

2828
- How large is the table?
2929
- How often is the table refreshed?
30-
- Do I have fact and dimension tables in a SQL Analytics database?
30+
- Do I have fact and dimension tables in a SQL pool database?
3131

3232
## What is a replicated table?
3333

3434
A replicated table has a full copy of the table accessible on each Compute node. Replicating a table removes the need to transfer data among Compute nodes before a join or aggregation. Since the table has multiple copies, replicated tables work best when the table size is less than 2 GB compressed. 2 GB is not a hard limit. If the data is static and does not change, you can replicate larger tables.
3535

36-
The following diagram shows a replicated table that is accessible on each Compute node. In SQL Analytics, the replicated table is fully copied to a distribution database on each Compute node.
36+
The following diagram shows a replicated table that is accessible on each Compute node. In SQL pool, the replicated table is fully copied to a distribution database on each Compute node.
3737

3838
![Replicated table](./media/design-guidance-for-replicated-tables/replicated-table.png "Replicated table")
3939

@@ -47,8 +47,8 @@ Consider using a replicated table when:
4747
Replicated tables may not yield the best query performance when:
4848

4949
- The table has frequent insert, update, and delete operations. The data manipulation language (DML) operations require a rebuild of the replicated table. Rebuilding frequently can cause slower performance.
50-
- The SQL Analytics database is scaled frequently. Scaling a SQL Analytics database changes the number of Compute nodes, which incurs rebuilding the replicated table.
51-
- The table has a large number of columns, but data operations typically access only a small number of columns. In this scenario, instead of replicating the entire table, it might be more effective to distribute the table, and then create an index on the frequently accessed columns. When a query requires data movement, SQL Analytics only moves data for the requested columns.
50+
- The SQL pool database is scaled frequently. Scaling a SQL pool database changes the number of Compute nodes, which incurs rebuilding the replicated table.
51+
- The table has a large number of columns, but data operations typically access only a small number of columns. In this scenario, instead of replicating the entire table, it might be more effective to distribute the table, and then create an index on the frequently accessed columns. When a query requires data movement, SQL pool only moves data for the requested columns.
5252

5353
## Use replicated tables with simple query predicates
5454

@@ -119,7 +119,7 @@ We re-created `DimDate` and `DimSalesTerritory` as replicated tables, and ran th
119119

120120
## Performance considerations for modifying replicated tables
121121

122-
SQL Analytics implements a replicated table by maintaining a master version of the table. It copies the master version to the first distribution database on each Compute node. When there is a change, SQL Analytics first updates the master version, then it rebuilds the tables on each Compute node. A rebuild of a replicated table includes copying the table to each Compute node and then building the indexes. For example, a replicated table on a DW2000c has 5 copies of the data. A master copy and a full copy on each Compute node. All data is stored in distribution databases. SQL Analytics uses this model to support faster data modification statements and flexible scaling operations.
122+
SQL pool implements a replicated table by maintaining a master version of the table. It copies the master version to the first distribution database on each Compute node. When there is a change, the master version is updated first, then the tables on each Compute node are rebuilt. A rebuild of a replicated table includes copying the table to each Compute node and then building the indexes. For example, a replicated table on a DW2000c has 5 copies of the data. A master copy and a full copy on each Compute node. All data is stored in distribution databases. SQL pool uses this model to support faster data modification statements and flexible scaling operations.
123123

124124
Rebuilds are required after:
125125

@@ -136,7 +136,7 @@ The rebuild does not happen immediately after data is modified. Instead, the reb
136136

137137
### Use indexes conservatively
138138

139-
Standard indexing practices apply to replicated tables. SQL Analytics rebuilds each replicated table index as part of the rebuild. Only use indexes when the performance gain outweighs the cost of rebuilding the indexes.
139+
Standard indexing practices apply to replicated tables. SQL pool rebuilds each replicated table index as part of the rebuild. Only use indexes when the performance gain outweighs the cost of rebuilding the indexes.
140140

141141
### Batch data load
142142

@@ -188,7 +188,7 @@ SELECT TOP 1 * FROM [ReplicatedTable]
188188

189189
To create a replicated table, use one of these statements:
190190

191-
- [CREATE TABLE (SQL Analytics)](/sql/t-sql/statements/create-table-azure-sql-data-warehouse?toc=/azure/synapse-analytics/sql-data-warehouse/toc.json&bc=/azure/synapse-analytics/sql-data-warehouse/breadcrumb/toc.json&view=azure-sqldw-latest)
192-
- [CREATE TABLE AS SELECT (SQL Analytics)](/sql/t-sql/statements/create-table-as-select-azure-sql-data-warehouse?toc=/azure/synapse-analytics/sql-data-warehouse/toc.json&bc=/azure/synapse-analytics/sql-data-warehouse/breadcrumb/toc.json&view=azure-sqldw-latest)
191+
- [CREATE TABLE (SQL pool)](/sql/t-sql/statements/create-table-azure-sql-data-warehouse?toc=/azure/synapse-analytics/sql-data-warehouse/toc.json&bc=/azure/synapse-analytics/sql-data-warehouse/breadcrumb/toc.json&view=azure-sqldw-latest)
192+
- [CREATE TABLE AS SELECT (SQL pool)](/sql/t-sql/statements/create-table-as-select-azure-sql-data-warehouse?toc=/azure/synapse-analytics/sql-data-warehouse/toc.json&bc=/azure/synapse-analytics/sql-data-warehouse/breadcrumb/toc.json&view=azure-sqldw-latest)
193193

194194
For an overview of distributed tables, see [distributed tables](sql-data-warehouse-tables-distribute.md).

articles/synapse-analytics/sql-data-warehouse/sql-data-warehouse-manage-compute-rest-api.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ ms.reviewer: igorstan
1313
ms.custom: seo-lt-2019, azure-synapse
1414
---
1515

16-
# REST APIs for Azure SQL Data Warehouse
16+
# REST APIs for Azure Synapse Analytics data warehouse
1717

1818
REST APIs for managing compute in Azure Synapse Analytics data warehouse.
1919

0 commit comments

Comments
 (0)